# Tensorfuse ## Docs - [Blog](https://tensorfuse.io/docs/blogs/blog.md) - [Handling Unhealthy Nodes in EKS](https://tensorfuse.io/docs/blogs/handling_unhealthy_nodes_in_eks.md) - [Lazy loading isn’t the magic pill to fix AI Inference](https://tensorfuse.io/docs/blogs/lazy_loading_performance_degradation.md) - [Understanding Multi GPU Communication and Nvidia NCCL for finetuning models](https://tensorfuse.io/docs/blogs/multi_gpu_communication_while_training.md) - [Reducing GPU Cold Start Time when using vLLM](https://tensorfuse.io/docs/blogs/reducing_gpu_cold_start.md) - [Small Language Models are the Future of Agentic AI](https://tensorfuse.io/docs/blogs/small_language_model.md) - [Introducing Fastpull: Start AI containers in seconds](https://tensorfuse.io/docs/blogs/snapshotter_graphs.md): Fastpull uses lazy-loading snapshotters like SOCI and Nydus to start large AI/ML container images up to 10x faster, reducing cold start times from minutes to seconds. - [CloudFormation won't fix your life but it can fix your infra](https://tensorfuse.io/docs/blogs/understanding_cfn.md): A look into how efficiently managing your cloud infrastructure will definitely make your life easier, even though it may not fix it - [Adding Other Members](https://tensorfuse.io/docs/concepts/adding_members.md): Add members to your team and give them access to your cluster - [Build](https://tensorfuse.io/docs/concepts/architecture/build.md) - [Configure](https://tensorfuse.io/docs/concepts/architecture/configure.md): Architecture of resources provisioned during configure - [Jobs](https://tensorfuse.io/docs/concepts/architecture/jobs.md) - [Serve](https://tensorfuse.io/docs/concepts/architecture/serve.md) - [Deployment Configuration](https://tensorfuse.io/docs/concepts/configuration.md): Configure your deployments using a deployment.yaml file for reproducible, version-controlled infrastructure. - [Custom domains with TLS](https://tensorfuse.io/docs/concepts/custom_domains_with_tls.md): Add custom domains to your ML endpoints with HTTPS - [Deployments](https://tensorfuse.io/docs/concepts/deployments.md): Learn how to deploy your containerized applications as serverless, auto-scaling API endpoints on Tensorfuse. - [Dev Containers](https://tensorfuse.io/docs/concepts/devcontainers.md): Hot-reloading GPU-enabled containers for seamless testing and development - [Environments](https://tensorfuse.io/docs/concepts/environments.md): Environments help you isolate your resources and manage your deployments. - [Finetune Llama 3 70B on your AWS account](https://tensorfuse.io/docs/concepts/finetuning.md): Finetune LoRA adapters for popular models using axolotl styled declarative configs - [Getting Started](https://tensorfuse.io/docs/concepts/getting_started_tensorkube.md): Deploy serverless GPU applications on your AWS account - [Using Tensorkube with Github Actions](https://tensorfuse.io/docs/concepts/github_actions.md): How to create a tensorkube deployment with github actions - [Tensorfuse](https://tensorfuse.io/docs/concepts/introduction.md): AI inference and finetuning on your AWS account - [Job Queues](https://tensorfuse.io/docs/concepts/job_queues.md): Deploy your jobs and queue them programmatically with different parameters - [Handling Node Failures](https://tensorfuse.io/docs/concepts/node_failures.md): Understanding how Tensorfuse deals with node failures - [Debugging Tensorfuse Deployments and Jobs with logs](https://tensorfuse.io/docs/concepts/observability/debugging_with_logs.md): Debug your applications by querying past logs - [Secrets](https://tensorfuse.io/docs/concepts/secrets.md): Use secrets in your deployments for sensitive information - [Troubleshooting](https://tensorfuse.io/docs/concepts/troubleshooting.md): Guidelines and best practices for deploying applications on Tensorkube - [Volumes](https://tensorfuse.io/docs/concepts/volumes.md): Use volumes in your tensorkube deployments - [Sub-accounts](https://tensorfuse.io/docs/enterprise/sub_account_setup.md): Keep Tensorfuse runtime isolated and control Billing - [Run SAM2 on your own AWS on Serverless GPUs](https://tensorfuse.io/docs/guides/SAM2.md): Deploy serverless GPU applications on your AWS account - [Install AWS-CLI on Mac OS](https://tensorfuse.io/docs/guides/aws_cli.md): Deploy serverless GPU applications on your AWS account - [Finetuning using Axolotl on your AWS Account](https://tensorfuse.io/docs/guides/axolotl_finetuning_guide.md): Complete guide to running flexible Axolotl finetuning jobs on TensorFuse with support for multiple dataset formats and configurable parameters - [Cheatsheet for Tensorkube users](https://tensorfuse.io/docs/guides/cheatsheet.md): Deploy serverless GPU applications on your AWS account - [Deploy Stable Diffusion XL model using ComfyUI on your AWS](https://tensorfuse.io/docs/guides/comfyui_stable_diffusion_xl.md): Deploy serverless GPU applications on your AWS account - [Deploy Deepseek R1 671B on Serverless GPUs](https://tensorfuse.io/docs/guides/deepseek_r1.md): Deploy Deepseek R1 671B param model using Tensorfuse - [Finetune Llama 3 70B on your AWS account](https://tensorfuse.io/docs/guides/finetuning_llama_70b.md): Finetune LoRA adapters for popular models using axolotl styled declarative configs - [Deploy GGUF quants of Deepseek R1 671B on Serverless GPUs](https://tensorfuse.io/docs/guides/integrations/llama_cpp.md): Deploy GGUF quants of Deepseek R1 671B param model using Tensorfuse - [Deploy Jina Embeddings model on your AWS](https://tensorfuse.io/docs/guides/jina_embeddings.md): Deploy serverless GPU applications on your AWS account - [Deploy Llama-3.3-70B-Instruct on Serverless GPUs](https://tensorfuse.io/docs/guides/llama_guide.md): Deploy serverless GPU applications on your AWS account - [Deploy Mochi 1 Preview Video Model using ComfyUI on your AWS](https://tensorfuse.io/docs/guides/mochi_comfy.md): Deploy serverless GPU applications on your AWS account - [Deploying Sesame-CSM-1B on Serverless GPUs](https://tensorfuse.io/docs/guides/modality/audio/sesame-csm-1b.md): Deploy a serverless Sesame CSM 1B model on your AWS account - [Deploying FLUX.1-dev on Serverless GPUs](https://tensorfuse.io/docs/guides/modality/image/flux_dev.md): Deploy serverless GPU applications on your AWS account - [Deploy Llama 4 Models on your AWS account](https://tensorfuse.io/docs/guides/modality/text/llama_4.md): Deploy Llama 4 Scout and Maverick from Meta models using Tensorfuse - [Deploy OpenAI OSS Models in your AWS account](https://tensorfuse.io/docs/guides/modality/text/openai_oss.md): Deploy GPT-OSS models from OpenAI in your AWS using Tensorfuse - [Deploying Pixtral-12B-Quantised-4-Bit on Serverless GPUs](https://tensorfuse.io/docs/guides/pixtral_12b_quantized_4bit.md): Deploy serverless GPU applications on your AWS account - [Deploy Qwen QwQ 32B on Serverless GPUs](https://tensorfuse.io/docs/guides/reasoning/qwen_qwq.md): Deploy Qwen QwQ 32B using Tensorfuse - [Transforming Qwen 7B into Your Own Reasoning Model](https://tensorfuse.io/docs/guides/reasoning/unsloth/qwen7b.md): Fine-tune Qwen 7B for reasoning tasks on your AWS account using tensorfuse and unsloth with GRPO. - [Deploying Resnet on serverless GPUs](https://tensorfuse.io/docs/guides/resnet_deployment.md): Deploy serverless GPU applications on your AWS account - [Deploying Stable Diffusion 3 Medium on Serverless GPUs](https://tensorfuse.io/docs/guides/sd3_medium.md): Deploy serverless GPU applications on your AWS account - [Deploying SpeechT5 on serverless GPUs](https://tensorfuse.io/docs/guides/tts_depl.md): Deploy serverless GPU applications on your AWS account - [Changelog](https://tensorfuse.io/docs/reference/changelog/changelog.md): Changes made to the Tensorkube CLI - [tensorkube account](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_account.md) - [tensorkube cluster](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_cluster.md) - [tensorkube configure](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_configure.md) - [tensorkube datasets](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_datasets.md) - [tensorkube deploy](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_deploy.md) - [tensorkube deployment](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_deployment.md) - [tensorkube dev](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_dev.md) - [tensorkube domain](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_domain.md) - [tensorkube environment](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_environment.md) - [tensorkube get-permissions-command](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_get_permissions_command.md) - [tensorkube get-principal-arn](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_get_principal_arn.md) - [tensorkube give-cluster-access](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_give_cluster_access.md) - [tensorkube install-prerequisites](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_install_prerequisites.md) - [tensorkube job](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_job.md) - [tensorkube list](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_list.md) - [tensorkube login](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_login.md) - [tensorkube reset](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_reset.md) - [tensorkube secret](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_secret.md) - [tensorkube sync](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_sync.md) - [tensorkube teardown](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_teardown.md) - [tensorkube train](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_train.md) - [tensorkube upgrade](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_upgrade.md) - [tensorkube version](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_version.md) - [tensorkube volume](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_volume.md) - [Permissions for Configure](https://tensorfuse.io/docs/reference/permissions/configure_permissions.md): AWS Permissions required for running `tensorkube configure` - [Permissions to connect Tensorfuse to your account](https://tensorfuse.io/docs/reference/permissions/frontend_connect_permissions.md): AWS Permissions required for connecting Tensorfuse to your AWS account