# Tensorfuse

## Docs

- [Blog](https://tensorfuse.io/docs/blogs/blog.md)
- [Handling Unhealthy Nodes in EKS](https://tensorfuse.io/docs/blogs/handling_unhealthy_nodes_in_eks.md)
- [Lazy loading isn’t the magic pill to fix AI Inference](https://tensorfuse.io/docs/blogs/lazy_loading_performance_degradation.md)
- [Understanding Multi GPU Communication and Nvidia NCCL for finetuning models](https://tensorfuse.io/docs/blogs/multi_gpu_communication_while_training.md)
- [Reducing GPU Cold Start Time when using vLLM](https://tensorfuse.io/docs/blogs/reducing_gpu_cold_start.md)
- [Small Language Models are the Future of Agentic AI](https://tensorfuse.io/docs/blogs/small_language_model.md)
- [Introducing Fastpull: Start AI containers in seconds](https://tensorfuse.io/docs/blogs/snapshotter_graphs.md): Fastpull uses lazy-loading snapshotters like SOCI and Nydus to start large AI/ML container images up to 10x faster, reducing cold start times from minutes to seconds.
- [CloudFormation won't fix your life but it can fix your infra](https://tensorfuse.io/docs/blogs/understanding_cfn.md): A look into how efficiently managing your cloud infrastructure will definitely make your life easier, even though it may not fix it
- [Adding Other Members](https://tensorfuse.io/docs/concepts/adding_members.md): Add members to your team and give them access to your cluster
- [Build](https://tensorfuse.io/docs/concepts/architecture/build.md)
- [Configure](https://tensorfuse.io/docs/concepts/architecture/configure.md): Architecture of resources provisioned during configure
- [Jobs](https://tensorfuse.io/docs/concepts/architecture/jobs.md)
- [Serve](https://tensorfuse.io/docs/concepts/architecture/serve.md)
- [Deployment Configuration](https://tensorfuse.io/docs/concepts/configuration.md): Configure your deployments using a deployment.yaml file for reproducible, version-controlled infrastructure.
- [Custom domains with TLS](https://tensorfuse.io/docs/concepts/custom_domains_with_tls.md): Add custom domains to your ML endpoints with HTTPS
- [Deployments](https://tensorfuse.io/docs/concepts/deployments.md): Learn how to deploy your containerized applications as serverless, auto-scaling API endpoints on Tensorfuse.
- [Dev Containers](https://tensorfuse.io/docs/concepts/devcontainers.md): Hot-reloading GPU-enabled containers for seamless testing and development
- [Environments](https://tensorfuse.io/docs/concepts/environments.md): Environments help you isolate your resources and manage your deployments.
- [Finetune Llama 3 70B on your AWS account](https://tensorfuse.io/docs/concepts/finetuning.md): Finetune LoRA adapters for popular models using axolotl styled declarative configs
- [Getting Started](https://tensorfuse.io/docs/concepts/getting_started_tensorkube.md): Deploy serverless GPU applications on your AWS account
- [Using Tensorkube with Github Actions](https://tensorfuse.io/docs/concepts/github_actions.md): How to create a tensorkube deployment with github actions
- [Tensorfuse](https://tensorfuse.io/docs/concepts/introduction.md): AI inference and finetuning on your AWS account
- [Job Queues](https://tensorfuse.io/docs/concepts/job_queues.md): Deploy your jobs and queue them programmatically with different parameters
- [Handling Node Failures](https://tensorfuse.io/docs/concepts/node_failures.md): Understanding how Tensorfuse deals with node failures
- [Debugging Tensorfuse Deployments and Jobs with logs](https://tensorfuse.io/docs/concepts/observability/debugging_with_logs.md): Debug your applications by querying past logs
- [Secrets](https://tensorfuse.io/docs/concepts/secrets.md): Use secrets in your deployments for sensitive information
- [Troubleshooting](https://tensorfuse.io/docs/concepts/troubleshooting.md): Guidelines and best practices for deploying applications on Tensorkube
- [Volumes](https://tensorfuse.io/docs/concepts/volumes.md): Use volumes in your tensorkube deployments
- [Sub-accounts](https://tensorfuse.io/docs/enterprise/sub_account_setup.md): Keep Tensorfuse runtime isolated and control Billing
- [Run SAM2 on your own AWS on Serverless GPUs](https://tensorfuse.io/docs/guides/SAM2.md): Deploy serverless GPU applications on your AWS account
- [Install AWS-CLI on Mac OS](https://tensorfuse.io/docs/guides/aws_cli.md): Deploy serverless GPU applications on your AWS account
- [Finetuning using Axolotl on your AWS Account](https://tensorfuse.io/docs/guides/axolotl_finetuning_guide.md): Complete guide to running flexible Axolotl finetuning jobs on TensorFuse with support for multiple dataset formats and configurable parameters
- [Cheatsheet for Tensorkube users](https://tensorfuse.io/docs/guides/cheatsheet.md): Deploy serverless GPU applications on your AWS account
- [Deploy Stable Diffusion XL model using ComfyUI on your AWS](https://tensorfuse.io/docs/guides/comfyui_stable_diffusion_xl.md): Deploy serverless GPU applications on your AWS account
- [Deploy Deepseek R1 671B on Serverless GPUs](https://tensorfuse.io/docs/guides/deepseek_r1.md): Deploy Deepseek R1 671B param model using Tensorfuse
- [Finetune Llama 3 70B on your AWS account](https://tensorfuse.io/docs/guides/finetuning_llama_70b.md): Finetune LoRA adapters for popular models using axolotl styled declarative configs
- [Deploy GGUF quants of Deepseek R1 671B on Serverless GPUs](https://tensorfuse.io/docs/guides/integrations/llama_cpp.md): Deploy GGUF quants of Deepseek R1 671B param model using Tensorfuse
- [Deploy Jina Embeddings model on your AWS](https://tensorfuse.io/docs/guides/jina_embeddings.md): Deploy serverless GPU applications on your AWS account
- [Deploy Llama-3.3-70B-Instruct on Serverless GPUs](https://tensorfuse.io/docs/guides/llama_guide.md): Deploy serverless GPU applications on your AWS account
- [Deploy Mochi 1 Preview Video Model using ComfyUI on your AWS](https://tensorfuse.io/docs/guides/mochi_comfy.md): Deploy serverless GPU applications on your AWS account
- [Deploying Sesame-CSM-1B on Serverless GPUs](https://tensorfuse.io/docs/guides/modality/audio/sesame-csm-1b.md): Deploy a serverless Sesame CSM 1B model on your AWS account
- [Deploying FLUX.1-dev on Serverless GPUs](https://tensorfuse.io/docs/guides/modality/image/flux_dev.md): Deploy serverless GPU applications on your AWS account
- [Deploy Llama 4 Models on your AWS account](https://tensorfuse.io/docs/guides/modality/text/llama_4.md): Deploy Llama 4 Scout and Maverick from Meta models using Tensorfuse
- [Deploy OpenAI OSS Models in your AWS account](https://tensorfuse.io/docs/guides/modality/text/openai_oss.md): Deploy GPT-OSS models from OpenAI in your AWS using Tensorfuse
- [Deploying Pixtral-12B-Quantised-4-Bit on Serverless GPUs](https://tensorfuse.io/docs/guides/pixtral_12b_quantized_4bit.md): Deploy serverless GPU applications on your AWS account
- [Deploy Qwen QwQ 32B on Serverless GPUs](https://tensorfuse.io/docs/guides/reasoning/qwen_qwq.md): Deploy Qwen QwQ 32B using Tensorfuse
- [Transforming Qwen 7B into Your Own Reasoning Model](https://tensorfuse.io/docs/guides/reasoning/unsloth/qwen7b.md): Fine-tune Qwen 7B for reasoning tasks on your AWS account using tensorfuse and unsloth with GRPO.
- [Deploying Resnet on serverless GPUs](https://tensorfuse.io/docs/guides/resnet_deployment.md): Deploy serverless GPU applications on your AWS account
- [Deploying Stable Diffusion 3 Medium on Serverless GPUs](https://tensorfuse.io/docs/guides/sd3_medium.md): Deploy serverless GPU applications on your AWS account
- [Deploying SpeechT5 on serverless GPUs](https://tensorfuse.io/docs/guides/tts_depl.md): Deploy serverless GPU applications on your AWS account
- [Changelog](https://tensorfuse.io/docs/reference/changelog/changelog.md): Changes made to the Tensorkube CLI
- [tensorkube account](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_account.md)
- [tensorkube cluster](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_cluster.md)
- [tensorkube configure](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_configure.md)
- [tensorkube datasets](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_datasets.md)
- [tensorkube deploy](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_deploy.md)
- [tensorkube deployment](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_deployment.md)
- [tensorkube dev](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_dev.md)
- [tensorkube domain](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_domain.md)
- [tensorkube environment](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_environment.md)
- [tensorkube get-permissions-command](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_get_permissions_command.md)
- [tensorkube get-principal-arn](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_get_principal_arn.md)
- [tensorkube give-cluster-access](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_give_cluster_access.md)
- [tensorkube install-prerequisites](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_install_prerequisites.md)
- [tensorkube job](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_job.md)
- [tensorkube list](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_list.md)
- [tensorkube login](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_login.md)
- [tensorkube reset](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_reset.md)
- [tensorkube secret](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_secret.md)
- [tensorkube sync](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_sync.md)
- [tensorkube teardown](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_teardown.md)
- [tensorkube train](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_train.md)
- [tensorkube upgrade](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_upgrade.md)
- [tensorkube version](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_version.md)
- [tensorkube volume](https://tensorfuse.io/docs/reference/cli_reference/tensorkube_volume.md)
- [Permissions for Configure](https://tensorfuse.io/docs/reference/permissions/configure_permissions.md): AWS Permissions required for running `tensorkube configure`
- [Permissions to connect Tensorfuse to your account](https://tensorfuse.io/docs/reference/permissions/frontend_connect_permissions.md): AWS Permissions required for connecting Tensorfuse to your AWS account