Submissions from pytorch.org

		Quantization-Aware Training for Large Language Models with PyTorch (2024) (pytorch.org)
		2 points by tosh 9 months ago \| past
		TorchServe is no longer actively maintained (pytorch.org)
		2 points by tbobm 9 months ago \| past
		PyTorch 2.6 (pytorch.org)
		1 point by tosh 11 months ago \| past
		VLLM Joins PyTorch Ecosystem (pytorch.org)
		2 points by reqo on Dec 10, 2024 \| past \| 1 comment
		Distilling Llama3.1 8B into 1B in torchtune (pytorch.org)
		1 point by tosh on Dec 10, 2024 \| past
		PyTorch Deprecation of Conda Nightly Builds (pytorch.org)
		3 points by yeldarb on Nov 17, 2024 \| past \| 1 comment
		PyTorch Deprecation of Conda Nightly Builds (pytorch.org)
		3 points by nmstoker on Nov 15, 2024 \| past \| 1 comment
		Torch.load flipping default to weights_only=True (pytorch.org)
		2 points by formalsystem on Nov 4, 2024 \| past
		PyTorch 2.5.0 Release, SDPA CuDNN backend, Flex Attention (pytorch.org)
		1 point by lnyan on Oct 18, 2024 \| past
		PyTorch Conference 2024 Recap (pytorch.org)
		1 point by jonbaer on Oct 3, 2024 \| past
		PyTorch Native Architecture Optimization: Torchao (pytorch.org)
		169 points by jonbaer on Sept 28, 2024 \| past \| 52 comments
		Async Tensor Parallelism in PyTorch (pytorch.org)
		2 points by lnyan on Sept 17, 2024 \| past
		CUDA-Free Inference for LLMs (pytorch.org)
		3 points by ororm on Sept 5, 2024 \| past
		PyTorch 2.4 Now Supports Intel GPUs for Faster Workloads (pytorch.org)
		19 points by soulbadguy on Aug 31, 2024 \| past \| 2 comments
		FlexAttention: The Flexibility of PyTorch with the Performance of FlashAttention (pytorch.org)
		210 points by limoce on Aug 8, 2024 \| past \| 24 comments
		A guide on good usage of non_blocking and pin_memory() in PyTorch (pytorch.org)
		1 point by yu3zhou4 on Aug 1, 2024 \| past
		Torchchat (pytorch.org)
		2 points by jonbaer on Aug 1, 2024 \| past
		Torchchat: Accelerating Local LLM Inference on Laptop, Desktop and Mobile (pytorch.org)
		2 points by OutOfHere on July 30, 2024 \| past
		PyTorch 2.4: Python 3.12, AOTInductor freezing (pytorch.org)
		4 points by DreamFlasher on July 24, 2024 \| past
		Int4 Decoding GQA CUDA Optimizations for LLM Inference (pytorch.org)
		1 point by jxmorris12 on June 13, 2024 \| past
		ExecuTorch Alpha: Taking LLMs and AI to the Edge (pytorch.org)
		4 points by brainer on May 1, 2024 \| past \| 1 comment
		PyTorch 2.3 (pytorch.org)
		3 points by WithinReason on April 25, 2024 \| past
		Large Scale Transformer Model Training with Tensor Parallel (pytorch.org)
		2 points by pama on April 25, 2024 \| past
		PyTorch 2.3 Release Blog (pytorch.org)
		3 points by nothrowaways on April 25, 2024 \| past
		Torchtune: Easily fine-tune LLMs using PyTorch (pytorch.org)
		2 points by g42gregory on April 17, 2024 \| past
		Torchtune: Easily fine-tune LLMs using PyTorch (pytorch.org)
		6 points by minimaxir on April 16, 2024 \| past
		Accelerating Moe Model Inference with Locality-Aware Kernel Design (pytorch.org)
		2 points by ABudai on April 5, 2024 \| past
		PyTorch Word Embeddings Tutorial (pytorch.org)
		1 point by jxmorris12 on March 28, 2024 \| past
		What Every User Should Know About Mixed Precision Training in PyTorch (2022) (pytorch.org)
		69 points by yu3zhou4 on March 15, 2024 \| past \| 24 comments
		PyTorch 2 Faster ML Dynamic Python Bytecode Transformation and Graph Compilation [pdf] (pytorch.org)
		2 points by matt_d on Feb 8, 2024 \| past
		More