| | Quantization-Aware Training for Large Language Models with PyTorch (2024) (pytorch.org) |
| 2 points by tosh 9 months ago | past |
|
| | TorchServe is no longer actively maintained (pytorch.org) |
| 2 points by tbobm 9 months ago | past |
|
| | PyTorch 2.6 (pytorch.org) |
| 1 point by tosh 11 months ago | past |
|
| | vLLM Joins PyTorch Ecosystem (pytorch.org) |
| 2 points by reqo on Dec 10, 2024 | past | 1 comment |
|
| | Distilling Llama3.1 8B into 1B in torchtune (pytorch.org) |
| 1 point by tosh on Dec 10, 2024 | past |
|
| | PyTorch Deprecation of Conda Nightly Builds (pytorch.org) |
| 3 points by yeldarb on Nov 17, 2024 | past | 1 comment |
|
| | PyTorch Deprecation of Conda Nightly Builds (pytorch.org) |
| 3 points by nmstoker on Nov 15, 2024 | past | 1 comment |
|
| | torch.load flipping default to weights_only=True (pytorch.org) |
| 2 points by formalsystem on Nov 4, 2024 | past |
|
| | PyTorch 2.5.0 Release, SDPA cuDNN backend, FlexAttention (pytorch.org) |
| 1 point by lnyan on Oct 18, 2024 | past |
|
| | PyTorch Conference 2024 Recap (pytorch.org) |
| 1 point by jonbaer on Oct 3, 2024 | past |
|
| | PyTorch Native Architecture Optimization: Torchao (pytorch.org) |
| 169 points by jonbaer on Sept 28, 2024 | past | 52 comments |
|
| | Async Tensor Parallelism in PyTorch (pytorch.org) |
| 2 points by lnyan on Sept 17, 2024 | past |
|
| | CUDA-Free Inference for LLMs (pytorch.org) |
| 3 points by ororm on Sept 5, 2024 | past |
|
| | PyTorch 2.4 Now Supports Intel GPUs for Faster Workloads (pytorch.org) |
| 19 points by soulbadguy on Aug 31, 2024 | past | 2 comments |
|
| | FlexAttention: The Flexibility of PyTorch with the Performance of FlashAttention (pytorch.org) |
| 210 points by limoce on Aug 8, 2024 | past | 24 comments |
|
| | A guide on good usage of non_blocking and pin_memory() in PyTorch (pytorch.org) |
| 1 point by yu3zhou4 on Aug 1, 2024 | past |
|
| | Torchchat (pytorch.org) |
| 2 points by jonbaer on Aug 1, 2024 | past |
|
| | Torchchat: Accelerating Local LLM Inference on Laptop, Desktop and Mobile (pytorch.org) |
| 2 points by OutOfHere on July 30, 2024 | past |
|
| | PyTorch 2.4: Python 3.12, AOTInductor freezing (pytorch.org) |
| 4 points by DreamFlasher on July 24, 2024 | past |
|
| | Int4 Decoding GQA CUDA Optimizations for LLM Inference (pytorch.org) |
| 1 point by jxmorris12 on June 13, 2024 | past |
|
| | ExecuTorch Alpha: Taking LLMs and AI to the Edge (pytorch.org) |
| 4 points by brainer on May 1, 2024 | past | 1 comment |
|
| | PyTorch 2.3 (pytorch.org) |
| 3 points by WithinReason on April 25, 2024 | past |
|
| | Large Scale Transformer Model Training with Tensor Parallel (pytorch.org) |
| 2 points by pama on April 25, 2024 | past |
|
| | PyTorch 2.3 Release Blog (pytorch.org) |
| 3 points by nothrowaways on April 25, 2024 | past |
|
| | Torchtune: Easily fine-tune LLMs using PyTorch (pytorch.org) |
| 2 points by g42gregory on April 17, 2024 | past |
|
| | Torchtune: Easily fine-tune LLMs using PyTorch (pytorch.org) |
| 6 points by minimaxir on April 16, 2024 | past |
|
| | Accelerating MoE Model Inference with Locality-Aware Kernel Design (pytorch.org) |
| 2 points by ABudai on April 5, 2024 | past |
|
| | PyTorch Word Embeddings Tutorial (pytorch.org) |
| 1 point by jxmorris12 on March 28, 2024 | past |
|
| | What Every User Should Know About Mixed Precision Training in PyTorch (2022) (pytorch.org) |
| 69 points by yu3zhou4 on March 15, 2024 | past | 24 comments |
|
| | PyTorch 2: Faster ML Through Dynamic Python Bytecode Transformation and Graph Compilation [pdf] (pytorch.org) |
| 2 points by matt_d on Feb 8, 2024 | past |
|