Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
Quantization-Aware Training for Large Language Models with PyTorch (2024) (pytorch.org)
2 points by tosh 9 months ago | past
TorchServe is no longer actively maintained (pytorch.org)
2 points by tbobm 9 months ago | past
PyTorch 2.6 (pytorch.org)
1 point by tosh 11 months ago | past
VLLM Joins PyTorch Ecosystem (pytorch.org)
2 points by reqo on Dec 10, 2024 | past | 1 comment
Distilling Llama3.1 8B into 1B in torchtune (pytorch.org)
1 point by tosh on Dec 10, 2024 | past
PyTorch Deprecation of Conda Nightly Builds (pytorch.org)
3 points by yeldarb on Nov 17, 2024 | past | 1 comment
PyTorch Deprecation of Conda Nightly Builds (pytorch.org)
3 points by nmstoker on Nov 15, 2024 | past | 1 comment
Torch.load flipping default to weights_only=True (pytorch.org)
2 points by formalsystem on Nov 4, 2024 | past
PyTorch 2.5.0 Release, SDPA CuDNN backend, Flex Attention (pytorch.org)
1 point by lnyan on Oct 18, 2024 | past
PyTorch Conference 2024 Recap (pytorch.org)
1 point by jonbaer on Oct 3, 2024 | past
PyTorch Native Architecture Optimization: Torchao (pytorch.org)
169 points by jonbaer on Sept 28, 2024 | past | 52 comments
Async Tensor Parallelism in PyTorch (pytorch.org)
2 points by lnyan on Sept 17, 2024 | past
CUDA-Free Inference for LLMs (pytorch.org)
3 points by ororm on Sept 5, 2024 | past
PyTorch 2.4 Now Supports Intel GPUs for Faster Workloads (pytorch.org)
19 points by soulbadguy on Aug 31, 2024 | past | 2 comments
FlexAttention: The Flexibility of PyTorch with the Performance of FlashAttention (pytorch.org)
210 points by limoce on Aug 8, 2024 | past | 24 comments
A guide on good usage of non_blocking and pin_memory() in PyTorch (pytorch.org)
1 point by yu3zhou4 on Aug 1, 2024 | past
Torchchat (pytorch.org)
2 points by jonbaer on Aug 1, 2024 | past
Torchchat: Accelerating Local LLM Inference on Laptop, Desktop and Mobile (pytorch.org)
2 points by OutOfHere on July 30, 2024 | past
PyTorch 2.4: Python 3.12, AOTInductor freezing (pytorch.org)
4 points by DreamFlasher on July 24, 2024 | past
Int4 Decoding GQA CUDA Optimizations for LLM Inference (pytorch.org)
1 point by jxmorris12 on June 13, 2024 | past
ExecuTorch Alpha: Taking LLMs and AI to the Edge (pytorch.org)
4 points by brainer on May 1, 2024 | past | 1 comment
PyTorch 2.3 (pytorch.org)
3 points by WithinReason on April 25, 2024 | past
Large Scale Transformer Model Training with Tensor Parallel (pytorch.org)
2 points by pama on April 25, 2024 | past
PyTorch 2.3 Release Blog (pytorch.org)
3 points by nothrowaways on April 25, 2024 | past
Torchtune: Easily fine-tune LLMs using PyTorch (pytorch.org)
2 points by g42gregory on April 17, 2024 | past
Torchtune: Easily fine-tune LLMs using PyTorch (pytorch.org)
6 points by minimaxir on April 16, 2024 | past
Accelerating Moe Model Inference with Locality-Aware Kernel Design (pytorch.org)
2 points by ABudai on April 5, 2024 | past
PyTorch Word Embeddings Tutorial (pytorch.org)
1 point by jxmorris12 on March 28, 2024 | past
What Every User Should Know About Mixed Precision Training in PyTorch (2022) (pytorch.org)
69 points by yu3zhou4 on March 15, 2024 | past | 24 comments
PyTorch 2 Faster ML Dynamic Python Bytecode Transformation and Graph Compilation [pdf] (pytorch.org)
2 points by matt_d on Feb 8, 2024 | past

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: