| | Hyperlink: On-device AI agent that searches and summarizes all your local files (nexa.ai) |
| 4 points by alanzhuly 67 days ago | past |
|
| | Qwen3-VL-4B and 8B runs locally on NPU, GPU, and CPU with one SDK (nexa.ai) |
| 3 points by alanzhuly 3 months ago | past |
|
| | We Ran OpenAI GPT-OSS 20B Locally on a Phone (nexa.ai) |
| 2 points by alanzhuly 3 months ago | past |
|
| | We Ran GPT‑OSS 20B Local on a Phone (nexa.ai) |
| 1 point by BUFU 3 months ago | past |
|
| | Hyperlink, an Offline Private AI Agent for Local Files (nexa.ai) |
| 2 points by jinqueeny 3 months ago | past |
|
| | Nexa SDK, Run, build and ship local AI in minutes (nexa.ai) |
| 2 points by jinqueeny 3 months ago | past |
|
| | Show HN: Even Ollama says this local AI inference is cool – Nexa SDK for NPU (nexa.ai) |
| 4 points by ks1225 3 months ago | past |
|
| | New Engine to Run SOTA AI Models on Qualcomm NPU Across Phone, PC, Cars, and IoT (nexa.ai) |
| 1 point by BUFU 4 months ago | past |
|
| | OmniNeural-4B: First NPU-Aware Multimodal AI Model (nexa.ai) |
| 1 point by jinqueeny 5 months ago | past |
|
| | Nexa AI Blogs (nexa.ai) |
| 1 point by BUFU 5 months ago | past |
|
| | On-Device SLM Leaderboard (nexa.ai) |
| 2 points by mountainview 11 months ago | past |
|
| | Quantized DeepSeek R1 Distill Models with Original Model Accuracy (nexa.ai) |
| 2 points by BUFU 11 months ago | past |
|
| | On-Device Gen AI Multimodal Benchmarks Across Devices (nexa.ai) |
| 1 point by jinqueeny 11 months ago | past |
|
| | NexaQuant: Llama.cpp-Compatible Model Compression with 100%+ Accuracy Recovery (nexa.ai) |
| 3 points by BUFU on Jan 3, 2025 | past | 1 comment |
|
| | How to unify Gemma and Whisper to build a super fast local voice LLM (nexa.ai) |
| 2 points by alanzhuly on Dec 17, 2024 | past |
|
| | OmniAudio-2.6B: Fastest Audio Language Model for Edge Deployment (nexa.ai) |
| 2 points by BUFU on Dec 13, 2024 | past | 1 comment |
|
| | Run Qwen Audio Language Model on Local Devices for Voice Chat and Audio Analysis (nexa.ai) |
| 4 points by BUFU on Nov 25, 2024 | past |
|
| | Omnivision-968M: Vision Language Model with 9x Tokens Reduction for Edge Devices (nexa.ai) |
| 69 points by BUFU on Nov 15, 2024 | past | 12 comments |
|
| | Tiny (1B/3B) LLMs in a local RAG system (nexa.ai) |
| 2 points by jinqueeny on Nov 8, 2024 | past |
|
| | What can you do with tiny (1B/3B) LLMs in a local RAG system? (nexa.ai) |
| 3 points by jinqueeny on Nov 6, 2024 | past |
|
| | What you can do with tiny (1B/3B) LLMs in a local RAG system? (nexa.ai) |
| 1 point by alanzhuly on Nov 5, 2024 | past |
|