From my experimentation it seems like there's some significant loss in accuracy ...

		antimatter15 on March 16, 2023 \| parent \| context \| favorite \| on: Show HN: Alpaca.cpp – Run an Instruction-Tuned Cha... From my experimentation it seems like there's some significant loss in accuracy running the tuned LoRa models through llama.cpp (due to bugs/differences in inference or tokenization), even aside from losses due to quantization.