GitHub topics: chatbot-memory
aahouzi/llama2-chatbot-cpu
A LLaMA2-7b chatbot with memory running on CPU, and optimized using smooth quantization, 4-bit quantization or Intel® Extension For PyTorch with bfloat16.
Language: Python - Size: 30.3 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 0
