Running Local LLM Models on NVIDIA Jetson Orin
Today, I would like to introduce a project: running the Llama 2 model on the Jetson AGX Orin. This project comes from: Don’t worry, we have already replicated this project on the Jetson AGX Xavier 32GB, so it is feasible.

Background
Large language models (LLMs) like ChatGPT and Llama 2 have the potential to change …