Llama2Engine
Last updated
Last updated
Llama2Engine
is the class for local Llama2 model. Llama 2 is a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. This is the engine for the 7B pretrained model.
We load the model weights from Huggingface, see for more details. You can also load the model weights from your local disk.
Example