Llama2Engine
Llama2Engine is the class for running a local Llama 2 model. Llama 2 is a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. This engine uses the 7B pretrained model.
By default, the model weights are loaded from Hugging Face. You can also load the model weights from your local disk.
Example
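The sketch below illustrates the two loading paths described above (Hugging Face or local disk). It does not use Llama2Engine's own interface, which is not shown on this page; instead it calls the Hugging Face transformers library directly. The model ID `meta-llama/Llama-2-7b-hf`, the helper names `resolve_model_source`, `load_llama2`, and `generate`, and the local directory in the usage note are assumptions made for illustration. Access to the Llama 2 weights on Hugging Face is gated and must be requested beforehand.

```python
from pathlib import Path
from typing import Optional

# Assumption: Hub ID of the 7B pretrained checkpoint in transformers format.
DEFAULT_MODEL_ID = "meta-llama/Llama-2-7b-hf"


def resolve_model_source(local_dir: Optional[str] = None) -> str:
    """Return the local directory if it exists, otherwise the Hub model ID."""
    if local_dir is not None and Path(local_dir).is_dir():
        return str(Path(local_dir))
    return DEFAULT_MODEL_ID


def load_llama2(source: str):
    """Load the tokenizer and model from a Hub ID or a local directory."""
    # Imported lazily so resolve_model_source works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(source)
    model = AutoModelForCausalLM.from_pretrained(source)
    return tokenizer, model


def generate(prompt: str, source: str, max_new_tokens: int = 20) -> str:
    """Generate a short continuation of the prompt with the loaded model."""
    tokenizer, model = load_llama2(source)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

A call such as `generate("Hello, my name is", resolve_model_source("./llama-2-7b-hf"))` would use the weights in the (hypothetical) local directory if it exists and fall back to downloading them from Hugging Face otherwise.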