Mistral 7B v0.1 is Mistral AI's first Large Language Model (LLM). An LLM is an artificial intelligence model trained on massive amounts of data that can generate coherent text and perform various natural language processing tasks.
We provide a Docker image that bundles vLLM, a fast Python inference server, with everything required to run our model, so that you can quickly spin up a completion API on any major cloud provider with NVIDIA GPUs.
Where to start?
If you are interested in the deployment of the Mistral AI LLM on your own infrastructure, check out the Quickstart. If you want to use the API served by a deployed instance, go to the Interacting with the model page or to the API specification.
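To give a rough idea of what using a deployed instance can look like, here is a minimal Python sketch that sends a prompt to a completion endpoint. It rests on several assumptions not stated on this page: that the server is reachable at `http://localhost:8000`, that it exposes an OpenAI-style `/v1/completions` route (as vLLM's OpenAI-compatible server does), and that the model is registered as `mistralai/Mistral-7B-v0.1`. The helper names `build_completion_request` and `complete` are hypothetical. Check the API specification for the routes and fields your deployment actually serves.

```python
import json
import urllib.request

# Assumptions (not from this page): host, port, route, and model name.
BASE_URL = "http://localhost:8000"
MODEL = "mistralai/Mistral-7B-v0.1"


def build_completion_request(prompt, max_tokens=64, temperature=0.7):
    """Return the JSON-serializable body for a completion request."""
    return {
        "model": MODEL,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }


def complete(prompt, base_url=BASE_URL, **kwargs):
    """POST the prompt to the server and return the first generated text."""
    body = json.dumps(build_completion_request(prompt, **kwargs)).encode("utf-8")
    request = urllib.request.Request(
        f"{base_url}/v1/completions",  # assumed OpenAI-style route
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        payload = json.load(response)
    # Assumed OpenAI-style response shape: {"choices": [{"text": ...}, ...]}
    return payload["choices"][0]["text"]


if __name__ == "__main__":
    # Requires a running server at BASE_URL.
    print(complete("The capital of France is"))
```

Building the request body in its own function keeps the payload easy to inspect and adjust without a live server; only `complete` needs network access.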
Mistral AI is committed to open source software development and welcomes external contributions. Please open a PR!