Instructions to use ChaoticNeutrals/Visual-LaylelemonMaidRP-7B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use ChaoticNeutrals/Visual-LaylelemonMaidRP-7B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="ChaoticNeutrals/Visual-LaylelemonMaidRP-7B")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("ChaoticNeutrals/Visual-LaylelemonMaidRP-7B") model = AutoModelForCausalLM.from_pretrained("ChaoticNeutrals/Visual-LaylelemonMaidRP-7B") - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use ChaoticNeutrals/Visual-LaylelemonMaidRP-7B with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "ChaoticNeutrals/Visual-LaylelemonMaidRP-7B" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ChaoticNeutrals/Visual-LaylelemonMaidRP-7B", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/ChaoticNeutrals/Visual-LaylelemonMaidRP-7B
- SGLang
How to use ChaoticNeutrals/Visual-LaylelemonMaidRP-7B with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "ChaoticNeutrals/Visual-LaylelemonMaidRP-7B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ChaoticNeutrals/Visual-LaylelemonMaidRP-7B", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "ChaoticNeutrals/Visual-LaylelemonMaidRP-7B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ChaoticNeutrals/Visual-LaylelemonMaidRP-7B", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use ChaoticNeutrals/Visual-LaylelemonMaidRP-7B with Docker Model Runner:
docker model run hf.co/ChaoticNeutrals/Visual-LaylelemonMaidRP-7B
| base_model: | |||
| - Nitral-AI/Infinitely-Laydiculous-7B | |||
| - Nitral-AI/Stanta-Lelemon-Maid-7B | |||
| library_name: transformers | |||
| tags: | |||
| - mergekit | |||
| - merge | |||
| license: other | |||
| %3C!----%3E%3C%2Ftd%3E%3C%2Ftr%3E%3Ctr id="L12"> | Heard you like Imatrix Quants, if so find them from lewdiculus here: https://huggingface.co/Lewdiculous/Visual-LaylelemonMaidRP-7B-GGUF-IQ-Imatrix | ||
| # Vision/multimodal capabilities: | |||
| If you want to use vision functionality: | |||
| * You must use the latest versions of [Koboldcpp](https://github.com/LostRuins/koboldcpp). | |||
| To use the multimodal capabilities of this model and use **vision** you need to load the specified **mmproj** file, this can be found inside this model repo. | |||
| * You can load the **mmproj** by using the corresponding section in the interface: | |||
| %3C!----%3E%3C%2Ftd%3E%3C%2Ftr%3E%3C!--%5D--%3E%3C%2Ftbody%3E%3C%2Ftable%3E%3C%2Fdiv%3E |