How do you inference/serve the model?
#1
by
limcheekin - opened
Hi there,
Thanks for sharing the converted models.
I inferencing other LLMs using rkllama, what do you use and how do you inference the embedding model?
Thanks for advice.