How do you inference/serve the model?

#1
by limcheekin - opened

Hi there,

Thanks for sharing the converted models.

I inferencing other LLMs using rkllama, what do you use and how do you inference the embedding model?

Thanks for advice.

Sign up or log in to comment