AI & ML interests

AI inference, AI in the cloud, AI on the edge, software acceleration of AI workloads on hardware, efficient AI deployments, GPU-free AI inference, AI model optimization.

Recent Activity

dkupnicki published a model 19 days ago
AmpereComputing/bge-m3-gguf
dkupnicki updated a model 19 days ago
AmpereComputing/bge-m3-gguf
jangrzybek updated a model 3 months ago
AmpereComputing/granite-4.0-h-small-gguf