| [Research] From Functional Geometry to Dynamic Grammar: New LIMEN Audits (V23–V24) Across 7 Architectures | 2 | 26 | July 2, 2026 |
| A comprehensive, bilingual guide to Transformers: From foundations to KV-cache compression & attention dynamics | 0 | 21 | June 29, 2026 |
| Error fix of the 503 loop | 1 | 47 | June 25, 2026 |
| Deprecated parameters of pipeline() included in the course | 0 | 24 | June 12, 2026 |
| Fine tuning for social media trends | 2 | 84 | June 8, 2026 |
| A note on interpreting internal dynamics: Stability vs. Semantic Correctness in Transformers | 0 | 29 | June 2, 2026 |
| How can LLMs be fine-tuned for specialized domain knowledge? | 3 | 1558 | May 29, 2026 |
| Need generative model, high-quality description generation | 3 | 110 | May 28, 2026 |
| SFTTrainerflags blocks assistant_only_loss=True | 3 | 125 | May 26, 2026 |
| Date format for fine-tuning AI models | 5 | 104 | May 22, 2026 |
| Chatbot Start Prompt for GPT-J | 5 | 1400 | May 21, 2026 |
| Automatic -100 masking of the questions in Labels | 1 | 39 | May 21, 2026 |
| PTQ INT8 via TFLiteConverter — encoder-decoder seq2seq model loses encoder context entirely after conversion | 3 | 114 | May 16, 2026 |
| Fucking hugging face changed the zerogpu | 0 | 32 | May 14, 2026 |
| Train a fully open SmolLM4-750M model | 0 | 204 | May 11, 2026 |
| The BPE pre-tokenizer was not recognized! | 6 | 301 | May 7, 2026 |
| Custom batches in sentence-transformers for MultipleNegativesRankingLoss | 3 | 130 | May 1, 2026 |
| I developed an experimental Graph-Native Artificial Brain engine | 4 | 83 | May 1, 2026 |
| When i use tool its pause and restart space not working why | 0 | 21 | April 30, 2026 |
| CPU offloading error scenario | 11 | 374 | April 27, 2026 |
| Gemma 3 12B: 4-bit Quantization failing/ignored in Transformers v5.1.0 (Gemma3ForConditionalGeneration) | 9 | 470 | April 24, 2026 |
| Why am I facing this Error while running this code | 1 | 106 | April 23, 2026 |
| What are the best tutorials to learn Transformers step by step? | 2 | 164 | April 20, 2026 |
| LLM Course code errors | 8 | 364 | April 17, 2026 |
| Independent researcher looking for technical feedback on a paper about a revision-capable language model | 0 | 39 | April 17, 2026 |
| Why this BERTScore has a high precision? | 1 | 130 | April 16, 2026 |
| Fine-tuning Gemma-4-E2B on MacBook M3 | 4 | 1126 | April 14, 2026 |
| Current State and Future of "Integer-Only" LLM Inference (Non-Floating Point) | 1 | 286 | April 14, 2026 |
| Continuous increase in Memory usage | 17 | 2400 | April 14, 2026 |
| Peft 0.18.1 crashing when fine-tuning - Part 2 | 2 | 49 | April 14, 2026 |