LLM Wiki

태그: reinforcement-learning

2건의 항목

  • 2026년 4월 15일

    Unsloth Gemma 4 RL Sudoku Notebook — 9GB VRAM 찍먹

    • ai-models
    • reinforcement-learning
    • gemma
    • grpo
    • unsloth
  • 2026년 3월 29일

    Memento-Skills — Let Agents Design Agents

    • ai-agents
    • multi-agent
    • llm
    • reinforcement-learning
    • skill-library

Created with Quartz v4.5.2 © 2026

  • GitHub
  • Discord Community