Gui's ML Library

Search

SearchSearch
      • Supervised Fine-Tuning
        • DeepSeek-R1 - Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
        • Layer Normalization
        • Monolith - Real Time Recommendation System With Collisionless Embedding Table
      • About
    Home

    ❯

    tags

    ❯

    institution

    Tag: #institution/DeepSeek

    1 item with this tag.

    • Mar 09, 2025

      DeepSeek-R1 - Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

      • #paper
      • #LLM
      • #institution/DeepSeek
      • #stub

    Website by Guilherme Ilunga, created with Quartz v4.1.4, © 2025