Gui's ML Library

    Folder: Papers

    3 items under this folder.

    • May 15, 2025

      Monolith - Real Time Recommendation System With Collisionless Embedding Table

      • #paper
      • #recsys
      • #stub
      • #institution/ByteDance
    • Mar 09, 2025

      DeepSeek-R1 - Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

      • #paper
      • #LLM
      • #institution/DeepSeek
      • #stub
    • Jan 27, 2024

      Layer Normalization

      • #normalization
      • #paper
      • #stub

    Website by Guilherme Ilunga, created with Quartz v4.1.4, © 2025