reasoning-language-models topics

This repo develops reasoning models for the financial domain, aiming to enhance models' capabilities in handling financial reasoning tasks.

The-FinAI

deepseek

financial-modeling

gpt-4o

llamas

GPT-OSS-MoE-ExpertFingerprinting

19 stars, 3 forks, 19 watchers

ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architecture

AmanPriyanshu

behavioral-analysis

deep-learning

expert-routing

gpt

Awesome-Long-Chain-of-Thought-Reasoning

596 stars, 28 forks, 596 watchers

Latest Advances on Long Chain-of-Thought Reasoning

LightChen233

agent

chain-of-thought

deepseek-r1

long

STAR-1

32 stars, 1 fork, 32 watchers

[AAAI'26 Oral] Official Implementation of STAR-1: Safer Alignment of Reasoning LLMs with 1K Data

UCSC-VLAA

ai-safety

alignment

deliberative-agent

language-generation

Awesome-RAG-Reasoning

369 stars, 30 forks, 369 watchers

[EMNLP 2025] Awesome RAG Reasoning Resources

DavidZWZ

agent

agentic-ai

agentic-rag

llm

CUREBench

124 stars, 32 forks, 124 watchers

CUREBench @ NeurIPS 2025: Benchmarking AI reasoning for therapeutic decision-making at scale

mims-harvard

agentic-ai

agents

large-language-models

neurips-2025

Logic-RL-Lite

49 stars, 0 forks, 49 watchers

Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".

DolbyUUU

deepseek

deepseek-r1

fine-tuning

gpt-o1