Simplified Paper Implementations

Project Goal

This project aims to provide simplified implementations of research papers. While many original authors release their code, it often includes complex distributed, parallel, or framework-specific code that can obscure the core innovations of the paper for readers.

This project addresses this challenge by simplifying the implementations based on the original papers and the authors' public code. All complex distributed, parallel, and framework-specific code has been removed to offer the most straightforward replications of the papers' core ideas.

Implemented Papers

Currently, the following papers have been implemented:

Absolute Zero Reinforced Self-play Reasoning with Zero Data.py: Corresponds to the paper "Absolute Zero Reinforced Self-play Reasoning with Zero Data" https://arxiv.org/abs/2505.03335
LUFFY.py: Corresponds to the paper "Learning to Reason under OFF-Policy guidance"https://arxiv.org/abs/2504.14945
GRPO.py: Based on the implementation found at https://github.com/aburkov/theLMbook.
swin_transformer.py: A reproduction of the paper "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
vit.py: A reproduction of the paper "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale".
reinforce++.py: A reproduction of "REINFORCE++: An Efficient RLHF Algorithm with Robustness to Both Prompt and Reward Models". Important Note: I do not currently endorse this paper, as I believe it misinterprets the concept of advantage and provides insufficient justification for its batch normalization approach, with inadequate experimental support. It is recommended to await further community feedback before use.
Qformer.py: A reproduction of the Qformer module.
MA-LMM.py: A reproduction of the paper "MA-LMM Memory-Augmented Large Multimodal Model" https://github.com/boheumd/MA-LMM.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
GRPO.py		GRPO.py
LUFFY.py		LUFFY.py
MA-LMM.py		MA-LMM.py
Qformer.py		Qformer.py
README.md		README.md
absolute_zero.py		absolute_zero.py
absolute_zero总结.md		absolute_zero总结.md
diffusion.py		diffusion.py
gate_attention.py		gate_attention.py
reinforce++.py		reinforce++.py
sam_encoder.py		sam_encoder.py
swin_transformer.py		swin_transformer.py
vit.py		vit.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Simplified Paper Implementations

Project Goal

Implemented Papers

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Simplified Paper Implementations

Project Goal

Implemented Papers

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages