H

Huajian Xin

Northeast Agricultural University

Publishes on Topic Modeling, Logic, programming, and type systems, Mathematics, Computing, and Information Processing. 18 papers and 259 citations.

18Publications
259Total Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

DeepSeek-V3 Technical Report
DeepSeek-AI, Aixin Liu, Bei Feng et al.|arXiv (Cornell University)|2024
Cited by 222Open Access

We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. To achieve efficient inference and cost-effective training, DeepSeek-V3 adopts Multi-head Latent Attention (MLA) and DeepSeekMoE architectures, which were thoroughly validated in DeepSeek-V2. Furthermore, DeepSeek-V3 pioneers an auxiliary-loss-free strategy for load balancing and sets a multi-token prediction training objective for stronger performance. We pre-train DeepSeek-V3 on 14.8 trillion diverse and high-quality tokens, followed by Supervised Fine-Tuning and Reinforcement Learning stages to fully harness its capabilities. Comprehensive evaluations reveal that DeepSeek-V3 outperforms other open-source models and achieves performance comparable to leading closed-source models. Despite its excellent performance, DeepSeek-V3 requires only 2.788M H800 GPU hours for its full training. In addition, its training process is remarkably stable. Throughout the entire training process, we did not experience any irrecoverable loss spikes or perform any rollbacks. The model checkpoints are available at https://github.com/deepseek-ai/DeepSeek-V3.

FIMO: A Challenge Formal Dataset for Automated Theorem Proving
Chengwu Liu, Jianhao Shen, Huajian Xin et al.|arXiv (Cornell University)|2023
Cited by 10Open Access

We present FIMO, an innovative dataset comprising formal mathematical problem statements sourced from the International Mathematical Olympiad (IMO) Shortlisted Problems. Designed to facilitate advanced automated theorem proving at the IMO level, FIMO is currently tailored for the Lean formal language. It comprises 149 formal problem statements, accompanied by both informal problem descriptions and their corresponding LaTeX-based informal proofs. Through initial experiments involving GPT-4, our findings underscore the existing limitations in current methodologies, indicating a substantial journey ahead before achieving satisfactory IMO-level automated theorem proving outcomes.

LEGO-Prover: Neural Theorem Proving with Growing Libraries
Haiming Wang, Huajian Xin, Chuanyang Zheng et al.|arXiv (Cornell University)|2023
Cited by 9Open Access

Despite the success of large language models (LLMs), the task of theorem proving still remains one of the hardest reasoning tasks that is far from being fully solved. Prior methods using language models have demonstrated promising results, but they still struggle to prove even middle school level theorems. One common limitation of these methods is that they assume a fixed theorem library during the whole theorem proving process. However, as we all know, creating new useful theorems or even new theories is not only helpful but crucial and necessary for advancing mathematics and proving harder and deeper results. In this work, we present LEGO-Prover, which employs a growing skill library containing verified lemmas as skills to augment the capability of LLMs used in theorem proving. By constructing the proof modularly, LEGO-Prover enables LLMs to utilize existing skills retrieved from the library and to create new skills during the proving process. These skills are further evolved (by prompting an LLM) to enrich the library on another scale. Modular and reusable skills are constantly added to the library to enable tackling increasingly intricate mathematical problems. Moreover, the learned library further bridges the gap between human proofs and formal proofs by making it easier to impute missing steps. LEGO-Prover advances the state-of-the-art pass rate on miniF2F-valid (48.0% to 57.0%) and miniF2F-test (45.5% to 47.1%). During the proving process, LEGO-Prover also manages to generate over 20,000 skills (theorems/lemmas) and adds them to the growing library. Our ablation study indicates that these newly added skills are indeed helpful for proving theorems, resulting in an improvement from a success rate of 47.1% to 50.4%. We also release our code and all the generated skills.

Attenuation of Ultraviolet Radiation by Aerosols and Clouds in Beijing Area in 2005–2020
Shuman Zhao, Huajian Xin, Shumin Wu et al.|Atmosphere|2024
Cited by 7Open Access

Ultraviolet radiation (UV) has strong chemical and biological effects on human health and ecosystems, and it plays an important role in the atmospheric environment by affecting photochemical processes, etc. Clouds and aerosols are the main factors affecting UV radiation and analyzing the quantitative impact of them on UV radiation is of great significance. Using the observation data of UV radiation in Beijing from 2005 to 2020, as well as the data of aerosol optical depth (AOD), single scattering albedo (SSA), and other related parameters, this paper simulated the surface UV radiation in two scenarios of cloudless without aerosol and cloudless with aerosol based on the TUV (Tropospheric Ultraviolet-Visible model), and quantitatively evaluated the attenuation of UV radiation by aerosol and cloud in the Beijing area. The results show that UV radiation is more sensitive to changes in AOD. Fixing the SSA value to 0.9, when the AOD increases from 0.2 to 1.0, the UV radiation decreases from 21.16 W/m2 to 12.64 W/m2 at 12:00; when AOD is maintained at 0.64, the SSA increases from 0.7 to 0.95, and the UV radiation increases from 14.55 W/m2 to 19.91 W/m2. The average annual attenuation rates of ultraviolet radiation by aerosols and clouds from 2005 to 2020 are 30.64% and 40.22%, respectively; the monthly averaged attenuation rates are 30.48% and 42.04%, respectively; and the daily averaged attenuation rates are 31.02% and 50.45%, respectively.

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data
Huajian Xin, Daya Guo, Zhihong Shao et al.|arXiv (Cornell University)|2024
Cited by 4Open Access

Proof assistants like Lean have revolutionized mathematical proof verification, ensuring high accuracy and reliability. Although large language models (LLMs) show promise in mathematical reasoning, their advancement in formal theorem proving is hindered by a lack of training data. To address this issue, we introduce an approach to generate extensive Lean 4 proof data derived from high-school and undergraduate-level mathematical competition problems. This approach involves translating natural language problems into formal statements, filtering out low-quality statements, and generating proofs to create synthetic data. After fine-tuning the DeepSeekMath 7B model on this synthetic dataset, which comprises 8 million formal statements with proofs, our model achieved whole-proof generation accuracies of 46.3% with 64 samples and 52% cumulatively on the Lean 4 miniF2F test, surpassing the baseline GPT-4 at 23.0% with 64 samples and a tree search reinforcement learning method at 41.0%. Additionally, our model successfully proved 5 out of 148 problems in the Lean 4 Formalized International Mathematical Olympiad (FIMO) benchmark, while GPT-4 failed to prove any. These results demonstrate the potential of leveraging large-scale synthetic data to enhance theorem-proving capabilities in LLMs. Both the synthetic dataset and the model will be made available to facilitate further research in this promising field.