YuanHongyi's Projects
Assignments of AdvancedComputationalStatistics-2021Spring by Prof. Ke Deng
Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and Classification [AI in Medicine Journal]
BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022]
The 3rd place in the CHIP Chinese biomedical concept retrieval task 2020
EHRDiff: Exploring Realistic EHR Synthesis with Diffusion Models [TMLR]
Tools for curating biomedical training data for large-scale language modeling
Generative Biomedical Entity Linking via Knowledge Base-Guided Pre-training and Synonyms-Aware Fine-tuning [NAACL 2022]
Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]
InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning
Source codes and datasets for How well do Large Language Models perform in Arithmetic tasks?
Exploring Partial Knowledge Base Inference in Biomedical Entity Linking [ACL-BioNLP 2023]
Aligning Human Preferences and Language Models without tears
Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]
Code and documentation to train Stanford's Alpaca models, and generate the data.