Thanks for visiting my Github Page. Here are some facts about me:
- 🔭 I’m currently working on: machine learning / deep learning, data analysis / risk control / data mining, and algorithm.
- 🌱 I'm also working on those directions of NLP: text classification, information extraction and text generation.
- 🔬 I'm now interesting in how to get high-quality text for training language models (for examples, text-to-text model such as T5, causal language model such as GPT2 / Phi), and how to speedy up LLM (Large Language Model) training, fine-tune and inference. In addition, the application of LLM in vertical fields is also a very interesting direction, such as RAG (Retrieval Augmented Generation).
- 📫 ······
- Languages:
- Python, SQL, Shell, C++, a little Golang and a little Java.
- Frameworks:
- PyTorch, Huggingface's NLP framework, Pandas & Numpy, PySpark, Hive.
- Developments:
- Linux, Git, Docker, VSCode, Markdown.