Code Monkey home page Code Monkey logo

Hi there 👋

I am an NLP algorithm engineer graduated from Xiamen University with bachelor degree (2015-2019) and Tianjin University with master degree (2019-2022).

My research interests include:

  • 🔭 clustering analysis (fuzzy clustering theory and linguistic clustering)
  • 🌱 machine translation (text-only and multimodal machine translation)
  • 👯 multimodal learning (pretraining technology and reasoning)
  • 🌱 large language modeling (infra, multilingual pretrain and efficient universal sft)

I am passionate about specializing in algorithms and fit them into practical applications.

Experiences

  • 📫 2023-09 - now : working on Foundational LLM Team, Alibaba Inc., towards the universal intelligence of LLM, especially on dialogue and searching.
  • 📫 2022-04 - 2023-09: worked on ByteDance AI Lab in the fields of multimodal/multilingual machine translation and multilingual LLM.
  • 📫 2021-07 - 2021-11: conducted research on semi-parametric MT as a NLP Research intern on Alibaba Damo Academy (One conference paper published).
  • 📫 2020-11 - 2021-02: participated in early NLP Migration Project on HUAWEI Ascend, our work was reported as a markable practice [wiki].
  • 📫 2020-05 - 2020-11: conducted research on translation quality estimation in corporation with OPPO Research (One paper under review).
  • 📫 2020-04 - 2020-09: conducted research on vison & language multimodal machine translation (One conference paper published).
  • 🤔 2019-09 - 2020-05: joined in TJUNLP lab and conducted research on vision & language commensense reasoning, finally stopped for the lack of computational resources.
  • 👯 2018-03 - 2019-09: joined in Optimization Machine Learning Team and studied Fuzzy Clustering Theory (major) and Mainfold Learning (secondary) (One journal paper published and another two journal papers collaborated).
  • 👯 2016-11 - 2018-09: joined the Drone Team in charge of the compute vision algorithm, won the second place in International Aerial Robotics Competition.

Representative Publications [google scholar]

  • Efficient Cluster-Based k-Nearest-Neighbor Machine Translation. ACL. 2022.
  • AdaST: Dynamically Adapting Encoder States in the Decoder for End-to-End Speech-to-Text Translation. ACL Findings. 2021.
  • Efficient Object-Level Visual Context Modeling for Multimodal Machine Translation: Masking Irrelevant Objects Helps Grounding. AAAI. 2021.
  • A Novel Fuzzy c-Means Clustering Algorithm Using Adaptive Norm. International Journal of Fuzzy Sytems. 2019.

GitHub Stats

WonderSeen's Projects

cmmlu icon cmmlu

CMMLU: Measuring massive multitask language understanding in Chinese

ewfcm icon ewfcm

My Reproduce EWFCM algorithm in matlab

optimized-lk-flow icon optimized-lk-flow

Here are a train of trials to explore that How lk_flow algorithm works and How to make it better.

pckmt icon pckmt

Source codes of ACL 2022-Efficient Cluster-based k-Nearest-Neighbor Machine Translation

sparse-points-gen-convex icon sparse-points-gen-convex

A demo to compute and optimize the convex-like contour of an image with sparse points and visualize the result.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.