Selected Publications

Unlocking Emergent Modularity in Large Language Models
Unlocking Emergent Modularity in Large Language Models

NAACL 2024, Outstanding Paper Award, (6 out of 2434 submissions)

Jun 29, 2024

LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters
LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters

arXiv

May 31, 2024

Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training
Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training

NeurIPS 2024 Spotlight

May 30, 2024

Tracking single cell evolution via clock-like chromatin accessibility
Tracking single cell evolution via clock-like chromatin accessibility

Nature Biotechnology 2024

May 20, 2024

Think Before You Act: Decision Transformers with Working Memory
Think Before You Act: Decision Transformers with Working Memory

ICML 2024

May 19, 2024

HyperMoE: Towards Better Mixture of Experts via Transferring Among Experts
HyperMoE: Towards Better Mixture of Experts via Transferring Among Experts

ACL 2024

May 19, 2024

Massive Editing for Large Language Models via Meta Learning
Massive Editing for Large Language Models via Meta Learning

ICLR 2024

Jan 16, 2024

AI Alignment: A Comprehensive Survey
AI Alignment: A Comprehensive Survey

arXiv

Oct 30, 2023

When Do Graph Neural Networks Help with Node Classification: Investigating the Homophily Principle on Node Distinguishability
When Do Graph Neural Networks Help with Node Classification: Investigating the Homophily Principle on Node Distinguishability

NeurIPS 2023

Sep 21, 2023

Running Ahead of Evolution - AI based Simulation for Predicting Future High-risk SARS-CoV-2 Variants
Running Ahead of Evolution - AI based Simulation for Predicting Future High-risk SARS-CoV-2 Variants

IJHPCA, ACM Gordon Bell COVID Finalist 2022

Jun 20, 2023

Interactive Natural Language Processing
Interactive Natural Language Processing

arXiv

May 22, 2023

MUDiff: Unified Diffusion for Complete Molecule Generation
MUDiff: Unified Diffusion for Complete Molecule Generation

LoG 2023

May 1, 2023

Learning Multi-Objective Curricula for Robotic Policy Learning
Learning Multi-Objective Curricula for Robotic Policy Learning

CoRL 2022

Sep 5, 2022

Biological Sequence Design with GFlowNets
Biological Sequence Design with GFlowNets

ICML 2022

Mar 5, 2022

Unifying Likelihood-free Inference with Black-box Sequence Design and Beyond
Unifying Likelihood-free Inference with Black-box Sequence Design and Beyond

ICLR 2022 Spotlight

Jan 20, 2022

Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with 1/n Parameters
Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with 1/n Parameters

ICLR 2021 Outstanding Paper Award (8 out of 2997 submissions)

Apr 2, 2021

CoCon: A Self-Supervised Approach for Controlled Text Generation
CoCon: A Self-Supervised Approach for Controlled Text Generation

ICLR 2021

Apr 1, 2021

Interactive Machine Comprehension with Information Seeking Agents
Interactive Machine Comprehension with Information Seeking Agents

ACL 2020

Aug 27, 2019