Research

My research interests lie in robot learning systems that combine data-driven policies with structure from robotics, control, and physical interaction.

Main Research Themes

Long-Horizon Imitation Learning

I am interested in how robot policies can learn temporally extended manipulation skills from demonstrations. This includes behavior cloning, Action Chunking Transformers, VQ-style action abstractions, diffusion policies, and recovery-aware data collection.

Reinforcement Learning under Partial Observability

Many robotic tasks are partially observable because the robot only has local sensors, noisy perception, or incomplete state information. I study recurrent policies such as PPO-LSTM and PPO-GRU for such settings.

Vision-Language-Action Models

I am interested in how high-level language-conditioned reasoning can connect to low-level continuous robot control through action decoders, proprioception, temporal modeling, and grounding in real demonstrations.

Practical Robot Data Collection

A major bottleneck in robot learning is collecting useful, trustworthy, and scalable data. I am exploring low-cost data collection pipelines using SO101 arms, cameras, teleoperation, and structured evaluation.

Current Questions

  • How can ACT-style policies be made more robust to disturbances?
  • When should we combine imitation learning with motion planning or RMPFlow-style smoothing?
  • How much data is needed for reliable long-horizon manipulation on low-cost hardware?
  • How can recurrent RL policies form useful memory in sparse-reward POMDPs?
  • How should VLA models represent actions for precise robot control?