|
MambaEye: A Size-Agnostic Visual Encoder with Causal Sequential Processing
Changho Choi, Minho Kim, Jinkyu Kim
CVPR Findings, 2026
arxiv /
code /
A causal sequential visual encoder that achieves input-size agnostic charateristic like human vision by using a pure Mamba2 backbone. We designed a strictly unidirectional approach and relative move embeddings, MambaEye ensures translation invariance and adaptability to arbitrary resolutions.
|
|
SAOInstruct: Free-form Audio Editing using Natural Language Instructions
Michael Ungersböck, Florian Grötschla, Luca A. Lanzendörfer, June Young Yi, Changho Choi, Roger Wattenhofer
NeurIPS, 2025
arxiv /
code /
website /
A model based on Stable Audio Open for flexible, free-form natural language audio editing. Trained on a novel dataset of audio editing triplets, it generalizes to real-world audio and unseen instructions.
|
|
LatentSwap: An Efficient Latent Code Mapping Framework for Face Swapping
Changho Choi, Minho Kim, Junhyeok Lee, Hyoung-Kyu Song, Younggeun Kim, Seungryong Kim
Arxiv preprint, 2024
arxiv /
code /
A lightweight face swapping framework that generates latent codes for pre-trained generators without external datasets. It features a fast training process with a simple three-term loss, producing high-resolution results comparable to state-of-the-art models.
|
Work & Research Experience
Besides my work on the research and publications above, a sampling of my past works
|
|
Vision & AI Lab, Korea University
Undergraduate Researcher
2025-03 ~ 2025-12
Conducted research on causal sequential visual encoders under Prof. Jinkyu Kim. Proposed novel vision encoder architecture, MambaEye.
|
|
OptimizerAI
AI Researcher
2024-10 ~ 2024-12
Contributed to research and development for the Text-to-SFX Model v2, achieving state-of-the-art human preference win rates.
|
|
Pion Corporation (vcat.ai)
AI Engineer
2023-06 ~ 2024-10
Developed deep learning models for optimal product image recommendation and online image clustering using CLIP and Triton Inference Server.
|
|
Maum.AI
AI Scientist
2019-03 ~ 2023-05
Led the computer vision research team and worked on various CV tasks (Super Resolution, Face Swapping) through open-source utilization and paper implementation.
|
Other Projects
These include coursework, side projects and unpublished research work.
|
|
HifiFace PyTorch Implementation
projects
2021-12
code /
Developed an unofficial PyTorch implementation of HifiFace, the state-of-the-art face swapping model at the time (Stars: 300+).
|
|
FaceShifter Pytorch Implementation
projects
2020-10
code /
Implemented FaceShifter, one of the first GAN-based face swapping models, from scratch in PyTorch (Stars: 600+).
|
|