Audio Representation Learning


Large-Scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation
Ke Chen*, Yusong Wu*, Tianyu Zhang*, Yuchen Hui*, Taylor Berg-Kirkpatrick, Shlomo Dubnov
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
[code] [dataset] [api] [pip library]

HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection
Ke Chen, Xingjian Du, Bilei Zhu, Zejun Ma, Taylor Berg-Kirkpatrick, Shlomo Dubnov
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022 (oral presentation)
[code]

Bytecover2: Towards Dimensionality Reduction of Latent Embedding for Efficient Cover Song Identification
Xingjian Du, Ke Chen, Zijie Wang, Bilei Zhu, Zejun Ma
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022

Learning Audio Embeddings with User Listening Data for Content-based Music Recommendation
Ke Chen, Beici Liang, Xiaoshuan Ma, Minwei Gu
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021


Audio Source Separation


MDX-GAN: Enhancing Perceptual Quality in Multi-Class Source Separation via Adversarial Training
Ke Chen, Jiaqi Su, Zeyu Jin
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2024 (oral presentation)

Universal Source Separation with Weakly Labelled Data
Qiuqiang Kong*, Ke Chen*, Haohe Liu, Xingjian Du, Taylor Berg-Kirkpatrick, Shlomo Dubnov, Mark D. Plumbley
In Submission to IEEE Transactions on Pattern Analysis and Machine Intelligence, TPAMI
[code] [demo]

PAᗧ-HuBERT: Self-Supervised Music Source Separation via Primitive Auditory Clustering and Hidden-Unit BERT
Ke Chen, Gordon Wichern, François G. Germain, Jonathan Le Roux
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023 (SASB Workshop)

Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data
Ke Chen, Xingjian Du, Bilei Zhu, Zejun Ma, Taylor Berg-Kirkpatrick, Shlomo Dubnov
Association for the Advancement of Artificial Intelligence Conference, AAAI 2022 (oral presentation, ac: 15%)
[code] [api] [intro video] [talk]

Improving Choral Music Separation through Expressive Synthesized Data from Sampled Instruments
Ke Chen, Hao-Wen Dong, Yi Luo, Julian McAuley, Taylor Berg-Kirkpatrick, Miller Puckette, Shlomo Dubnov
International Society for Music Information Retrieval Conference, ISMIR 2022
[code] [demo] [dataset]


Algorithmic Composition and Music Generation


MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies
Ke Chen*, Yusong Wu*, Haohe Liu*, Marianna Nezhurina, Taylor Berg-Kirkpatrick, Shlomo Dubnov
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2024 (oral presentation)
[code] [api][demo]

Multitrack Music Transformer: Learning Long-Term Dependencies in Music with Diverse Instruments
Hao-Wen Dong, Ke Chen, Shlomo Dubnov, Julian McAuley, Taylor Berg-Kirkpatrick
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
[code] [demo] [intro video]

Deep Music Information Dynamics: Novel Framework for Reduced Neural-Network Music Representation with Applications to MIDI and Audio Analysis and Improvisation
Shlomo Dubnov, Ke Chen, Kevin Huang
Journal of Creative Music Systems, JCMS 2022
[code]

Music SketchNet: Controllable Music Generation via Factorized Representations of Pitch and Rhythm
Ke Chen, Cheng-i Wang, Taylor Berg-Kirkpatrick, Shlomo Dubnov
International Society for Music Information Retrieval Conference, ISMIR 2020
[code] [demo] [intro video]

POP909: A Pop-song Dataset for Music Arrangement Generation
Ke Chen*, Ziyu Wang*, Junyan Jiang, Yiyi Zhang, Maoran Xu, Shuqi Dai, Guxian Bin, Gus Xia International Society for Music Information Retrieval Conference, ISMIR 2020
[dataset] [intro video]

MusPy: A Toolkit for Symbolic Music Generation
Hao-Wen Dong, Ke Chen, Julian McAuley, Taylor Berg-Kirkpatrick
International Society for Music Information Retrieval Conference, ISMIR 2020
[code] [pip library] [intro video]

The Effect of Explicit Structure Encoding of Deep Neural Networks for Symbolic Music Generation
Ke Chen, Weilin Zhang, Shlomo Dubnov, Gus Xia, Wei Li
International Workshop on Multilayer Music Representation and Processing, MMRP 2019
[code]


Music Information Retrieval


AudioSR: Versatile Audio Super-resolution at Scale
Haohe Liu, Ke Chen, Qiao Tian, Wenwu Wang, Mark D. Plumbley
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2024 (oral presentation)
[code] [demo] [api]

Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction
Keren Shao*, Ke Chen*, Taylor Berg-Kirkpatrick, Shlomo Dubnov
International Society for Music Information Retrieval Conference, ISMIR 2023
[code]

A posthumous improvisation by Toots Thielemans
Marc Chemillier, Ke Chen, Mikhail Malt, Shlomo Dubnov
Toots Thielemans (1922-2016). A Century of Music across Europe and America, 2022.

TONET: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music
Ke Chen, Shuai Yu, Cheng-i Wang, Wei Li, Taylor Berg-Kirkpatrick, Shlomo Dubnov
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022
[code]

Large Vocabulary Chord Transcription via Chord Structure Decomposition
Junyan Jiang, Ke Chen, Wei Li, Gus Xia
International Society for Music Information Retrieval Conference, ISMIR 2019
[code]

Contact Me