Link to paper
The full paper is available here.
You can also find the paper on PapersWithCode here.
Abstract
- Machine learning methods are used to build a mapping from source data to target data.
- Target data is high-dimensional and complex, making it difficult to learn the mapping.
- Regeneration learning is a learning paradigm that generates an abstraction of the target data, then uses it to generate the target data.
- Regeneration learning is a counterpart of traditional representation learning.
- Regeneration learning can be used for various data generation tasks.
Paper Content
Formulations of regeneration learning
- Regeneration learning bridges the information mismatch between X and Y.
- Regeneration learning has three steps.
Connections to other methods
- Template-based methods extract template from target data and regenerate target data from template
- Vocoding methods convert speech waveform to spectrograms and regenerate waveform from spectrograms
- Grapheme/phoneme conversions convert text/character sequence to phoneme sequence and regenerate text/character sequence from phoneme sequence
- Auto-encoding methods convert target data to representations and regenerate target data from representations
- Easy Mapping learning is a sequence-to-sequence task where X and Y contain comparable information
- Denoising diffusion probabilistic models add noise to original data Y
- Iterative-based non-autoregressive sequence generation is an extended version of regeneration learning
- Post-refine methods improve quality of generated Y but not considered regeneration learning
Regeneration learning vs. representation learning
- Regeneration learning is a type of representation learning for data generation
- Regeneration learning and traditional representation learning are counterparts of each other
- Regeneration learning handles abstraction of target data for data generation, traditional representation learning handles abstraction of source data for understanding
- Both processes can be learned in a self-supervised way
- Mapping from X to Y is simpler in regeneration and traditional representation learning than direct mapping from X to Y
Applications of regeneration learning
- Text-to-Speech Synthesis
- Automatic Speech Recognition
- Text Generation
- Melody Generation
- Talking-Head Video Synthesis
- Image/Video/Sound Generation
- Regeneration Learning used when target data is too high-dimensional or complex, source and target data have uncorrelated information, or lack of paired data
Research opportunities on regeneration learning
- RQ1: How to learn Y from Y?
- RQ2: How to design better learning paradigms to learn Y?
- RQ6: How to determine the format of Y?
- RQ7: How to design better generative models to learn X → Y and Y → Y mapping?
- RQ8: How to leverage the assumption of semantic conversion and detail rendering?
- RQ9: How to leverage self-supervised learning for Y → Y mapping?
- RQ10: How to reduce the training-inference mismatch in regeneration learning?
Conclusion
- Presentation of a learning paradigm called regeneration learning for data generation tasks
- Regeneration learning handles the abstraction of the target data for data generation while representation learning handles the abstraction of source data for data understanding
- Connections of regeneration learning to other methods
- Variety of data generation tasks that can benefit from regeneration learning
- Comparison between regeneration learning and traditional representation learning on X → Y
- How to learn X → Y and Y → Y
- How to reduce training-inference mismatch in X → Y → Y
- Generate target Y from Y
- Infer Y from X via Y
- Typical data generation tasks that leverage regeneration learning