MM-Interleaved is a new end-to-end generative model for interleaved image-text modeling. It introduces a novel fine-grained multi-modal feature synchronizer named MMFS, allowing it to recognize ...
This update is focused on the look-and-feel of the book, not on the technical content. This is in preparation for a printed edition of this book.