<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Module 2: Multi-Vector Representations for Multi-Modal Data on Qdrant - Vector Search Engine</title><link>https://deploy-preview-2256--condescending-goldwasser-91acf0.netlify.app/course/multi-vector-search/module-2/</link><description>Recent content in Module 2: Multi-Vector Representations for Multi-Modal Data on Qdrant - Vector Search Engine</description><generator>Hugo</generator><language>en-us</language><managingEditor>info@qdrant.tech (Andrey Vasnetsov)</managingEditor><webMaster>info@qdrant.tech (Andrey Vasnetsov)</webMaster><atom:link href="https://deploy-preview-2256--condescending-goldwasser-91acf0.netlify.app/course/multi-vector-search/module-2/index.xml" rel="self" type="application/rss+xml"/><item><title>How ColPali Models Work</title><link>https://deploy-preview-2256--condescending-goldwasser-91acf0.netlify.app/course/multi-vector-search/module-2/how-colpali-works/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><author>info@qdrant.tech (Andrey Vasnetsov)</author><guid>https://deploy-preview-2256--condescending-goldwasser-91acf0.netlify.app/course/multi-vector-search/module-2/how-colpali-works/</guid><description>&lt;div class="date">
 &lt;img class="date-icon" src="https://deploy-preview-2256--condescending-goldwasser-91acf0.netlify.app/icons/outline/date-blue.svg" alt="Calendar" /> Module 2 
&lt;/div>

&lt;h1 id="how-colpali-models-work">How ColPali Models Work&lt;/h1>
&lt;p>ColPali extends the late interaction paradigm from text to visual documents. It can process PDFs, images, and scanned documents, generating multi-vector representations that capture both textual and visual information.&lt;/p>
&lt;p>Understanding ColPali&amp;rsquo;s architecture helps you leverage its full potential for multi-modal document retrieval.&lt;/p>
&lt;hr>
&lt;div class="video">
&lt;iframe
 src="https://www.youtube-nocookie.com/embed/Fai9aY1PMCA?rel=0"
 frameborder="0"
 allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share"
 referrerpolicy="strict-origin-when-cross-origin"
 allowfullscreen>
&lt;/iframe>
&lt;/div>
&lt;hr>
&lt;p>&lt;strong>Follow along in Colab:&lt;/strong> &lt;a href="https://colab.research.google.com/github/qdrant/examples/blob/master/course-multi-vector-search/module-2/how-colpali-works.ipynb">
&lt;img src="https://colab.research.google.com/assets/colab-badge.svg" style="display:inline; margin:0;" alt="Open In Colab"/>
&lt;/a>&lt;/p>
&lt;hr>
&lt;h2 id="from-text-to-visual-documents">From Text to Visual Documents&lt;/h2>
&lt;p>&lt;strong>What about documents that aren&amp;rsquo;t just text?&lt;/strong> PDFs often contain diagrams, tables, charts, equations, and complex layouts where the visual presentation carries as much meaning as the text itself.&lt;/p></description></item><item><title>ColPali Family Overview</title><link>https://deploy-preview-2256--condescending-goldwasser-91acf0.netlify.app/course/multi-vector-search/module-2/colpali-family/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><author>info@qdrant.tech (Andrey Vasnetsov)</author><guid>https://deploy-preview-2256--condescending-goldwasser-91acf0.netlify.app/course/multi-vector-search/module-2/colpali-family/</guid><description>&lt;div class="date">
 &lt;img class="date-icon" src="https://deploy-preview-2256--condescending-goldwasser-91acf0.netlify.app/icons/outline/date-blue.svg" alt="Calendar" /> Module 2 
&lt;/div>

&lt;h1 id="colpali-family-overview">ColPali Family Overview&lt;/h1>
&lt;p>The ColPali is not only the name of a model. Still, it is also often used to refer to an entire family of models that convert images and text into multi-vector representations, based on Vision Language Models.&lt;/p>
&lt;p>Let&amp;rsquo;s explore what the options are and which model to choose depending on the data you work with.&lt;/p>
&lt;hr>
&lt;div class="video">
&lt;iframe
 src="https://www.youtube-nocookie.com/embed/5ypH0t_X-4k?rel=0"
 frameborder="0"
 allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share"
 referrerpolicy="strict-origin-when-cross-origin"
 allowfullscreen>
&lt;/iframe>
&lt;/div>
&lt;hr>
&lt;p>The ColPali family includes several model variants. When selecting a model for your application, you&amp;rsquo;ll need to consider factors like model size, supported languages, computational requirements, and licensing constraints - each variant offers different trade-offs along these dimensions.&lt;/p></description></item><item><title>Visual Interpretability of ColPali</title><link>https://deploy-preview-2256--condescending-goldwasser-91acf0.netlify.app/course/multi-vector-search/module-2/visual-interpretability/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><author>info@qdrant.tech (Andrey Vasnetsov)</author><guid>https://deploy-preview-2256--condescending-goldwasser-91acf0.netlify.app/course/multi-vector-search/module-2/visual-interpretability/</guid><description>&lt;div class="date">
 &lt;img class="date-icon" src="https://deploy-preview-2256--condescending-goldwasser-91acf0.netlify.app/icons/outline/date-blue.svg" alt="Calendar" /> Module 2 
&lt;/div>

&lt;h1 id="visual-interpretability-of-colpali">Visual Interpretability of ColPali&lt;/h1>
&lt;p>&lt;strong>Why did this document match my query?&lt;/strong> Unlike traditional black-box embedding models that produce a single opaque vector, ColPali&amp;rsquo;s multi-vector architecture offers something remarkable: you can see exactly where the model &amp;ldquo;looks&amp;rdquo; when matching a query to a document.&lt;/p>
&lt;p>This visual interpretability is invaluable for building trust in multi-modal search systems, debugging unexpected results, and understanding model behavior and limitations.&lt;/p>
&lt;hr>
&lt;div class="video">
&lt;iframe
 src="https://www.youtube-nocookie.com/embed/sQcuYWMS4bo?rel=0"
 frameborder="0"
 allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share"
 referrerpolicy="strict-origin-when-cross-origin"
 allowfullscreen>
&lt;/iframe>
&lt;/div>
&lt;hr>
&lt;p>&lt;strong>Follow along in Colab:&lt;/strong> &lt;a href="https://colab.research.google.com/github/qdrant/examples/blob/master/course-multi-vector-search/module-2/visual-interpretability.ipynb">
&lt;img src="https://colab.research.google.com/assets/colab-badge.svg" style="display:inline; margin:0;" alt="Open In Colab"/>
&lt;/a>&lt;/p></description></item></channel></rss>