Artificial Intelligence

Inside Botipedia: INSEAD’s AI Breakthrough That Could Redefine How We Access Information

From information gaps to global access — how AI is reshaping the pursuit of knowledge.

Updated

January 8, 2026 6:33 PM

Paper cut-outs of robots sitting on a pile of books. PHOTO: FREEPIK

Encyclopaedias have always been mirrors of their time — from heavy leather-bound volumes in the 19th century to Wikipedia’s community-edited pages online. But as the world’s information multiplies faster than humans can catalogue it, even open platforms struggle to keep pace. Enter Botipedia, a new project from INSEAD, The Business School for the World, that reimagines how knowledge can be created, verified and shared using artificial intelligence.

At its core, Botipedia is powered by proprietary AI that automates the process of writing encyclopaedia entries. Instead of relying on volunteers or editors, it uses a system called Dynamic Multi-method Generation (DMG) — a method that combines hundreds of algorithms and curated datasets to produce high-quality, verifiable content. This AI doesn’t just summarise what already exists; it synthesises information from archives, satellite feeds and data libraries to generate original text grounded in facts.

What makes this innovation significant is the gap it fills in global access to knowledge. While Wikipedia hosts roughly 64 million English-language entries, languages like Swahili have fewer than 40,000 articles — leaving most of the world’s population outside the circle of easily available online information. Botipedia aims to close that gap by generating over 400 billion entries across 100 languages, ensuring that no subject, event or region is overlooked.

"We are creating Botipedia to provide everyone with equal access to information, with no language left behind", says Phil Parker, INSEAD Chaired Professor of Management Science, creator of Botipedia and holder of one of the pioneering patents in the field of generative AI. "We focus on content grounded in data and sources with full provenance, allowing the user to see as many perspectives as possible, as opposed to one potentially biased source".

Unlike many generative AI tools that depend on large language models (LLMs), Botipedia adapts its methods based on the type of content. For instance, weather data is generated using geo-spatial techniques to cover every possible coordinate on Earth. This targeted, multi-method approach helps boost both the accuracy and reliability of what it produces — key challenges in today’s AI-driven content landscape.

Additionally, the innovation is also energy-efficient. Its DMG system operates at a fraction of the processing power required by GPU-heavy models like ChatGPT, making it a sustainable alternative for large-scale content generation.

By combining AI precision, linguistic inclusivity and academic credibility, Botipedia positions itself as more than a digital library — it’s a step toward universal, unbiased access to verified knowledge.

"Botipedia is one of many initiatives of the Human and Machine Intelligence Institute (HUMII) that we are establishing at INSEAD", says Lily Fang, Dean of Research and Innovation at INSEAD. "It is a practical application that builds on INSEAD-linked IP to help people make better decisions with knowledge powered by technology. We want technologies that enhance the quality and meaning of our work and life, to retain human agency and value in the age of intelligence".

By harnessing AI to bridge gaps of language, geography and credibility, Botipedia points to a future where access to knowledge is no longer a privilege, but a shared global resource.

Keep Reading

Artificial Intelligence

As AI Music Copyright Battles Grow, Companies Are Turning to Licensed Training Data

Sonilo and Shutterstock are betting that licensed training data could define the future of AI music.

Updated

May 13, 2026 3:39 PM

A human operating a digital turntable. PHOTO: UNSPLASH

As copyright disputes continue to grow around AI-generated music, Sonilo, the world’s first professionally licensed video-to-music AI platform, has partnered with Shutterstock to train its models on licensed music catalogs.

The agreement gives Sonilo access to Shutterstock’s music library for AI model training. According to the companies, it is Shutterstock’s first partnership with a video-to-music AI platform and the timing is significant. AI music companies are facing growing pressure over how their systems are trained. Artists and record labels have increasingly challenged the use of copyrighted music in AI datasets, especially when licensing agreements or compensation structures are unclear.

That tension has created a divide across the industry. Some companies have continued building models around scraped or disputed data. Others are trying to position licensing as part of the product itself.

Sonilo falls into the second group. The company says its models are trained only on licensed material where artists and rights holders have agreed to participate and receive compensation. The Shutterstock partnership strengthens that position while giving Sonilo access to a larger pool of commercially cleared music.

The collaboration also points to a broader change happening inside generative AI. As AI tools move into commercial production, companies are being pushed to show not just what their models can generate, but also where their training data comes from.

Sonilo’s platform is built around video rather than text prompts. The system analyses footage directly, studies pacing and emotional tone, then generates an original soundtrack to match the content. The company says this removes the need for manual music searches, syncing or editing workflows. The generated tracks are cleared for commercial use across social media, branded content and broadcast production.

Shawn Song, CEO of Sonilo, said: "Music has always been the last unsolved layer of video creation, and video has always carried its own soundtrack. We built Sonilo to hear it and compose from it, without a single text prompt. But how we build matters as much as what we build. While others have chosen to take artists' work without permission and charge creators for the privilege, we've chosen a different path—one where artists are compensated from day one. Partnering with Shutterstock reflects that standard. Every model we train meets a bar the music industry can stand behind, because the most innovative AI platforms don't have to come at the expense of the artists who make all of these possible."

For Shutterstock, the deal expands the company’s growing role in generative AI infrastructure. The company has increasingly focused on licensing content for AI systems across images, video and music.

Jessica April, Vice President of Data Licensing & AI Services at Shutterstock, said: "AI innovation depends on access to high-quality, rights-cleared content and trusted licensing partnerships. Sonilo's approach reflects the growing demand for responsibly sourced training data and commercially safe AI workflows. We're pleased to support companies building generative AI products with licensed content and scalable data solutions that help accelerate innovation while respecting creators and rights holders."

The partnership also comes as Sonilo expands into creator and developer ecosystems. Earlier this month, the company launched as a native node inside ComfyUI, an open-source AI workflow platform used by millions of creators. Sonilo also offers API access for integration into creator tools, video platforms, game engines and other AI systems.

As AI-generated music becomes more common across advertising, creator platforms and digital media, the industry’s focus is shifting beyond generation alone. Questions around licensing, ownership and compensation are increasingly shaping how AI music companies position themselves and build trust with creators.