large language models

Distill Your LLMs and Surpass Their Performance: spaCy’s Creator at InfoQ DevSummit Munich

Distill Your LLMs and Surpass Their Performance: spaCy’s Creator at InfoQ DevSummit Munich

In her presentation at the inaugural edition of InfoQ Dev Summit Munich, Ines Montani built on top of the presentation she had earlier this year at QCon London and provided the audience with practical solutions for using the latest state-of-the-art models in real-world applications and distilling their knowledge into smaller and faster components that you can run and maintain in-house. She began by stating that using black box models hidden behind APIs would prevent us from satisfying the properties of good software: modular, transparent, explainable, data-private, reliable, and affordable. Further, Montani pointed out that GenAI can be helpful in multiple situations…
Read More
The AI Revolution Will Not Be Monopolized

The AI Revolution Will Not Be Monopolized

Key Takeaways Open-source initiatives are pivotal in democratizing AI technology, offering transparent, extensible tools that empower users. The open-source community quickly turns new research into practical AI tools, making them stronger and more useful. Distilling large language models during development enables the creation of accurate, fast, and private task-specific models, reducing reliance on general-purpose APIs. Effective regulation should distinguish between human-facing AI applications and underlying machine-facing components, ensuring innovation while addressing concerns about data privacy, security, and equitable access. This is a summary of a talk that Ines Montani gave at QCon London in April 2024. Large language models…
Read More
Ines Montani at QCon London: Economies of Scale Can’t Monopolise the AI Revolution

Ines Montani at QCon London: Economies of Scale Can’t Monopolise the AI Revolution

During her presentation at QCon London, Ines Montani, co-founder and CEO of explosion.ai (the maker of spaCy), stated that economies of scale are not enough to create monopolies in the AI space and that open-source techniques and models will allow everybody to keep up with the "Gen AI revolution". Montani opened her presentation by asking for a show of hands to identify the open-source users in the audience. The vast majority of the audience raised their hand, easily demonstrating that open-source is ubiquitous ("it would be easier to ask who doesn’t use open-source’"). She pointed out the multiple benefits of the…
Read More
Ghostbuster: Detecting Text Ghostwritten by Large Language Models

Ghostbuster: Detecting Text Ghostwritten by Large Language Models

The structure of Ghostbuster, our new state-of-the-art method for detecting AI-generated text. Large language models like ChatGPT write impressively well—so well, in fact, that they’ve become a problem. Students have begun using these models to ghostwrite assignments, leading some schools to ban ChatGPT. In addition, these models are also prone to producing text with factual errors, so wary readers may want to know if generative AI tools have been used to ghostwrite news articles or other sources before trusting them. What can teachers and consumers do? Existing tools to detect AI-generated text sometimes do poorly on data that differs from…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.