17
Oct
Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Zyphra Technologies, the company working on a multimodal agent system combining advanced research in next-gen SSM hybrid architectures, long-term memory and reinforcement learning, just released Zyda-2, an open pretraining dataset comprising 5 trillion tokens. The offering comes as the successor of the original Zyda dataset. It is five times larger in size and covers a vast range of topics and domains to ensure a high level of diversity and quality – which is critical for training robust and competitive language models. …