Apple researchers have developed a new method for training large language models (LLMs) that seamlessly integrates both text and visual information. The company's findings, detailed in a research paper titled "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training," showcase a new approach to creating more intelligent and flexible AI systems. By utilizing a diverse dataset comprising image-caption pairs, interleaved image-text documents, and text-only data, Apple claims that...
In a bid to make its AI the best offering possible, Apple has revealed details about how it is approaching the development of its new MM1 AI model differently.
Elon Musk’s AI team has recently released Grok-1, a large language model with 314 billion parameters. This mixture-of-experts model, which has not yet been quantized, was put through its paces in various areas, including coding, logic, reasoning, and censorship. One of the most impressive aspects of Grok-1 is its ability to generate code […]
WSE-3, dubbed the 'fastest AI chip in the world,' powers Cerebras' CS-3 AI supercomputer.
Stability AI has unveiled StableCode Instruct 3B, a state-of-the-art coding language model designed to revolutionize the way developers approach coding tasks. This innovative tool promises to simplify the coding process across a multitude of programming languages, from Python and JavaScript to C++ and Rust. Its instruction tuning feature is engineered to enhance code generation, offering […]
According to Lightning AI, the compiler Thunder achieves up to a 40% speed-up for training LLMs when compared to unoptimized code in real-world scenarios.
MAINGEAR, in collaboration with Phison, has launched the MAINGEAR PRO AI workstations featuring aiDAPTIV+ technology, aimed at making Large Language Model (LLM) development and training more affordable for small and medium-sized businesses. These workstations are designed to provide supercomputer-level LLM training capabilities within a standard desktop PC footprint, reducing the cost and hardware requirements traditionally […]
Stability AI has released a Stable Video Diffusion-based generative AI model called Stable Video 3D (SV3D) to simplify the creation of 3D videos. SV3D comes in two variants that help users generate 3D videos from 2D images: SV3D-u and SV3D-p. “Today we are releasing Stable Video 3D (SV3D), a generative model based on Stable Video […]
"There is no fundamental reason that large language model developers can’t work in a way that respects creators’ rights"
It's been just over a year since the launch of ChatGPT sent the world AI-crazy. So it's no surprise that tech giants now want to integrate and promote their artificial intelligence capabilities.
Governments and companies are relying on safety-testing to reduce dangers from powerful AI systems. But the tests are far from ready.
Elon Musk’s xAI is open-sourcing its artificial intelligence (AI) chatbot, Grok, in a move that some say could democratize AI technology and foster innovation in commercial applications. The company has released the data for Grok to X Premium+ subscribers through GitHub and BitTorrent. This open release comes amid Musk’s ongoing criticism and legal action against OpenAI for not […]