All posts tagged: ai model

Nvidia Releases Cosmos-Transfer1 AI Model That Can Be Used for Simulation-Based Training for Robots

Nvidia Releases Cosmos-Transfer1 AI Model That Can Be Used for Simulation-Based Training for Robots

Nvidia released a new artificial intelligence (AI) model last week that can be used to train robots on simulation. Dubbed Cosmos-Transfer 1, the new world generation large language model (LLM) is aimed at AI-powered robotics hardware, also known as physical AI. The company has released the model in open source with a permissive licence, and interested individuals can download it from popular online repositories. The Santa Clara-based tech giant highlighted that the main advantage of the latest AI model is that users will have granular control over the generated simulations. Nvidia Releases AI Model to Train Robots Simulation-based robotics training has gained wind in recent times due to the advancement in generative AI technology. This specific branch of robotics deals with hardware that uses an AI for its brain. Essentially, the training method trains the brain of the machine in various real-world scenarios so that it can handle a wider range of tasks. This is a big improvement compared to current robots in factories that are designed to complete a single task. Nvidia’s Cosmos-Transfer1 is …

Google DeepMind Unveils Gemini Robotics AI Models That Can Control Robots in the Real World

Google DeepMind Unveils Gemini Robotics AI Models That Can Control Robots in the Real World

Google DeepMind unveiled two new artificial intelligence (AI) models on Thursday, which can control robots to make them perform a wide range of tasks in real-world environments. Dubbed Gemini Robotics and Gemini Robotics-ER (embodied reasoning), these are advanced vision language models capable of displaying spatial intelligence and performing actions. The Mountain View-based tech giant also revealed that it is partnering with Apptronik to build Gemini 2.0-powered humanoid robots. The company is also testing these models to evaluate them further, and understand how to make them better. Google DeepMind Unveils Gemini Robotics AI Models In a blog post, DeepMind detailed the new AI models for robots. Carolina Parada, the Senior Director and Head of Robotics at Google DeepMind, said that for AI to be helpful to people in the physical world, they would have to demonstrate “embodied” reasoning — the ability to interact and understand the physical world and perform actions to complete tasks. Gemini Robotics, the first of the two AI models, is an advanced vision-language-action (VLA) model which was built using the Gemini 2.0 …

Sudowrite Launches Muse AI Model That Can Generate Narrative-Driven Fiction

Sudowrite Launches Muse AI Model That Can Generate Narrative-Driven Fiction

Sudowrite, a Los Angeles-based AI startup, launched a new fiction-writing artificial intelligence (AI) model last week. Dubbed Muse, the AI model is built with a single function of generating creative writing pieces focused on fiction. The new model will be offered on the Sudowrite platform, which also offers several other writing-focused AI models. The startup claims that the model was specifically trained to remove clichés from the output to ensure that every generation is a unique prose. Sudowrite also offers free credits to users to try out the AI model. Sudowrite Introduces Muse AI Model In a post on X (formerly known as Twitter), Sudowrite’s Founder James Yu announced the launch of the Muse AI model. He highlighted that the model was in the testing phase for the last few months. The company claimed to have Muse with “hundreds of authors” to ensure the high-quality of output. The model’s writing samples can be found on this web page. While generative AI models are capable of creative writing, they come with certain limitations. AI models are …

Alibaba’s Qwen Team Releases QwQ-32B Open-Source Reasoning Model, Said to Perform Similar to DeepSeek-R1

Alibaba’s Qwen Team Releases QwQ-32B Open-Source Reasoning Model, Said to Perform Similar to DeepSeek-R1

Alibaba’s Qwen Team, a division tasked with developing artificial intelligence (AI) models, released the QwQ-32B AI model on Wednesday. It is a reasoning model based on extended test time compute with visible chain-of-thought (CoT). The developers claim that despite being smaller in size compared to the DeepSeek-R1, the model can match its performance based on benchmark scores. Like other AI models released by the Qwen Team, the QwQ-32B is also an open-source AI model, however, it is not fully open-sourced. QwQ-32B Reasoning AI Model Released In a blog post, Alibaba’s Qwen Team detailed the QwQ-32B reasoning model. QwQ (short for Qwen with Questions) series AI models were first introduced by the company in November 2024. These reasoning models were designed to offer an open-source alternative for the likes of OpenAI’s o1 series. The QwQ-32B is a 32 billion parameter model developed by scaling reinforcement learning (RL) techniques. Explaining the training process, the developers said that the RL scaling approach was added to a cold-start checkpoint. Initially, RL was used only for coding and mathematics-related tasks, …

Researchers Create a Low-Cost Open-Source AI Model to Analyse How OpenAI’s o1 Reasons

Researchers Create a Low-Cost Open-Source AI Model to Analyse How OpenAI’s o1 Reasons

Researchers from Stanford University and Washington University have developed an open-source artificial intelligence (AI) model that is comparable in performance to OpenAI’s o1 model. The main objective of the researchers was not to create a powerful reasoning-focused model but to understand how the San Francisco-based AI firm instructed its o1 series models to perform test time scaling. Notably, the researchers were able to showcase the methodology and replicate the model’s behaviour at an extremely low cost while using far fewer compute resources. Researchers Develop S1-32B AI Model The researchers detailed the methodology and process of developing the model in a study published in the pre-print journal arXiv. The process involved creating a synthetic dataset from a different AI model and using several new techniques such as ablation and supervised fine-tuning (SFT). The model is available in a GitHub listing. It should be noted that the AI model was not built from scratch. The developers used the Qwen2.5-32B-Instruct and distilled it to create the s1-32B large language model (LLM). Released in September 2024, the model is …

Mistral Small 3 Open-Source AI Model Introduced, Outperforms OpenAI’s GPT-4o Mini

Mistral Small 3 Open-Source AI Model Introduced, Outperforms OpenAI’s GPT-4o Mini

Mistral, the Paris-based artificial intelligence (AI) firm, released the Mistral Small 3 AI model on Thursday. The company, known for its open-source large language models (LLMs), has also made the latest AI model available on Hugging Face as well as several other platforms. Mistral claimed that the latest model was built with processing speed, efficiency, and performance in mind, and it can outperform models double its size. The AI firm’s internal testing found the model to offer better performance than OpenAI’s GPT-4o mini. Mistral Small 3 AI Model Released In a newsroom post, the French AI firm detailed the new AI model. Mistral Small 3 is a latency-optimised model with 24 billion parameters. The LLM is being released with both a pre-trained and an instruction-tuned checkpoint to cater to a wide range of tasks. The AI model is available under the Apache 2.0 licence for academic and commercial usage. Mistral highlighted that it is moving away from the Mistral Research Licence (MRL) model that only allows academic and research-related usage. The company stated that the …

OpenAI’s o3 Model Claims Human-Level Intelligence on Benchmark, But It Might Not Be That Smart

OpenAI’s o3 Model Claims Human-Level Intelligence on Benchmark, But It Might Not Be That Smart

OpenAI unveiled the reasoning-focused o3 series of artificial intelligence (AI) models last month. During a live stream, the company shared the benchmark scores of the model based on internal testing. While all of the shared scores were impressive and highlighted the improved capabilities of the successor to o1, one benchmark score stood out. On the ARC-AGI benchmark, the large language model (LLM) scored 85 percent, beating the previous best score by a 30 percent margin. Interestingly, this score is also on par with what an average human scored on the test. OpenAI Scores 85 Percent on ARC-AGI Benchmark However, just because o3 scored such a high score on the test, does it mean its intelligence is equal to that of an average human? This would be easier to answer if the AI model was released in the public domain and we could test it out. Since OpenAI has not disclosed anything about the model’s architecture, training techniques, or datasets, it is difficult to conclusively claim anything. There are certain things that we do know about the …

DeepSeek-V3 Open-Source AI Model With Mixture-of-Experts Architecture Released

DeepSeek-V3 Open-Source AI Model With Mixture-of-Experts Architecture Released

DeepSeek, a Chinese artificial intelligence (AI) firm, released the DeepSeek-V3 AI model on Thursday. The new open-source large language model (LLM) features a massive 671 billion parameters, surpassing the Meta Llama 3.1 model which has 405 billion parameters. Despite its size, the researchers claimed that the LLM is focused towards efficiency with its mixture-of-expert (MoE) architecture. Due to this, the AI model can only activate specific parameters relevant to the task provided and ensure efficiency and accuracy. Notably, it is a text-based model and does not have multimodal capabilities. DeepSeek-V3 AI Model Released The open-source DeepSeek-V3 AI model is currently being hosted on Hugging Face. According to the listing, the LLM is geared towards efficient inference and cost-effective training. For this, the researchers adopted Multi-head Latent Attention (MLA) and DeepSeekMoE architectures. Essentially, the AI model only activates the parameters which are relevant to the topic of the prompt, ensuring faster processing and higher accuracy compared to typical models of this size. Pre-trained on 14.8 trillion tokens, the DeepSeek-V3 uses techniques such as supervised fine-tuning and …

Alibaba’s Qwen Team Releases QVQ-72B Open Source Vision AI Model in Preview

Alibaba’s Qwen Team Releases QVQ-72B Open Source Vision AI Model in Preview

Alibaba’s Qwen research team has released another open-source artificial intelligence (AI) model in preview. Dubbed QVQ-72B, it is a vision-based reasoning model that can analyse visual information from images and understand the context behind them. The tech giant has also shared benchmark scores of the AI model and highlighted that on one specific test, it was able to outperform OpenAI’s o1 model. Notably, Alibaba has released several open-source AI models recently, including the QwQ-32B and Marco-o1 reasoning-focused large language models (LLMs). Alibaba’s Vision-Based QVQ-72B AI Model Launched In a Hugging Face listing, Alibaba’s Qwen team detailed the new open-source AI model. Calling it an experimental research model, the researchers highlighted that the QVQ-72B comes with enhanced visual reasoning capabilities. Interestingly, these are two separate branches of performance, that the researchers have combined in this model. Vision-based AI models are plenty. These include an image encoder and can analyse the visual information and context behind them. Similarly, reasoning-focused models such as o1 and QwQ-32B come with test-time compute scaling capabilities that allow them to increase the …

Gemini 1.5 Flash-8B With Lowest Token Cost Among Gemini Family Now Available

Gemini 1.5 Flash-8B, the latest entrant in the Gemini family of artificial intelligence (AI) models, is now generally available for production use. On Thursday, Google announced the general availability of the model, highlighting that it was a smaller and faster version of the Gemini 1.5 Flash which was introduced at Google I/O. Due to being fast, it has a low latency inference and more efficient output generation. More importantly, the tech giant stated that the Flash-8B AI model is the “lowest cost per intelligence of any Gemini model”. Gemini 1.5 Flash-8B Now Generally Available In a developer blog post, the Mountain View-based tech giant detailed the new AI model. The Gemini 1.5 Flash-8B was distilled from the Gemini 1.5 Flash AI model, which was focused on faster processing and more efficient output generation. The company now claims that Google DeepMind developed this even smaller and faster version of the AI model in the last few months. Despite being a smaller model, the tech giant claims that it “nearly matches” the performance of the 1.5 Flash …