News

The FrontierMath benchmark from Epoch AI tests generative models on difficult math problems. Find out how OpenAI’s o3 and ...
OpenAI delivered advanced ChatGPT reasoning models this month that are more capable than o1, but they also hallucinate more.
Having said that, the model lags behind OpenAI’s newly released o4-mini (high) model. In the LiveCodeBench coding benchmark, ...
Is OpenAI's o3 AI model part of an uneven rise of AGI in tools such as ChatGPT? If so, what are the implications for ...
Chinese tech company Alibaba released Qwen 3, a family of AI models that the company claims outperforms some of the best.
OpenAI CEO Sam Altman has admitted that recent updates have made ChatGPT overly sycophantic and "annoying" after users ...
With a ChatGPT Plus, Team or Enterprise account, you now have access to 100 messages a week with the ChatGPT-o3 model and a ...
OpenAI is streamlining its AI model lineup, retiring popular models like GPT-4 and GPT-4.5, all in anticipation of the launch ...
However, according to OpenAI’s internal tests, these new o3 and o4-mini reasoning models also hallucinate significantly more ...
Explore 9 transformative use cases of OpenAI’s o3 model, the AI assistant pushing boundaries in work and innovation. OpenAI’s o3 model ...
Chinese tech giant Alibaba has launched Qwen 3, a new series of AI models that it claims can rival or even surpass leading ...