DeepThought-8B is a small "reasoning" model built on LLaMA-3.1 8B that can work through decision-making processes step by step, much as OpenAI's o1 does, but in a far smaller package.
Requiring a "mere" 16GB of VRAM, DeepThought-8B is particularly aimed at step-by-step problem-solving, coding and mathematical tasks, and instruction-following. Ruliad, the company behind it, says its reasoning capabilities rival larger models.
This release represents our first step toward making AI reasoning more transparent and controllable, while demonstrating that smaller, more efficient models can achieve sophisticated reasoning capabilities that rival models of much larger scales.
As Ruliad explains, DeepThought-8B can break down the process of finding the solution to a problem into a sequence of steps, each of a specific type. The first step in the process is problem understanding, followed by data gathering, analysis, calculation, verification, conclusion drawing, and implementation. The actual number of steps varies with the complexity of the given task. At the end of the process, DeepThought outputs a JSON document detailing all the steps, which makes it possible for users to understand and validate the reasoning.
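Ruliad has not published the exact output schema, but based on the step types described above, a reasoning trace might look something like the following sketch. The field names ("steps", "type", "thought") and the sample contents are illustrative assumptions, not a documented format:

```python
import json

# Hypothetical reasoning trace in the style the article describes.
# The step types come from Ruliad's description; the field names
# are illustrative, not Ruliad's published schema.
trace = {
    "steps": [
        {"type": "problem_understanding",
         "thought": "The user asks which is heavier: 2kg of feathers or 1kg of lead."},
        {"type": "data_gathering",
         "thought": "Only the two stated masses are needed: 2kg and 1kg."},
        {"type": "calculation",
         "thought": "2kg > 1kg, regardless of material."},
        {"type": "verification",
         "thought": "Density is irrelevant; mass alone determines weight here."},
        {"type": "conclusion_drawing",
         "thought": "2kg of feathers is heavier than 1kg of lead."},
    ]
}

print(json.dumps(trace, indent=2))
```

Because the full trace is emitted as JSON, a caller can inspect or audit each step programmatically rather than parsing free-form text.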
Ruliad emphasizes users' ability to customize the model's reasoning patterns without retraining, as demonstrated by the deepthought_inference tool included with the model.
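Ruliad has not documented the tool's interface here, so the following is only a hypothetical sketch of how reasoning patterns could be steered at the prompt level rather than through retraining; the function name and prompt wording are assumptions, and this is not the actual deepthought_inference API:

```python
# Hypothetical sketch: steering a model's reasoning pattern via the system
# prompt instead of retraining. NOT the deepthought_inference API.
DEFAULT_STEPS = [
    "problem_understanding", "data_gathering", "analysis",
    "calculation", "verification", "conclusion_drawing", "implementation",
]

def build_system_prompt(steps=DEFAULT_STEPS):
    """Assemble a system prompt asking for one JSON object per reasoning step."""
    numbered = "\n".join(f"{i}. {name}" for i, name in enumerate(steps, 1))
    return ("Solve the task by reasoning through the following step types, "
            "emitting one JSON object per step:\n" + numbered)

# A user could, for example, drop the final step for pure analysis tasks:
print(build_system_prompt(DEFAULT_STEPS[:-1]))
```

The point of the sketch is that the step sequence is plain data: reordering, removing, or adding step types changes the model's output structure without touching its weights.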
Ruliad has not disclosed benchmark scores, inviting users to test the model and share their findings with the community. However, the company has published a comparison of the model's performance with other major models.
Interestingly, while DeepThought-8B shows similar performance to LLaMA-3.1-8B-Instruct on coding and math tasks, it outperforms it on "reasoning" tasks. Ruliad's model also outperforms Qwen-2-72B, despite the latter being larger. On the other hand, GPT-4o, o1-mini, and Claude-3.5-Sonnet score better on all counts, including reasoning; this should come as no surprise, given their far larger scale.
Several Hacker News readers tried the model out to test its performance. While it failed at finding two primes whose sum is 123, and at counting the "r"s in "strawerberry" and similar misspelled variants of "strawberry", it correctly answered "which is heavier: 2kg of feathers or 1kg of lead?". This might sound trivial, but it appears to be a challenging question for small LLMs such as LLaMA-8B, Gemma-2-9B, and others.
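The primes question is a small trap: 123 is odd, so one of the two primes would have to be the even prime 2, leaving 121 = 11 * 11, which is composite. A few lines of Python (using the word and number from the examples above) make both checks concrete:

```python
def is_prime(n: int) -> bool:
    """Trial division, sufficient for small n."""
    if n < 2:
        return False
    return all(n % d for d in range(2, int(n**0.5) + 1))

# No pair of primes sums to 123: for an odd total, one addend would
# have to be 2, and 121 = 11 * 11 is composite.
pairs = [(p, 123 - p) for p in range(2, 62)
         if is_prime(p) and is_prime(123 - p)]
print(pairs)                        # []

# Counting letters is trivial for code, yet trips up small LLMs.
print("strawberry".count("r"))      # 3
print("strawerberry".count("r"))    # 4
```

So the correct answer to the primes question is that no such pair exists, which is arguably a harder response for a model to commit to than producing a plausible-looking pair.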
Other Hacker News readers took issue with the idea that such models actually "reason", stressing that using beam search to select the best path to an answer is hardly "reasoning" at all. This stance is also backed by research showing that LLMs' problem-solving ability is quite limited, since they appear to rely on narrow procedures that do not transfer easily to problems differing significantly from those seen in training.
DeepThought-8B can be downloaded from Hugging Face or used on Ruliad's website.