News Archive

LIMO: small data, big results

A new study suggests that large language models (LLMs) can learn complex reasoning tasks with only a small set of examples, VentureBeat reported last week. The story is not accessible at this moment, but here’s an archived copy.

The researchers found that LLMs already have a lot of knowledge from their pre-training phase. With smart training methods, it could be possible to create custom LLMs without needing the huge resources of big AI labs.

The study introduces a method called “less is more” (LIMO) that uses fewer but carefully chosen examples to train LLMs. The researchers created a small LIMO dataset for hard math problems with just a few hundred examples. They fine-tuned the Qwen2.5-32B-Instruct LLM on this dataset.

The results were impressive. The LIMO-trained model solved 57.1% of problems on the tough AIME math test and 94.8% on the MATH test. It beat other models trained on much more data. It also did better than some advanced reasoning models like QwQ-32B-Preview and OpenAI o1-preview, which used more resources. The LIMO model even worked well on new, different problems, scoring high on science tests like OlympiadBench and GPQA.

Quality, not quantity of data

Reasoning tasks often require fine-tuning, and experts have long assumed that this takes large amounts of data. LIMO challenges that assumption, which could make it easier to build specialized models.

The researchers say LIMO works because LLMs already have reasoning knowledge from pre-training. New techniques also let models “think” longer by creating detailed reasoning chains, which helps them solve problems better. To make LIMO datasets, you need to pick hard problems that push the model to think in new ways. Solutions should be clear and well-organized, guiding the model step by step.

The researchers shared their code and data on GitHub. They plan to apply LIMO to other areas in the future. This study suggests that quality, not quantity, is key to unlocking LLM reasoning power.

Let us know your thoughts! Sign up for a Mindplex account now, join our Telegram, or follow us on Twitter.

xAI plans to showcase Grok 3 tonight

Elon Musk announced the launch of Grok 3, xAI’s latest AI chatbot, for tonight (Monday) at 8 p.m. Pacific Time. He claimed Grok 3 will be the smartest AI on Earth, outshining rivals like OpenAI’s ChatGPT and Google DeepMind’s Gemini.

xAI, founded by Musk in 2023, challenges tech giants like Microsoft and Google, intensifying the race for AI dominance.

A live demo will showcase Grok 3, drawing global attention to xAI’s ambitious push in the AI race. The launch could redefine AI standards. The demo will likely stream on X, making it accessible to a wide audience.

Tech fans and critics alike eagerly await the demo to see if Grok 3 lives up to the hype. Posts on X show mixed reactions, with some users excited for Grok 3’s potential and others skeptical of Musk’s bold claims.

The launch comes amid fierce competition in AI, with OpenAI rushing its GPT-5 release after Grok 3’s announcement.

xAI recently raised $6 billion from investors, boosting its valuation to $50 billion. This funding fuels its rapid development, including Grok 3’s advanced training on synthetic data.

The Colossus supercomputer

Grok 3 boasts powerful reasoning, image processing, and real-time data integration from X posts. It trained on 100,000 Nvidia H100 GPUs, using ten times more computing power than Grok 2. This massive training, powered by xAI’s Colossus supercomputer, aims to deliver unmatched accuracy and efficiency.

Colossus is a giant system with 100,000 Nvidia H100 GPUs connected by Nvidia’s networking technology. This setup allows the GPUs to work together smoothly, sharing data fast without delays. The system is liquid-cooled to handle the heat from so many chips running at once, ensuring they don’t overheat during long training sessions.

Building Colossus took just 122 days. It started training Grok 3 only 19 days after the first equipment arrived. The supercomputer uses huge amounts of electricity and needs a lot of water for cooling.

Tiny crystals, big storage: a breakthrough in computer memory

Anything with an “on” and an “off” state can store information, and this is how computers and cellphones keep data. In computers, transistors use low or high voltage to make ones and zeroes. On a CD, a one is a change from a tiny pit to a flat spot, while a zero is no change.
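The CD scheme described above can be sketched in a few lines of Python. This is a simplified illustration, not the actual CD data format: it assumes the surface is sampled once per clock tick, that a change in either direction (pit to flat or flat to pit) counts as a one, and it ignores the extra modulation and error-correction layers real discs use. The names `decode_transitions` and `track` are made up for this example.

```python
def decode_transitions(surface):
    """Turn a sequence of surface samples ('pit' or 'land') into bits.

    Assumption for this sketch: a change between consecutive samples
    is read as 1, and no change is read as 0.
    """
    return [1 if a != b else 0 for a, b in zip(surface, surface[1:])]


# Hypothetical stretch of disc surface, one sample per clock tick.
track = ["pit", "pit", "land", "land", "land", "pit"]
bits = decode_transitions(track)
print(bits)  # [0, 1, 0, 0, 1]
```

The point of the sketch is that the information lives in the transitions, not in the pits themselves, which is why the physical size of a pit sets a floor on how densely bits can be packed.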

The size of these parts has always limited how small storage devices can be. Now, researchers at the University of Chicago have found a new way to store data. They use tiny defects in crystals, each as small as an atom, to make ones and zeroes. Their work appeared in Nanophotonics.

Each memory cell is a missing atom in the crystal. This mixes ideas from quantum research with regular computer memory to pack huge amounts of data, like terabytes, into a tiny cube just one millimeter wide. Quantum research studies tiny particles, but this work improves normal, non-quantum storage.

A billion memory cells in one millimeter cube

The idea started with radiation dosimeters, which measure radiation exposure for hospital workers. These devices store radiation data in crystals. The researchers found they could use light to read this data. When the crystal gets energy, it releases electrons. The electrons get trapped in crystal defects. By shining light, the researchers could read the trapped information.

The researchers saw this could work for memory storage and created a new kind of storage device using quantum techniques for regular computers.

To make it work, they added rare earth elements, like praseodymium, to a crystal made of yttrium oxide. Rare earths have special properties that let light control them easily. They used a simple UV laser to activate the crystal. The laser makes the rare earth release electrons, which get trapped in crystal defects, like missing oxygen atoms.

Crystals always have defects, and the team used these gaps to store data. A charged gap is a one, and an uncharged gap is a zero. This way, they turned the tiny crystal into powerful storage. In a cube just one millimeter wide, they fit at least a billion memory cells for regular computers.
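A back-of-the-envelope calculation helps put the density figure in perspective. The sketch below assumes, purely for illustration, that the cells sit on a uniform cubic grid and that each cell holds one bit; neither assumption comes from the study, and the function names are invented here.

```python
def cell_spacing_nm(cells, edge_mm=1.0):
    """Spacing in nanometers between neighboring cells, assuming the
    cells form a uniform cubic grid inside a cube of side edge_mm."""
    cells_per_edge = cells ** (1 / 3)
    return edge_mm * 1e6 / cells_per_edge  # 1 mm = 1e6 nm


def capacity_megabytes(cells, bits_per_cell=1):
    """Raw capacity in megabytes, assuming bits_per_cell bits per cell."""
    return cells * bits_per_cell / 8 / 1e6


billion = 10**9
spacing = cell_spacing_nm(billion)       # about 1000 nm, i.e. one micrometer
capacity = capacity_megabytes(billion)   # 125 MB at one bit per cell

# For comparison: one terabyte at one bit per cell needs 8 trillion cells.
terabyte_cells = 8 * 10**12
terabyte_spacing = cell_spacing_nm(terabyte_cells)  # about 50 nm
```

Under these assumptions, the stated lower bound of a billion cells corresponds to cells about a micrometer apart, while terabyte-scale storage in the same cube would need cells spaced tens of nanometers apart. Atom-sized defects leave room for that.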

Other interesting news for 02.14.25

Multi-photon bionic skin realizes high-precision haptic visualization for reconstructive perception

Discovery of unexpected collagen structure could ‘reshape biomedical research’

First detection of an ultra-high-energy neutrino

When qubits learn the language of fiberoptics

Engineered animals show new way to fight mercury pollution

University of Houston physicists hit major milestone in advancing superconductor applications

Not humans or robots, but humans and robots; A perspective for AI-driven self-controlled laboratories of the future

SNU researchers develop soft robot that crawls, climbs, and shape-shifts to move in new directions

Is the Metaverse a new frontier for human-centric manufacturing?

AI predicts the precursor materials needed for material synthesis

More colors for a high-performance quantum internet

IOP Publishing and Fudan University convene experts to explore AI and Machine Learning’s impact on the Physical sciences

ChatGPT for birdsong may shed light on how language is wired in the human brain

Researchers control metal microstructure for better 3D printing

Evolution, evolution, evolution: How evolution got so good at evolving

Greetings from the Fourth Dimension

Physicists uncover evidence of two arrows of time emerging from the quantum realm

Discovering topological structures in water

The molecular einstein

Scientists discover mechanism driving molecular network formation

Engineers enable a drone to determine its position in the dark and indoors

Latent reasoning: language models that think?

In a thread posted to X, artificial intelligence (AI) popularizer Matthew Berman has commented on a new arXiv paper about language models that think. Berman, who seems enthusiastic about this paper, has also posted a video to YouTube about it.

Berman explains the central concept of latent reasoning, which means processing information in a hidden space before writing. This is different from chain of thought methods, where models generate words as they think. Latent reasoning helps models work better even without special training data. They can handle tasks with smaller windows of context, and understand ideas that are hard to express in words.

The researchers made a model with 3.5 billion parameters. This model has a part that turns input into a hidden thought space, another part that does the thinking, and a final part that turns thoughts back into words. The model decides how much thinking it needs based on the task’s difficulty, similar to how humans think more about hard problems.
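The three-part structure described above can be illustrated with a toy loop. This is only a sketch under loud assumptions: it is not the paper’s model, it uses no neural network, and the names `prelude`, `core`, `coda` and `think` as well as the stopping rule (stop when the hidden state barely changes) are chosen here for illustration.

```python
import math


def prelude(tokens):
    """Map input tokens into a hidden vector (toy: scaled character codes)."""
    return [(ord(t) % 32) / 32.0 for t in tokens]


def core(state, embedded, weight=0.5):
    """One 'thinking' step: mix the current hidden state with the input."""
    return [math.tanh(weight * s + e) for s, e in zip(state, embedded)]


def coda(state):
    """Turn the final hidden state back into an output (toy: one score)."""
    return round(sum(state) / len(state), 3)


def think(tokens, max_steps=64, tol=1e-6):
    """Repeat the core step until the hidden state stops changing.

    Returns the decoded output and the number of steps actually used.
    """
    if not tokens:
        raise ValueError("tokens must be non-empty")
    embedded = prelude(tokens)
    state = [0.0] * len(embedded)
    for step in range(1, max_steps + 1):
        new_state = core(state, embedded)
        delta = max(abs(a - b) for a, b in zip(new_state, state))
        state = new_state
        if delta < tol:  # hidden state has settled, stop thinking
            break
    return coda(state), step


answer, steps_used = think("abc")
```

Nothing is written out while the loop runs; only `coda` produces output at the end. The number of iterations is not fixed in advance, so a tighter tolerance or a harder-to-settle input simply uses more steps, which is the rough idea behind spending more compute on harder problems.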

This approach doesn’t need special data to train, works with less context, and can capture complex reasoning. The model shows patterns in its thinking process, like orbits and sliders, which are ways it organizes thoughts in the hidden space. Performance improves with more thinking iterations, matching larger models. It also allows for tricks like zero-shot adaptive compute, where the model adjusts its thinking without training, and sharing of key-value caches for efficiency. Continuous chain-of-thought means the model keeps a flow of reasoning.

Toward language models that truly reason?

This research could help language models truly reason, addressing critiques like those from Yann LeCun about their inability to reason deeply. While still experimental, it suggests a new path for language models, combining internal reasoning with traditional word generation.

Of course, readers should study the paper carefully before jumping to enthusiastic conclusions. Another thread posted to X by Jonas Geiping, one of the authors of the arXiv paper, can assist.

Scientists map brain connections with advanced silicon chip technology

Harvard researchers have developed a new way to study how brain cells connect and communicate. They used a silicon chip with thousands of tiny electrodes to record synaptic connection signals from about 2,000 rat neurons. Synaptic connections are points where neurons touch and communicate with each other.

The researchers have described the methods and results of this study in a paper published in Nature Biomedical Engineering.

Neurons form the basis of brain functions through their connections. Each connection, or synapse, has a strength that affects how neurons interact. While electron microscopy can show these connections visually, it doesn’t reveal how strong they are. A traditional method, called patch-clamp recording, measures this strength but can only handle a few neurons at once.

Massive amount of data from over 70,000 synaptic connections

The Harvard team created a chip with 4,096 microhole electrodes. This allowed them to record from many neurons simultaneously, capturing over 70,000 synaptic connections. This is a huge leap from their previous work with nanoneedle electrodes, which could only capture about 300 connections.

“The integrated electronics in the silicon chip plays as equally an important role as the microhole electrode, providing gentle currents in an elaborate way to obtain intracellular access, and recording at the same time the intracellular signals,” says one of the researchers in a Harvard press release.

This new technique is not only more effective but also easier to implement. The chip’s design allows for high-quality data, helping researchers understand the nature and strength of each connection.

The researchers note that one of the biggest challenges, after achieving a massively parallel intracellular recording, was how to analyze the overwhelming amount of data. The team has made significant progress in analyzing this data to uncover how neurons connect.

The researchers are now looking to adapt this technology for use in living brains.

Synthetic worms made of active matter exhibit life-like behaviors

Researchers led by the University of Bristol have developed synthetic materials that can move on their own, much like worms. These materials fall into a category called ‘active matter.’ Active matter involves substances that, unlike the usual inanimate materials like plastic or wood, can exhibit behaviors similar to living organisms. These materials contain elements that use internal energy to move by themselves.

The researchers used tiny particles known as Janus colloids, which are about one millionth of a meter in size. These particles were placed in a liquid and then exposed to a strong electric field. Using a special microscope that captures three-dimensional images, the researchers observed the particles’ behavior.

When the researchers applied the electric field, the scattered colloids joined together to form structures that looked like worms. This created a three-dimensional system of active matter.

A paper published in Physical Review Letters describes the methods and results of this study.

Potential applications to soft robotics and medicine

The researchers observed the emergence of self-driven, worm-like structures. They also developed a theory to predict and control how these synthetic worms move based on their length.

The behavior of these materials changes with density. At low densities, they form worm-like chains, but at higher densities, they create sheet-like or maze-like patterns. The researchers are now exploring further experiments and theoretical models to understand and possibly harness these materials for practical uses.

The researchers underline that, although practical uses might be years away, this discovery has potential for future applications.

It “could eventually lead to the ability to design devices that independently move different parts of themselves, or the design of swarms of particles which can search for a target which could have health applications by having specifically targeted medicines and treatments,” says researcher Tannie Liverpool in a press release issued by the University of Bristol.

A common brain circuit for creativity

A team led by researchers at Brigham and Women’s Hospital analyzed data from 857 people across 36 fMRI studies to find a common brain circuit for creativity. fMRI is a technique that shows which parts of the brain are active during tasks. They looked at healthy people first, then predicted how brain injuries or diseases might affect creativity. They discovered that changes in creativity could relate to where an injury is located relative to the creativity circuit.

The researchers found that creativity doesn’t just involve one brain spot but a whole circuit. This circuit includes areas active during activities like drawing or writing creatively.

A paper published in JAMA Network Open describes the methods and results of this study.

The researchers noticed that people with brain injuries or neurodegenerative diseases affecting this circuit might become more creative.

Less self-censoring, more creative thought

The researchers found it notable that all these creative areas were linked to the right frontal pole, a part of the brain that helps with self-monitoring and following rules. Reducing activity here might mean less self-censoring, allowing for more creative thought. This could explain why some people might get more creative with certain brain changes.

The study suggests that understanding this circuit could help in developing brain stimulation methods to boost creativity. However, the researchers stress that creativity involves many brain parts, not just this circuit. The findings also show how brain changes can both harm and enhance function, shedding light on neurodiversity.

The researchers say that the main message of these findings is that creativity in the brain is more about circuits than single regions, and brain injuries might alter creativity based on where they occur.

“We are learning more about neurodiversity and how brain changes that are considered pathological may improve function in some ways,” says researcher Isaiah Kletenik in a Brigham and Women’s Hospital press release. “These findings help us better understand how the circuitry of our brains may influence and unleash creativity.”

Sam Altman says OpenAI will launch GPT-4.5 and GPT-5 soon

OpenAI CEO Sam Altman announced a simplified product line, TechCrunch reports.

In a post on X titled “OPENAI ROADMAP UPDATE FOR GPT-4.5 and GPT-5”, Altman said that OpenAI will launch GPT-5 soon. GPT-5 will combine all OpenAI technology, including o3, into one system for ChatGPT and API use.

OpenAI won’t release o3, which was going to be their next big AI model, as a stand-alone product.

OpenAI had planned to launch o3 early this year, but that changed. Altman said the company wants to make AI easier to use and less complicated. He mentioned that OpenAI is not happy with how users have to pick models in ChatGPT, and that it aims for a more seamless experience.

When GPT-5 comes out, everyone will get unlimited chat access at a “standard intelligence” level, though there are limits to prevent misuse. Paying users get smarter versions of GPT-5. This new model will include features like voice, drawing, search, and deep research.

Before that, OpenAI will release GPT-4.5, dubbed Orion internally, which will be its last model without “chain-of-thought” reasoning. Models without this kind of reasoning are less reliable for tasks like math because they cannot self-check or break problems down into steps the way reasoning models do. Reasoning models, like OpenAI’s o1 or DeepSeek’s R1, take longer to answer but are more accurate and versatile.

GPT 4.5 / GPT 5 in weeks / months

In a reply, Altman said that OpenAI will release GPT 4.5 / GPT 5 in weeks / months.

DeepSeek has made waves with its R1 model, which matches o1 in performance. This has pushed OpenAI to speed up their releases.

These developments come at a time when Elon Musk has made a $97.4 billion bid to acquire OpenAI. Musk’s bid, which has been rejected by Altman, adds another layer of complexity to OpenAI’s strategic decisions, potentially influencing how they prioritize and release new AI models.

How the brain builds mental maps of the world

Researchers at the Janelia Research Campus of Howard Hughes Medical Institute (HHMI) have detailed how the brain creates cognitive maps to help us navigate and understand our surroundings.

The researchers watched thousands of neurons in the hippocampus, the part of the brain that deals with learning and memory. They studied how these neurons change over time as an animal learns to navigate two similar but different paths, like two hotel floors. At first, the neurons’ activity was quite similar for both paths. But as the animal learned, the neurons started to act differently, creating unique maps for each path. These maps help the animal tell the paths apart, even if they look alike.

A paper published in Nature describes the methods and results of this study.

Implications for neurology and AI research

The researchers used a special microscope to see how neurons behave while a mouse learned to find rewards in virtual corridors. The mouse had to figure out where to expect rewards based on visual cues. Over time, the mouse learned not to lick in places without rewards and to only lick where rewards were available. As the mouse learned, its brain activity began to reflect this learning. Neurons that initially responded similarly to both corridors began to respond differently, creating distinct maps for each corridor.

They also found “state cells,” which are neurons that pick up on hidden information, helping the brain tell one situation from another. This is like knowing which floor you’re on in a hotel even if they look the same, by remembering the elevator number.

To understand how the brain computes these maps, the researchers looked at different math models. They found that a model called a Clone-Structured Causal Graph best mimicked this learning process. This model shows the brain acts like a state machine, figuring out scenarios by considering hidden states.
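The state-machine idea can be illustrated with a toy example. To be clear, this is not a Clone-Structured Causal Graph and not the researchers’ model; it is a much simpler hand-written hidden-state tracker, and the corridor layout, the cue names and the function names are all invented for illustration. Two corridors look identical except at one position, and the tracker keeps only the hidden states that are consistent with everything seen so far.

```python
# Each hidden state is (corridor, position). Both corridors look the same
# ("grey") everywhere except position 1, where a visual cue differs.
OBSERVATIONS = {
    ("A", 0): "grey", ("A", 1): "stripes", ("A", 2): "grey", ("A", 3): "grey",
    ("B", 0): "grey", ("B", 1): "dots",    ("B", 2): "grey", ("B", 3): "grey",
}


def advance(state):
    """Move one position forward within the same corridor."""
    corridor, position = state
    return (corridor, position + 1)


def track(observed):
    """Return the hidden states consistent with a sequence of observations,
    assuming the animal starts at position 0 of an unknown corridor."""
    candidates = {
        s for s, o in OBSERVATIONS.items() if s[1] == 0 and o == observed[0]
    }
    for obs in observed[1:]:
        moved = {advance(s) for s in candidates}
        candidates = {s for s in moved if OBSERVATIONS.get(s) == obs}
    return candidates


ambiguous = track(["grey"])                      # both corridors still possible
resolved = track(["grey", "stripes", "grey"])    # only corridor A remains
```

At position 2 both corridors look the same, yet the tracker still knows it is in corridor A because it remembers the earlier cue. That is the sense in which identical sights can map to different internal states, much like remembering which elevator you took.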

This research not only maps out cognitive map formation but also hints at how the brain might compute these maps, offering insights into memory and intelligence. It could help improve treatments for memory disorders and advance artificial intelligence (AI). Understanding brain algorithms is key to connecting how cells and molecules work together to create intelligence.
