Please wait while the search results are loading...

DeepSeek Shakes up the LLMs/Reinforcement and Recursive Self Learning/Synthetic Data, Business and Industry Trends Analysis

China made ground-breaking news in early 2025 when the DeepSeek-V3 R1 AI large language was announced.  The developers claim to have created the system with only $5.6 million dollars in investment (compared to the normal U.S. AI model training budgets of $50 to $100 million) and to have done so quickly, with a lower total server investment.  The model is open source, and fees charged for accessing and using the model’s data are significantly lower than those of competing models.  This announcement from DeepSeek, established in 2023 in Hangzhou and funded by the hedge fund High Flyer, totally shook up competitors worldwide.  DeepSeek planned to launch an advanced R2 version quickly, and will likely grow through continual development of newer, enhanced models.
DeepSeek utilized several advanced strategies, including Reinforcement Learning and Distillation.  Reinforcement Learning encourages the system to experiment and improve itself continually.  Distillation is a process of creating a smaller, more efficient language model by “distilling” existing larger language models (which may include the generation of “synthetic” data, or new data generated during this process).

Reinforcement Learning (RL) and Recursive Self Learning (RSL)
Reinforcement Learning (RL) is a machine learning technique whereby an agent or system learns to make decisions by receiving feedback on its decisions and actions.  The system is “rewarded” or scored by the addition of positive points to its score when it is correct and penalized with negative points when it is wrong.  The system is programmed to attempt to achieve a high score, and it can gradually adjust its behavior to maximize the total reward score.  These systems learn the best processing paths to achieve desired outcomes or solutions.  To some extent, RL imitates the trial-and-error learning process that humans use to achieve their goals.  A good example is an autonomous (self-driving) vehicle system that enables a car to learn how to navigate challenging traffic conditions.
Recursive Self Learning (RSL) (also known as Recursive Self Improvement or RSI) is a system where the AI model makes itself much better and more accurate over time.  The system may write/enhance pieces of its own code, repeatedly improving itself.  For example, an RSL system might teach itself to play a complex game, such as chess, by playing the game over and over against expert humans, learning from the moves that generate wins and losses.

     Dozens of major large language models (LLMs) are constantly evolving and improving in a massive global competition with each other.  For example, in February 2025 Google announced the latest iteration of its Gemini model, called Gemini 2.0 Flash.  The firm claims greater capabilities and much faster responses with this new model.  During the same month, ChatGPT announced its “Deep Research” enhancement, “an agent that uses reasoning to synthesize large amounts of online information and complete multi-step research tasks for you.”  Aimed at completing complex tasks quickly, it conducts multi-step research on the internet, by finding, analyzing and synthesizing hundreds of online sources to complete a “comprehensive report at the level of a research analyst.”  The fact that Deep Research searches both the internet (in real time) and an OpenAI large language model at the same time is a significant step up in capabilities.


A Representative List of Organizations that Have Used our Research and Products:



Testimonials

I’m amazed at how much information is available and the various ways to access it. This will be a major resource for our serious job seekers.

Career Services, Penn State University

Plunkett Research Online provides a great ‘one stop shop’ for us to quickly come up to speed on major industries. It provides us with an overall analysis of the market, key statistics, and overviews of the major players in the industry in an online service that is fast, easy to navigate, and reliable.

Wendy Stotts, Manager, Carlson Companies

I really appreciate the depth you were able to get to so quickly (for our project). The team has looked through the material and are very happy with the data you pulled together.

Hilton Worldwide, Marketing Manager

We are especially trying to push Plunkett since all of our students have to do so much industry research and your interface is so easy to use.

Library Services, St. John’s College

We are especially trying to push Plunkett’s since all of our students have to do so much industry research and your interface is so easy to use.

Gary White, Business Materials Selector, Penn State University

Your tool is very comprehensive and immensely useful. The vertical marketing tool is very helpful, for it assists us in that venue, as well as targeting customers’ competition for new sales…The comprehensive material is absolutely fabulous. I am very impressed, I have to say!

Tammy Dalton, National Account Manager, MCI

The more I get into the database, the happier I am that we’ll have it–REALLY happy!!! Between the quality and affordability of your product, its appeal to and value for our users, and the inestimably ethical and loyalty-guaranteeing conduct of your business, I will always have more than sufficient praises to sing for Plunkett Research.

Michael Oppenheim, Collections & Reference Services, UCLA

Plunkett Research Online is an excellent resource…the database contains a wealth of useful data on sectors and companies, which is easy to search and well presented. Help and advice on how to conduct, export and save searches is available at all stages.

Penny Crossland, Editor, VIP Magazine
Real Time Web Analytics