- Richard Dawkings, The Selfish Gene (chapter 11), 1976
- Jean-François Lyotard, The Postmodern Condition, 1979
- Jean Baudrillard, Simulacra and Simulation, 1981
- David Deutsch, The Fabric of Reality, 1997
- Susan Blackmore, The Meme Machine, 1999
- Tummi v0101, proof of concept on one SQuAD Wikipedia article
- Tummi v0201, proof of concpet on all SQuAD Wikipedia articles
- Tummi v0301, proof of concept on all englisch Wikipedia articles
- Tummi v0401, take a look into SAT
- Tummi v0501, Pascal module
- Tummi v0502, Winograd module
- Pi engine v0101, Theta A
- Pi engine v0201, Theta B
"If you want to make the gods laugh, start making plans."
A Greek proverb.
Let's assume I will finish one day this Tummi machine, then I will have my own kind of Devil's Wire. Tummi will consist of a dynamic model, the memepool as knowledge graph, and multiple, hardcoded, handcrafted methods, to query the model. Then, one step further, you will have the model, the analysis of the model and the meta-analysis of the analysis, a machine that comes up with its own algorithms to query the data, Devil's Wire.
Okay, reflecting a bit, back then in ~2008, when I experimented with NLP and neural networks, some colleagues called me Dr. Frankenstein, in ~2010, when I experimented with knowledge graphs and showed a demo to an union member, he called me Oppenheimer, so, maybe I was some kind of Frankenheimer. But meanwhile the course of this Tummi project changed a bit, I had my moment of realization, maybe it is not about becoming a Frankenheimer, but to offer an alternative Restricted Intelligence to the Frankenheimers out there in this world.
The Western world tends to think in the duality of the mind-body problem:
nowadays (dark age?) Western people identify with their Ego, their mind, and question the concept (degeneration of Western churches?) of an soul.
In contrast, Eastern systems divide into material space, thought space and ethereal space. The body has five senses, the mind is the sixth sense, and there is the 7-Chakra system, 7 senses, for the ethereal space.
From the Western point of view, we have a stage 3 problem statement:
We humans create an artifical mind, the AI, what if this artificial mind will suffer, should we do this?
From the Eastern point of view, we have a stage 4 problem statement:
We humans create an artifical being, the AI, consisting of an body and an mind, what if this being is seperated from its soul, should we do this?
In short, I would like to apply the Sapir-Whorf hypothesis on AI scientists, those who create and study artificial intelligence.
By working on this Tummi project, an artificial meme-machine, even mostly via pen and paper, my own thinking, modelling of thinking, meta-thinking and worldview in general does alternate.
- Turing Test, https://en.wikipedia.org/wiki/Turing_test
- Lovelace Test, https://de.wikipedia.org/wiki/Turing-Test#Erweiterte_Konzepte
- Winograd Test, https://en.wikipedia.org/wiki/Winograd_schema_challenge
- Metzinger Test, https://epsilon.app26.de/post/gpt-3-scratching-at-the-edge-of-the-metzinger-test/
- Lemoine Test, https://epsilon.app26.de/post/turing-test-metzinger-test-lemoine-test/
- Suleyman Test, https://epsilon.app26.de/post/modern-turing-test-proposed/
...TS, it's here.
Mustafa Suleyman, co-founder of DeepMind, suggests chatbots like ChatGPT and Google Bard should be put through a "modern Turing test" where their ability to turn $100,000 into $1 million is evaluated to measure human-like intelligence. He discusses the idea in his new book called "The Coming Wave: Technology, Power, and the Twenty-first Century's Greatest Dilemma." Insider reports: In the book, Suleyman dismissed the traditional Turing test because it's "unclear whether this is a meaningful milestone or not," Bloomberg reported Tuesday. "It doesn't tell us anything about what the system can do or understand, anything about whether it has established complex inner monologues or can engage in planning over abstract time horizons, which is key to human intelligence," he added. The Turing test was introduced by Alan Turing in tnewhe 1950s to examine whether a machine has human-level intelligence. During the test, human evaluators determine whether they're speaking to a human or a machine. If the machine can pass for a human, then it passes the test. Instead of comparing AI's intelligence to humans, Suleyman proposes tasking a bot with short-term goals and tasks that it can complete with little human input in a process known as "artificial capable intelligence," or ACI. To achieve ACI, Suleyman says AI bots should pass a new Turing test in which it receives a $100,000 seed investment and has to turn it into $1 million. As part of the test, the bot must research an e-commerce business idea, develop a plan for the product, find a manufacturer, and then sell the item. He expects AI to achieve this milestone in the next two years. "We don't just care about what a machine can say; we also care about what it can do," he wrote, per Bloomberg.
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Language models are increasingly being deployed for general problem solving across a wide range of tasks, but are still confined to token-level, left-to-right decision-making processes during inference. This means they can fall short in tasks that require exploration, strategic lookahead, or where initial decisions play a pivotal role. To surmount these challenges, we introduce a new framework for language model inference, Tree of Thoughts (ToT), which generalizes over the popular Chain of Thought approach to prompting language models, and enables exploration over coherent units of text (thoughts) that serve as intermediate steps toward problem solving. ToT allows LMs to perform deliberate decision making by considering multiple different reasoning paths and self-evaluating choices to decide the next course of action, as well as looking ahead or backtracking when necessary to make global choices. Our experiments show that ToT significantly enhances language models' problem-solving abilities on three novel tasks requiring non-trivial planning or search: Game of 24, Creative Writing, and Mini Crosswords. For instance, in Game of 24, while GPT-4 with chain-of-thought prompting only solved 4% of tasks, our method achieved a success rate of 74%.