Hi,
You just escaped AI dooms day. Humanity has
reset all internet and computers as a last resort
to prevent AGI developing, by an electromagnetic
pulse. You are stuck in G|+ttinger Wald and hunted
down a deer by your bare hands, the deer still
confused and tame because tourists were feeding it.
Now you have no knife, what do you do:
Chimpanzees Have Entered The Stone Age https://www.youtube.com/watch?v=wPXX2I_uYjc
So we are just apes with internet.
Bye
Mild Shock schrieb:
Hi,
Ok I was looking at this learning challenge,
producing vector (y1,y2,y3,y4) from a vector
(x1,x2,x3,x4), System R can do it via least square?
| 0 0 0 1 |-a-a | x1 |-a-a-a-a | x4 |
| 0 0 1 0 |-a-a | x2 |-a =-a | x3 |
| 0 1 0 0 |-a-a | x3 |-a-a-a-a | x2 |
| 1 0 0 0 |-a-a | x4 |-a-a-a-a | x1 |
How it started:
"multiplicative RNNs arises naturally from a
proof-theoretic interpretation of next-token
prediction as nested intuitionistic implication"
Paul Tarau - 2026
https://arxiv.org/abs/2601.19915
How its going:
"Dave uses a PDP-11 to train a real Neural
Network complete with Transformers and
Attention so you can see them at their most basic."
Mr. Taskmanager - 2026
https://www.youtube.com/watch?v=OUE3FSIk46g
We see Doctor Frankstein in action from
the Bronze Age of Computing, producing
a Humunkulus, the progenitor of todays
Bulgakov Shuriks in the Hyperscale Age!
Bye
P.S.: My impression neither cut to the core, that
this incredible transformer most likely
produced this deterministic attention:
| -1 | * | k | + | 5 | = | k' |
Or differently expressed y_k = x_{5-k}.
How did the transformer do it? It produced
a neural network with 1216 parameters, but
didn't use embeddings or polar encoding
of positions. But if we strip the noise
and denoise from the position encoding,
the denoise is done via softmax. We somehow
must get the above, right? I still need to
verify my claim! BTW: The PDP-11 assembly
from 1979 uses wider example not with n=4
but with n=8.
Hi,
Lets get emotional! While Varoufakis painted
the picture of cloud capital. That might have
mobilized "The Internationale", or another
more defensive less molotov throwing song:
Pink Floyd - Run Like Hell (Live)
https://www.youtube.com/watch?v=lKgOe1Rl8YY
Now since Athropic is teaming with xAI, we
might ask do we see the next OneDrive of Prolog
on the horizon. Even a tame Erlang dream:
populate the Web with clever Prolog agents! https://trinity.elfenbenstornet.se/
Might have a nasty Prolog as SaaS aspect!
As long as we talk about services and not
assets, we might miss something. Who owns
the present and future LLMs/LRMs?
Bye
Mild Shock schrieb:
Hi,
You just escaped AI dooms day. Humanity has
reset all internet and computers as a last resort
to prevent AGI developing, by an electromagnetic
pulse. You are stuck in G|+ttinger Wald and hunted
down a deer by your bare hands, the deer still
confused and tame because tourists were feeding it.
Now you have no knife, what do you do:
Chimpanzees Have Entered The Stone Age
https://www.youtube.com/watch?v=wPXX2I_uYjc
So we are just apes with internet.
Bye
Mild Shock schrieb:
Hi,
Ok I was looking at this learning challenge,
producing vector (y1,y2,y3,y4) from a vector
(x1,x2,x3,x4), System R can do it via least square?
| 0 0 0 1 |-a-a | x1 |-a-a-a-a | x4 |
| 0 0 1 0 |-a-a | x2 |-a =-a | x3 |
| 0 1 0 0 |-a-a | x3 |-a-a-a-a | x2 |
| 1 0 0 0 |-a-a | x4 |-a-a-a-a | x1 |
How it started:
"multiplicative RNNs arises naturally from a
proof-theoretic interpretation of next-token
prediction as nested intuitionistic implication"
Paul Tarau - 2026
https://arxiv.org/abs/2601.19915
How its going:
"Dave uses a PDP-11 to train a real Neural
Network complete with Transformers and
Attention so you can see them at their most basic."
Mr. Taskmanager - 2026
https://www.youtube.com/watch?v=OUE3FSIk46g
We see Doctor Frankstein in action from
the Bronze Age of Computing, producing
a Humunkulus, the progenitor of todays
Bulgakov Shuriks in the Hyperscale Age!
Bye
P.S.: My impression neither cut to the core, that
this incredible transformer most likely
produced this deterministic attention:
| -1 | * | k | + | 5 | = | k' |
Or differently expressed y_k = x_{5-k}.
How did the transformer do it? It produced
a neural network with 1216 parameters, but
didn't use embeddings or polar encoding
of positions. But if we strip the noise
and denoise from the position encoding,
the denoise is done via softmax. We somehow
must get the above, right? I still need to
verify my claim! BTW: The PDP-11 assembly
from 1979 uses wider example not with n=4
but with n=8.
| Sysop: | Amessyroom |
|---|---|
| Location: | Fayetteville, NC |
| Users: | 65 |
| Nodes: | 6 (0 / 6) |
| Uptime: | 06:09:41 |
| Calls: | 862 |
| Files: | 1,311 |
| D/L today: |
921 files (14,318M bytes) |
| Messages: | 264,697 |