Hi,
Please meet Luo Fuli:
The 29-Year-Old Genius Behind DeepSeek’s AI Revolution
https://www.youtube.com/watch?v=B2fxh4aoQ8Q
I find this paper interesting, finally
some say about fine tuning during pretraing:
Raise a Child in Large Language Model
13 Sep 2021 - Fuli Luo et al.
https://arxiv.org/pdf/2109.05687
Bye
Mild Shock schrieb:
Hi,
So how its going? DeepSeek embraced by many cloud
providers, even by NVIDIA NIM itself.
DeepSeek-R1 Now Live With NVIDIA NIM https://blogs.nvidia.com/blog/deepseek-r1-nim-microservice/
What what are these models doing and how are they
trained. Is Geoffrey Hinton our only AI God? There
seems to be another slightly disputed AI God,
S. Hochreiter, J. Schmidhuber. Long Short-Term Memory. Neural
Computation, 9(8):1735-1780, 1997. https://people.idsia.ch/~juergen/deep-learning-history.html
Bye
P.S.: It allows a mechanistic view on our linguistic
brain if the latent space is some semantic vectors?
So that learning is a kind of control mechanism:
Machine Learning Approach to Model Order Reduction
of Nonlinear Systems via Autoencoder and LSTM Networks
Thomas Simpson - 23 Sep 2021
https://arxiv.org/abs/2109.11213
Mild Shock schrieb:
Hi,
Wait till USA figures out there is a second
competitor besides DeepSeek, its called Yi-Lightning:
Yi-Lightning Technical Report
https://arxiv.org/abs/2412.01253
It was already discussed 2 months ago:
Eric Schmidt DROPS BOMBSHELL: China DOMINATES AI!
https://www.youtube.com/watch?v=ddWuEUjo4u4
Bye
--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)