Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Open to Collab
19
6
24
AbstractPhila
PRO
AbstractPhil
Follow
rzyns's profile picture
imgstry's profile picture
sh111111111111111's profile picture
85 followers
·
124 following
https://civitai.com/user/AbstractPhila
AbstractEyes
AI & ML interests
datasets, research papers, experimentation, vision, classification, text encoders, tokenization, llms, diffusion, distillation, and more.
Recent Activity
updated
a model
1 day ago
AbstractPhil/geolip-aleph-void
reacted
to
OzTianlu
's
post
with 🧠
1 day ago
ResNet is Explicit Euler. GPT is Implicit Euler. What Else is Hiding in Plain Sight? Read online: https://datawhalechina.github.io/learning-terrain/ I wrote an open-source monograph on learning dynamics — The Terrain of Learning. Bilingual (Chinese/English), 4 volumes, 12 chapters, 30+ print-grade figures. Completely free (CC BY-NC-SA 4.0). The core argument: gradient descent is not optimization. It's terrain motion. The loss function is a landscape. The gradient is the direction of slope. The optimizer is how you choose each step. Once you see it this way, everything clicks: ResNet = explicit Euler integration on a vector field. The residual branch is the vector field. Each layer takes one Euler step. GPT autoregression = implicit-state Euler iteration. Stable where explicit Euler explodes. That's why transformers handle long-range dependencies. DEQ = the Banach fixed-point theorem in production. The forward pass is root-finding. There are no layers to backprop through. KL divergence = a Bregman divergence on the entropy landscape. Your belief space is curved, not flat. Chain-of-thought reasoning = hidden states flowing along a reasoning field toward an attractor basin. Correct answers have wide basins. The number of reasoning steps is determined by the terrain, not by the problem. Diffusion models = systems flowing downhill along a score vector field, from noise to structure, from high energy to low energy. The book traces one idea across 337 years — from F=ma (Newton, 1687) to H=T+V (Hamilton, 1833) to loss landscape + gradient field (2020s). Hamilton replaced a catalog of forces with one geometric object. This book does the same for deep learning. GitHub: https://github.com/datawhalechina/learning-terrain Discussion: https://github.com/datawhalechina/learning-terrain/discussions/2 Convergence is not hope. Convergence is geometry. You see.
replied
to
OzTianlu
's
post
1 day ago
ResNet is Explicit Euler. GPT is Implicit Euler. What Else is Hiding in Plain Sight? Read online: https://datawhalechina.github.io/learning-terrain/ I wrote an open-source monograph on learning dynamics — The Terrain of Learning. Bilingual (Chinese/English), 4 volumes, 12 chapters, 30+ print-grade figures. Completely free (CC BY-NC-SA 4.0). The core argument: gradient descent is not optimization. It's terrain motion. The loss function is a landscape. The gradient is the direction of slope. The optimizer is how you choose each step. Once you see it this way, everything clicks: ResNet = explicit Euler integration on a vector field. The residual branch is the vector field. Each layer takes one Euler step. GPT autoregression = implicit-state Euler iteration. Stable where explicit Euler explodes. That's why transformers handle long-range dependencies. DEQ = the Banach fixed-point theorem in production. The forward pass is root-finding. There are no layers to backprop through. KL divergence = a Bregman divergence on the entropy landscape. Your belief space is curved, not flat. Chain-of-thought reasoning = hidden states flowing along a reasoning field toward an attractor basin. Correct answers have wide basins. The number of reasoning steps is determined by the terrain, not by the problem. Diffusion models = systems flowing downhill along a score vector field, from noise to structure, from high energy to low energy. The book traces one idea across 337 years — from F=ma (Newton, 1687) to H=T+V (Hamilton, 1833) to loss landscape + gradient field (2020s). Hamilton replaced a catalog of forces with one geometric object. This book does the same for deep learning. GitHub: https://github.com/datawhalechina/learning-terrain Discussion: https://github.com/datawhalechina/learning-terrain/discussions/2 Convergence is not hope. Convergence is geometry. You see.
View all activity
Organizations
AbstractPhil
's models
190
Sort: Recently updated
AbstractPhil/geolip-aleph-void
Feature Extraction
•
Updated
1 day ago
AbstractPhil/geolip-sdxl-aleph
Text-to-Image
•
Updated
7 days ago
•
•
1
AbstractPhil/geolip-hypersphere-experiments
Updated
12 days ago
•
1
AbstractPhil/geolip-svae-transformer
Feature Extraction
•
Updated
15 days ago
AbstractPhil/sd15-flow-lune-flux
Updated
21 days ago
AbstractPhil/SDXL-Simulacrum-V3-1
0.2B
•
Updated
21 days ago
AbstractPhil/geolip-SVAE
Updated
26 days ago
•
2
AbstractPhil/sd15-flow-lune-json-geolip-vit
Updated
28 days ago
AbstractPhil/sd15-flow-lune-json-geolip-prompt
Updated
28 days ago
AbstractPhil/qwen3.5-0.8b-task_1-lora-v2
Updated
28 days ago
AbstractPhil/sd15-flow-lune-json-prompt
Updated
28 days ago
AbstractPhil/sd15-flow-lune-json-vit
Updated
28 days ago
AbstractPhil/qwen-json-finetunes-dump
Updated
29 days ago
AbstractPhil/qwen3.5-0.8b-task_1-lora-v2-stage1
Updated
29 days ago
AbstractPhil/qwen3.5-0.8b-task_1-lora
Text Generation
•
Updated
May 15
•
20
AbstractPhil/qwen3.5-0.8b-task_3-lora
Text Generation
•
Updated
May 15
•
1
AbstractPhil/qwen3.5-0.8b-task_2-lora
Text Generation
•
Updated
May 15
•
2
AbstractPhil/geolip-svae-text
Updated
May 8
AbstractPhil/geolip-svae-implicit-solver-experiments
Updated
Apr 25
AbstractPhil/geolip-svae-h2-64
11M
•
Updated
Apr 25
•
9
AbstractPhil/geolip-svae-ablations
Updated
Apr 24
AbstractPhil/geolip-svae-batteries
Other
•
Updated
Apr 20
AbstractPhil/geolip-cvae-proto
Updated
Apr 20
AbstractPhil/geolip-svd-encoder-sweeps
Updated
Apr 18
AbstractPhil/geolip-spectral-cell
Updated
Apr 16
AbstractPhil/geolip-spectral-vit
Updated
Apr 15
AbstractPhil/geolip-conduit-experiments
Updated
Apr 11
AbstractPhil/geolip-svd-reconstitution
Updated
Apr 10
AbstractPhil/svae-freckles-4096
Updated
Apr 9
AbstractPhil/svae-freckles-256
Updated
Apr 9
Previous
1
2
3
...
7
Next