Archive for the '–' Category

RL Project

January 11, 2026

some reading materials on RL:
https://arxiv.org/pdf/2412.05265
https://arxiv.org/abs/2412.05265

Murphy, K. (2024, December 6). Reinforcement Learning: An Overview. arXiv.org. https://arxiv.org/abs/2412.05265

QT:{{”
Reinforcement learning or RL is a class of methods for solving various kinds of sequential decision making
tasks. In such tasks, we want to design an agent that interacts with an external environment. The agent
maintains an internal state zt, which it passes to its policy π to choose an action at = π(zt). The environment
responds by sending back an observation ot+1, which the agent uses to update its internal state using the
state-update function zt+1 = SU(zt, at, ot+1). See Figure 1.1 for an illustration.
To simplify things, we often assume that the environment is also a Markovian process, which has internal
world state wt, from which the observations ot are derived. (This is called a POMDP — see Section 1.2.1).
We often simplify things even more by assuming that the observation ot reveals the hidden environment state;
in this case, we denote the internal agent state and external environment state by the same letter, namely
st = ot = wt = zt. (This is called an MDP — see Section 1.2.2). We discuss these assumptions in more detail
in Section 1.1.3.
RL is more complicated than supervised learning (e.g., training a classifier) or self-supervised learning
(e.g., training a language model), because this framework is very general: there are many assumptions we can
make about the environment and its observations ot, and many choices we can make about the form the
agent’s internal state zt and policy π, as well the ways to update these objects as we see more data. We
will study many different combinations in the rest of this document. The right choice ultimately depends on
which real-world application you are interested in solving.1 .”}}

What Is a Blepharoplasty? What to Know About the Popular Eyelid Surgery – WSJ

January 4, 2026

https://www.wsj.com/style/beauty/blepharoplasty-eyelid-lift-cosmetic-surgery-cc865580?st=iLeGT9&reflink=article_gmail_share

CourseTable

January 4, 2026

https://coursetable.com/catalog?selectSortBy=overall&searchText=s%26ds&sortOrder=desc

Metabolites and you – People leave molecular wakes that may give away their secrets | Science and technology | The Economist

January 3, 2026

https://www.economist.com/science-and-technology/2020/02/13/people-leave-molecular-wakes-that-may-give-away-their-secrets

GP-write 5.0 • Virtual Conference October 21-22, 2021

January 3, 2026

https://engineeringbiologycenter.org/2021meeting/

NYTimes.com: ‘Chinese Peptides’ Are the Latest Biohacking Trend in the Tech World

January 3, 2026

‘Chinese Peptides’ Are the Latest Biohacking Trend in the Tech World

The gray-market drugs flooding Silicon Valley reveal a community that believes it can move faster than the F.D.A.

https://www.nytimes.com/2026/01/03/business/chinese-peptides-silicon-valley.html?smid=em-share

‘Minions: The Rise of Gru’ director on Jackie Chan, Bruce Lee

January 2, 2026

https://variety.com/2022/artisans/artists/minions-gru-1235308229/

‘Minions’ and Its Influences: A Goggle-Eyed View – The New York Times

January 2, 2026

https://www.nytimes.com/interactive/2015/07/02/movies/05minions-feature.html

What Are ‘World Models’? The Key to the Next Big AI Leap – WSJ

January 2, 2026

What Are ‘World Models’? The Key to the Next Big AI Leap – WSJ https://www.wsj.com/tech/ai/world-models-ai-evolution-11275913
https://www.wsj.com/tech/ai/world-models-ai-evolution-11275913?mod=article_inline

1900: Rediscovery of Mendel’s Work

January 1, 2026

https://www.genome.gov/25520238/online-education-kit-1900-rediscovery-of-mendels-work QT:{{” DeVries, Correns and Tschermak independently rediscover Mendel’s work. Three botanists – Hugo DeVries, Carl Correns and Erich von Tschermak – independently rediscovered Mendel’s work in the same year, a generation after Mendel published his papers. They helped expand awareness of the Mendelian laws of inheritance in the scientific world.
The three Europeans, unknown to each other, were working on different plant hybrids when they each worked out the laws of inheritance. When they reviewed the literature before publishing their own results, they were startled to find Mendel’s old papers spelling out those laws in detail. Each man announced Mendel’s discoveries and his own work as confirmation of them. “}}