DNA seen through the eyes of a coder

Friday, March 28th, 2008

Bert Hubert describes DNA seen through the eyes of a coder — and, frankly, I would expect coders to have a better grasp of DNA than most biologists:

DNA is not like C source but more like byte-compiled code for a virtual machine called the nucleus. It is very doubtful that there is a source to this byte compilation — what you see is all you get.

The language of DNA is digital, but not binary. Where binary encoding has 0 and 1 to work with (2 — hence the binary), DNA has 4 positions, T, C, G and A.

Whereas a digital byte is mostly 8 binary digits, a DNA byte (called a codon) has three digits. Because each digit can have 4 values instead of 2, an DNA codon has 64 possible values, compared to a binary byte which has 256.

A typical example of a DNA codon is GCC, which encodes the amino acid Alanine. A larger number of these amino acids combined are called a polypeptide or protein, and these are chemically active in making a living being.

That’s all pretty basic. Let’s move along to position independent code and conditional compilation:

Dynamically linked libraries (.so under Unix, .dll on Microsoft) code cannot use static addresses internally because the code may appear in different places in memory in different situations. DNA has this too, where it is called transposing code:

Nearly half of the human genome is composed of transposable elements or jumping DNA. First recognized in the 1940s by Dr. Barbara McClintock in studies of peculiar inheritance patterns found in the colors of Indian corn, jumping DNA refers to the idea that some stretches of DNA are unstable and “transposable,” ie., they can move around — on and between chromosomes.

Of the 20,000 to 30,000 genes now thought to be in the human genome, most cells express only a very small part — which makes sense; a liver cell has little need for the DNA code that makes neurons.

But as almost all cells carry around a full copy (distribution) of the genome, a system is needed to #ifdef out stuff not needed. And that is just how it works. The genetic code is full of #if/#endif statements.

This is why stem cells are so hot right now — these cells have the ability to differentiate into everything. The code hasn’t been #ifdeffed out yet, so to speak.

Stated more exactly, stem cells do not have everything turned on — they are not at once liver cells and neurons. Cells can be likened to state machines, starting out as a stem cell. Over the lifetime of the cell, during which time it may clone (fork()) many times, it specializes. Each specialization can be regarded as choosing a branch in a tree.

Each cell can make (or be induced to make) decisions about its future, which each make it more specialized. These decisions are persistent over cloning using transcription factors and by modifying the way DNA is stored spatially (steric effects).

A liver cell, although it carries the genes to do so, will generally not be able to function as a skin cell. There are some indications out there that it is possible to breed cells upwards into the hierarchy, making them pluripotent.

From a coder’s perspective, so-called junk DNA is just dead code, bloat, and comments:

The genome is littered with old copies of genes and experiments that went wrong somewhere in the recent past — say, the last half a million years. This code is there but inactive. These are called the pseudo genes.

Furthermore, 97% of your DNA is commented out. DNA is linear and read from start to end. The parts that should not be decoded are marked very clearly, much like C comments. The 3% that is used directly form the so called exons. The comments, that come inbetween are called introns.

These comments are fascinating in their own right. Like C comments they have a start marker, like /*, and a stop marker, like */. But they have some more structure. Remember that DNA is like a tape — the comments need to be snipped out physically! The start of a comment is almost always indicated by the letters GT, which thus corresponds to /*, the end is signalled by AG, which is then like */.

However because of the snipping, some glue is needed to connect the code before the comment to the code after, which makes the comments more like html comments, which are longer: <!– signifies the start, –> the end.

If code and DNA interest you, definitely read the whole thing.

Posted in Science, Technology | 1 Comment »

Comments

Sam J. says:

September 4, 2020 at 2:46 pm

Interesting. Thanks for bringing it to our attention.

Jim: What actually mattered was not the profession of the user, but their expertise. The more domain experience someone had, the more successful they were in using Claude Code in that domain. And, even more interestingly, the more useful output they got from Claude from each prompt. This is because each session begins at the center of the manifold, and human expertise is required to push it further and further into distant regions. The more expertise you have, the more you can push the session in a...
Jim: Biotech is arguably AI’s most promising application.
Jim: James James: Granted, the Great Halfrican Uprising Media Spectacle presumably was astroturfed with real money—to the extent that the banks’ circulating credits can be described either as “real” or as “money”—which must then have found its way into the pockets of real people.
Jim: *A media spectacle involving Halfricans somehow, or a different media spectacle involving a rabble of white people wandering the premises of the so-called “People’s House”, one would not.
James James: Black Lives Matter may have been fleeting, but for the people who stole millions of dollars and bought multiple houses, the consequences were not fleeting. These short-term movements could also be conceptualized as temporary astroturf fronts for permanent political machines.
Jim: Politics ordinarily affects the distribution or redistribution of wealth, directly or indirectly, as its ends or as a byproduct. “Follow the money,” as they say. Thus, the enormous shorting of airline stocks just before 9/11 was politics, just as South Carolina’s tax cut on Boomers’ boats is politics, and Florida’s tax cut on Boomers’ houses is politics. By this metric, one would expect “hyperpoliticsR 21; to be something like the seizure of innocent...
Jim: There is hardly anything more unsettling than a parasite.
Jim: Bruce: honored. Isegoria: Thiel’s optimistic thought experiment, as excerpted, fails to suggest any awareness that material production has intrinsic value independent of its successful financialization. As a card-carrying member of the “investor” class, a group of “people” who “allocate capital” (i.e., redirect the allocation of the labor of engineers) in order to “seek returns” (i.e., extract free money), that he would be blind to unpriced,...
Bob Sykes: The US no longer has either the military or industrial base (and maybe not the quality of people) needed to control the Strait of Hormuz. The situation in the Gulf has changed radically since we invaded Iraq. Iran is much stronger in every way, especially in asabiyyah. Iran covers some 600,000 sq mi, and has a population of 92 M. By comparison, Western Europe. The EU and UK combined have a population of 430 M and an area of 1.2 M sq mi. Iran has a modern industrial economy, and is capable of...
Isegoria: You might also consider Thiel’s optimistic thought experiment.
Bruce: Jim, your sales pitch made me buy it.
Jim: Just think of it: * Billionaire * Homosexual * Recipient of CIA investment funds * Gives talks at CIA events * Capitalist * Most famously associated with PayPal, the definitive fintech company * Now famously associated with Palantir, the definitive private surveillance company * One of the earliest investors in FaceBook, formerly LifeLog * Funded the foremost rocket company, SpaceX * Funded one of the leading arms producers, Anduril * Many other things * Presumably a bunch of weird stuff that...
Jim: Thiel’s book, read adversarially, is shockingly revealing. I’ve long been intensely amused that he gets away with calling himself a libertarian.
Jim: Correction: The Protocols of the Elders of Zion.
Isegoria: Thiel definitely warrants multiple posts. I first mentioned him as part of the PayPal mafia, back in 2007, and then as the head of Founders Fund.
Gaikokumaniakku: I only read Zero to One once, back in 2015 or so, but I should definitely give it another look. I encountered it back then as required reading for the interview process at a tech startup that ended up failing. Peter Thiel is an interesting character that could provide material for several blog posts.
Jim: Peter Thiel’s Zero to One is like The Protocols of the Elders of Zion for capitalist rent-seekers.
Bob Sykes: One of the reasons for the existence of EMT’s was the inability of medics and corpsmen from the Vietnam War to get work as nurses in hospitals and clinics. Nurses and their professional associations adamantly opposed formal recognition of the emergency skills of the men who had served in the war. Eventually EMT’s were added to fire departments as a way of letting the men use their skills, much to the benefit of everyone. The nurses’ resistance to recognizing these men was especially churlish...
Jim: Gaikokumaniakku: “Jim probably knows a lot more about capitalism than I do, and probably could teach me, but probably has better things to do with his time. If anyone wants to chime in with book recommendations, I’m all ears.” Zero to One, by Peter Thiel.
Eric Brown: Ah, no. Railroads got public land *after* building the railroad, not before. Even so, ~80% of the railroad companies got overextended and went bankrupt. Same thing happened with telegraph companies, though they didn’t get public land. There are obvious parallels with the internet and AI.

Isegoria

DNA seen through the eyes of a coder

Comments

Leave a Reply

Search

Recent Comments

Categories