DNA seen through the eyes of a coder

Friday, March 28th, 2008

Bert Hubert describes DNA seen through the eyes of a coder — and, frankly, I would expect coders to have a better grasp of DNA than most biologists:

DNA is not like C source but more like byte-compiled code for a virtual machine called the nucleus. It is very doubtful that there is a source to this byte compilation — what you see is all you get.

The language of DNA is digital, but not binary. Where binary encoding has 0 and 1 to work with (2 — hence the binary), DNA has 4 positions, T, C, G and A.

Whereas a digital byte is mostly 8 binary digits, a DNA byte (called a codon) has three digits. Because each digit can have 4 values instead of 2, a DNA codon has 4 × 4 × 4 = 64 possible values, compared to a binary byte, which has 256.
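
The arithmetic in that passage is easy to check. Here is a minimal sketch in Python; the variable names are mine, not Hubert's:

```python
from itertools import product

BASES = "TCGA"  # the four DNA "digits" named in the text

# Enumerate every possible three-letter codon: 4 * 4 * 4 combinations.
codons = ["".join(triplet) for triplet in product(BASES, repeat=3)]

codon_values = len(codons)  # 4 ** 3
byte_values = 2 ** 8        # distinct values of an 8-bit binary byte

print(codon_values, byte_values)  # prints "64 256"
```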

A typical example of a DNA codon is GCC, which encodes the amino acid Alanine. A larger number of these amino acids combined are called a polypeptide or protein, and these are chemically active in making a living being.
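
To make the codon-to-amino-acid mapping concrete, here is a rough sketch of a translator. The table is a deliberately tiny excerpt of the standard genetic code (the real one has 64 entries), and the function name `translate` is my own illustration, not anything from the article:

```python
# Tiny excerpt of the standard genetic code, written as coding-strand DNA.
# The full table has 64 entries; this subset is for illustration only.
CODON_TABLE = {
    "ATG": "Methionine",
    "GCC": "Alanine",
    "GGC": "Glycine",
    "AAA": "Lysine",
    "TGG": "Tryptophan",
    "TAA": None,  # a stop codon: marks the end of the protein
}

def translate(dna):
    """Read dna three letters at a time and return the amino acids.

    Stops at the first stop codon. Raises KeyError for any codon that
    is missing from the excerpt table above.
    """
    peptide = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid is None:
            break
        peptide.append(amino_acid)
    return peptide

print(translate("ATGGCCAAATAA"))  # prints ['Methionine', 'Alanine', 'Lysine']
```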

That’s all pretty basic. Let’s move along to position independent code and conditional compilation:

Code in dynamically linked libraries (.so under Unix, .dll on Microsoft Windows) cannot use static addresses internally, because the code may appear in different places in memory in different situations. DNA has this too, where it is called transposing code:

Nearly half of the human genome is composed of transposable elements or jumping DNA. First recognized in the 1940s by Dr. Barbara McClintock in studies of peculiar inheritance patterns found in the colors of Indian corn, jumping DNA refers to the idea that some stretches of DNA are unstable and “transposable,” i.e., they can move around, both on and between chromosomes.

Of the 20,000 to 30,000 genes now thought to be in the human genome, most cells express only a very small part — which makes sense; a liver cell has little need for the DNA code that makes neurons.

But as almost all cells carry around a full copy (distribution) of the genome, a system is needed to #ifdef out stuff not needed. And that is just how it works. The genetic code is full of #if/#endif statements.
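
The #ifdef analogy can be sketched in a few lines. Everything here is hypothetical: the gene names and cell types are placeholders, and real gene regulation is far more graded than an on/off switch:

```python
# Hypothetical genome: gene name -> cell types in which it is "compiled in".
# Every cell carries the whole dictionary; only the condition differs.
GENOME = {
    "housekeeping_gene": {"liver", "neuron"},
    "liver_gene": {"liver"},
    "neuron_gene": {"neuron"},
}

def expressed_genes(cell_type):
    """Return the genes whose '#if' condition is true for this cell type."""
    return sorted(
        gene for gene, enabled_in in GENOME.items() if cell_type in enabled_in
    )

print(expressed_genes("liver"))   # prints ['housekeeping_gene', 'liver_gene']
print(expressed_genes("neuron"))  # prints ['housekeeping_gene', 'neuron_gene']
```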

This is why stem cells are so hot right now — these cells have the ability to differentiate into everything. The code hasn’t been #ifdeffed out yet, so to speak.

Stated more exactly, stem cells do not have everything turned on — they are not at once liver cells and neurons. Cells can be likened to state machines, starting out as a stem cell. Over the lifetime of the cell, during which time it may clone (fork()) many times, it specializes. Each specialization can be regarded as choosing a branch in a tree.

Each cell can make (or be induced to make) decisions about its future, each of which makes it more specialized. These decisions persist across cloning through transcription factors and through modifications to the way DNA is stored spatially (steric effects).
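
The state-machine picture might look something like the sketch below. The differentiation tree is a heavily simplified assumption of mine, and the `Cell` class with its `specialize` and `fork` methods is purely illustrative:

```python
from dataclasses import dataclass

# Heavily simplified, illustrative differentiation tree:
# each state maps to the more specialized states reachable from it.
TREE = {
    "stem": ["endoderm", "ectoderm"],
    "endoderm": ["liver"],
    "ectoderm": ["neuron", "skin"],
    "liver": [],
    "neuron": [],
    "skin": [],
}

@dataclass(frozen=True)
class Cell:
    state: str = "stem"

    def specialize(self, new_state):
        """Choose a branch; only moves down the tree are allowed."""
        if new_state not in TREE[self.state]:
            raise ValueError(f"cannot go from {self.state} to {new_state}")
        return Cell(new_state)

    def fork(self):
        """Clone the cell; both daughters inherit the parent's state."""
        return Cell(self.state), Cell(self.state)

liver = Cell().specialize("endoderm").specialize("liver")
daughter_a, daughter_b = liver.fork()
print(daughter_a.state, daughter_b.state)  # prints "liver liver"
```

In this toy model a liver cell cannot call specialize("skin"): the decision is persistent, which is the point of the analogy.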

A liver cell, although it carries the genes to do so, will generally not be able to function as a skin cell. There are some indications out there that it is possible to breed cells upwards into the hierarchy, making them pluripotent.

From a coder’s perspective, so-called junk DNA is just dead code, bloat, and comments:

The genome is littered with old copies of genes and experiments that went wrong somewhere in the recent past (say, the last half a million years). This code is there but inactive. These are called pseudogenes.

Furthermore, 97% of your DNA is commented out. DNA is linear and read from start to end. The parts that should not be decoded are marked very clearly, much like C comments. The 3% that is used directly forms the so-called exons. The comments that come in between are called introns.

These comments are fascinating in their own right. Like C comments they have a start marker, like /*, and a stop marker, like */. But they have some more structure. Remember that DNA is like a tape — the comments need to be snipped out physically! The start of a comment is almost always indicated by the letters GT, which thus corresponds to /*, the end is signalled by AG, which is then like */.

However, because of the snipping, some glue is needed to connect the code before the comment to the code after, which makes the comments more like HTML comments, which are longer: <!-- signifies the start, --> the end.
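
Snipping out a comment is simple to mimic on a toy sequence. This sketch assumes that GT and AG alone delimit an intron, which is a gross simplification: real splicing relies on additional signals, and both letter pairs also occur inside exons. The `splice` function and the example sequences are made up for illustration:

```python
import re

# Toy intron pattern: starts with GT and ends at the nearest following AG.
# Real splice sites need far more context than these two markers.
INTRON = re.compile(r"GT[ACGT]*?AG")

def splice(dna):
    """Cut out every toy intron and glue the remaining exons together."""
    return INTRON.sub("", dna)

# Made-up example: two exons with one intron between them.
exon1, intron, exon2 = "ATGGCC", "GTAAACCCAG", "AAATAA"
spliced = splice(exon1 + intron + exon2)

print(spliced)  # prints "ATGGCCAAATAA"
```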

If code and DNA interest you, definitely read the whole thing.

Posted in Science, Technology | 1 Comment »

Comments

Sam J. says:

September 4, 2020 at 2:46 pm

Interesting. Thanks for bringing it to our attention.
