Reverse Engineering Life • 2015 Nov 19 • mahiwaga

This XKCD post (mentioned [previously]) really got me thinking too much about the analogy between genetic code and computer code.

Matthew Cobb adds more nuance (h/t Michael C.):

DNA: optimised source code? • 2015 Nov 18 • Matthew Cobb • Why Evolution Is True

Now that the Central Dogma is recognized to be more flexible than once thought, it seems clear DNA isn’t really source code. It’s more like a serialized object. While DNA can have functions other than data storage and transmission¹,², in general you have to process and deserialize it to generate executable code. The language DNA, RNA, and protein are written in is (bio)chemistry, and the architecture it’s running on is physics.³
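
To make the "serialized object" idea concrete, here is a toy sketch in Python. It treats a short stretch of DNA as stored data, transcribes it into the message-passing form (mRNA), and then "deserializes" that into a chain of amino acids. The function names, the example sequence, and the deliberately tiny codon table are my own inventions for illustration; the handful of codon assignments shown do come from the standard genetic code, but real gene expression involves far more machinery (promoters, splicing, regulation) than this.

```python
# Toy sketch only: a deliberately partial codon table.
# These few entries match the standard genetic code; most codons are omitted.
CODON_TABLE = {
    "AUG": "Met",  # also the usual start codon
    "UUU": "Phe",
    "GGC": "Gly",
    "GCU": "Ala",
    "UAA": None,   # stop codons map to None
    "UAG": None,
    "UGA": None,
}


def transcribe(coding_strand):
    """DNA coding strand -> mRNA: same letters, with T swapped for U."""
    return coding_strand.upper().replace("T", "U")


def translate(mrna):
    """Read codons from the first AUG until a stop codon (or the end)."""
    start = mrna.find("AUG")
    if start == -1:
        return []  # no start codon: nothing to "deserialize"
    peptide = []
    for i in range(start, len(mrna) - 2, 3):
        residue = CODON_TABLE.get(mrna[i:i + 3], "???")  # "???" = not in toy table
        if residue is None:
            break  # stop codon reached
        peptide.append(residue)
    return peptide


stored = "ATGTTTGGCGCTTAA"      # hypothetical sequence: the "serialized object"
message = transcribe(stored)    # the "message passing" form
product = translate(message)    # the "executable" form
print(message)
print(product)
```

The point of the sketch is only that nothing in `stored` is runnable by itself: it has to pass through two processing steps before anything that does work comes out the other end.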

So, basically, you have three types of byte code notation⁴,⁵,⁶ that are related but not identical. All three can interact directly with one another and can be executed directly with no sandboxing whatsoever, although one form is typically used for data storage, one for message passing, and one for actual program execution. The message passing notation was originally used for all three purposes⁷,⁸,⁹, but unsurprisingly it was forked and partly deprecated, although bits of legacy code still rely on its versatility.¹⁰

Molecular and Cell Biology is basically an attempt to reverse engineer a 4-billion-year-old OS and architecture by looking only at console messages and error logs.

DNA sequencing finally gave us the ability to read the file system directly, but it’s still a long, hard slog to figure out the higher-level structures/motifs/moieties.¹¹

And even though we now have file system access, there isn’t any source code or even header files. We’re basically just looking at the raw binary (or quaternary?) code.
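
On the "binary (or quaternary?)" question, the two views are interchangeable: with four bases, each position is one base-4 digit, which is exactly two bits. A minimal Python sketch of that packing follows. The particular base-to-digit assignment and the function names `pack` and `unpack` are arbitrary choices of mine, not any standard encoding.

```python
# Arbitrary assignment of bases to base-4 digits (an assumption, not a standard).
BASE_TO_DIGIT = {"A": 0, "C": 1, "G": 2, "T": 3}
DIGIT_TO_BASE = "ACGT"


def pack(seq):
    """Pack a DNA string into one integer, two bits per base."""
    value = 0
    for base in seq:
        value = (value << 2) | BASE_TO_DIGIT[base]
    return value


def unpack(value, length):
    """Recover the DNA string; length is needed because leading A's pack to zero bits."""
    return "".join(
        DIGIT_TO_BASE[(value >> (2 * i)) & 3] for i in reversed(range(length))
    )


seq = "GATTACA"
packed = pack(seq)
print(bin(packed))
print(unpack(packed, len(seq)))
```

Either way you look at it, the packed integer carries no hint of where a gene starts or what it does, which is the sense in which we are staring at a raw dump with no headers.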

1. Aptamer • Wikipedia
2. extracellular nucleic acids • Wikipedia
3. crossposted on Facebook
4. DNA • Wikipedia
5. RNA • Wikipedia
6. protein • Wikipedia
7. RNA virus • Wikipedia
8. Messenger RNA • Wikipedia
9. Ribozyme • Wikipedia
10. crossposted on Facebook
11. crossposted on Facebook