In this research a way to model GPRAs on an Altera Cyclone II FPGA was developed, using Quartus II programmable logic device design software and the VHDL hardware description language. The model consisting of 7 genes and 7 reactions was implemented using 7 combinational functions and 14 I/O pins.
Genome-scale metabolic models have proven to be incredibly useful. Allowing researchers to model cellular functionality based upon gene expression. However, as the number of genes and reactions increases, it can become computationally demanding. The first step in genome-scale metabolic modelling is to model the relationship between genes and reactions in the form of Gene-Protein-Reaction Associations (GPRA).
This model will be the first step towards creating a full genome scale metabolic model on FPGA devices which the author will be fully investigating in future studies.
Abstract
Genome-Scale metabolic models have proven to be incredibly use- ful.Allowing researchers to model cellular functionality based upon gene expression. However as the number of genes and reactions in- creases it can become computationally demanding. The first step in genome-scale metabolic modelling is to model the relationship between genes and reactions in the form of Gene-Protein-Reaction Associations (GPRA). In this research we have developed a way to model GPRAs on an Altera Cyclone II FPGA using Quartus II programmable logic device design software and the VHDL hardware description language. The model consisting of 7 genes and 7 reactions was implemented us- ing 7 combinational functions and 14 I/O pins. This model will be the first step towards creating a full genome scale metabolic model on FPGA devices which we will be fully investigating in future studies.
1 Keywords
FPGA, Genome-scale metabolic models, GPRA, Genes, Reactions, FBA
2 Introduction
Genome scale metabolic models have gone a long way since the first genome- scale metabolic model which was developed in 1995 for Haemophilus in- fluenzae a single celled organism [1]. This model was based on assembly of unselected pieces of DNA from the whole chromosome had been applied to obtain the complete nucleotide sequence. However the preferred approach today is Flux Balance Analysis (FBA). FBA can be traced back to Papout- sakis et al who in 1984 demonstrated the first use of flux balance equations from metabolic maps [6]. This approach has an advantage over more tradi- tional approaches as it requires very little information about enzyme kinetics or concentration of metabolites in the organism to be described. Instead it assumes that the system will be steady state ie that fluxes are balanced and that the system will always be optimal. The steady state assumption means that the system can be reduced to a set of linear equations which can then be solved using a linear solver such as Gurobi [5], [3]. The first step in constructing a genome scale metabolic model is to describe the relationship between genes and the reactions they encode for. These are called Gene- Protein-Reaction Associations (GPRAs). These are implemented as boolean rules and depending on the reaction these rules can become quite complex. In their simplest form however they model isozymes and enzyme complexes where an isoenyme can be described as
Abbildung in dieser Leseprobe nicht enthalten
where g 1 and g 2 are the genes coding for the reaction enzyme. Enzyme complexes can be described as
Abbildung in dieser Leseprobe nicht enthalten
One real world example of a GPRA would be (HGNC:6541 ∨ HGNC:6535 ∨ HGNC:19708) which would describe the reaction Lactate Dehydrogenase. All in-silico models so far have been software based however dedicated hard-ware such as FPGAs may be a promising alternative. FPGAs as a platform would provide several advantages as they would provide faster computational speeds. Given that GPRAs are essentially boolean rules these could be mod- elled naturally on an FPGA using in built logic.
3 Materials and Methods
3.1 Selection of GPRAs to be modelled
For this research pre-existing GPRAs had to be found from pre-existing lit- erature. GPRA’s were found from Virtual Metabolic Human (VIHM) a site that curates genes,reactions, metabolites and diseases as well as provide in- formation on how all these components interact including GPRAs [4]. For this research GPRAs and reactions that were associated with Lactate Dehy- drogenase were selected and are outlined in Table 1. Furthermore Lactate Dehydrogenase is an enzyme that catalyzes the conversion of pyruvate to lactate. The main reason for selecting these are that many genes involved in Lactate Dehydrogenase are also involved in a number of other reactions. Furthermore Lactate Dehydrogenase is a notable enzyme that has been stud- ied for its potential role as a biomarker in tumours [2].
Abbildung in dieser Leseprobe nicht enthalten
Table 1: Modelled Reactions and their GPRAs. Note the gene IDs are HGNC
3.2 Design and Synsthesis on Quartus II
Quartus II 64-bit is software for analysis and synthesis of HDL designs. Using Quartus II GPRAs were implemented using VHDL a hardware description language used in the synthesis of HDL designs. Genes were implemented as Std logic input signals using while reactions were implemented as Std logic output signals.
GPRAs were then mapped to the reactions using Boolean combinations of genes. Due to the boolean nature of the model gene expression of any gene is simplified to an On/Off state. If a gene signal is true then this corresponds to that particular gene being knocked in. The same can be said for reactions where if the reaction output is true then the reaction is turned on. Next the GPRAs were compiled and a vector waveform file created.A Block Diagram was also created as seen in Figure 1. The vector waveform file provides a way to analyze the functionality of the compiled design whose output is a timing diagram allowing the user to see the inputs and outputs as seen in Figure 1. In the waveform file the genes were grouped together and counted from 0000000 to 1111111 in grey code in order to go through each combination of gene knock-in/knock-out
Abbildung in dieser Leseprobe nicht enthalten
Figure 1: Block Diagram of GPRA Model. Note the genes as input signals to the left and reactions as outputs to the right
4 Results
The results shown in Figure 2 show all combinations of gene knock-in and knock-out were simulated in 989ns with each knock-in/knock-out taking 10ns.The waveform shows that reactions were modelled correctly with no logical hazards. However the risk of hazards is increased as the time between input changes is decreased. Analysis and Synthesis summary shown in Fig- ure 3 shows that the model is very compact as it uses only 7 combinational functions and 14 pins.
Abbildung in dieser Leseprobe nicht enthalten
Figure 2: Timing Diagram in Quartus II
5 Discussion
From the results it can be seen that modelling GPRAs on an FPGA could be a very promising first step in creating genome scale metabolic models on hardware. This is given that modelling GPRAs is very natural on an FPGA as they are boolean rules that can be implemented well using in-built logic in FPGA devices. Furthermore it can be seen that FPGAs provide an advantage in terms of speed compared to software implemented models given that the hardware in the FPGA will be fully dedicated to the model. From Figure 2 it is seen that all combinations of gene knock-in and knock-out were completed in 989 ns. However for a fully fledged genome scale model to be implemented on FPGAs more research will have to be done to implement Flux Balance Analysis on these devices which we will investigate in future studies.
Abbildung in dieser Leseprobe nicht enthalten
Figure 3: Analysis and Synthesis Summary
References
[1] R.D. Fleischmann, M.D. Adams, and O. White. Whole-genome ran- dom sequencing and assembly of haemophilus influenzae rd. Scienc e, 269(5223), 7 1995.
[2] Vladimir Jurisic, Sandra Radenkovic, and Gordana Konjevic. The actual role of ldh as tumor marker, biochemical and clinical aspects, 2015.
[3] Jan Kronqvist, David E. Bernal, Andreas Lundell, and Ignacio E. Gross- mann. A review and comparison of solvers for convex minlp. Optimization and Engineering, 20(2):397–455, Jun 2019.
[4] Alberto Noronha, Jennifer Modamio, Yohan Jarosz, Elisabeth Guerard, Nicolas Sompairac, German Preciat, Anna Dr¨ofn Dan´ıelsd´ottir, Max Krecke, Diane Merten, Hulda S Haraldsdo´ttir, Almut Heinken, Laurent Heirendt, Stefan´ıa Magnu´sd´ottir, Dmitry A Ravcheev, Swagatika Sahoo, Piotr Gawron, Lucia Friscioni, Beatriz Garcia, Mabel Prendergast, Al- berto Puente, Mariana Rodrigues, Akansha Roy, Mouss Rouquaya, Luca Wiltgen, Alise Zˇagare, Elisabeth John, Maren Krueger, Inna Kuperstein, Andrei Zinovyev, Reinhard Schneider, Ronan M T Fleming, and Ines Thiele. The Virtual Metabolic Human database: integrating human and gut microbiome metabolism with nutrition and disease. Nucleic Acids Research, 47(D1):D614–D624, 10 2018.
[5] Jeffrey D Orth, Ines Thiele, and Bernhard Ø Palsson. What is flux balance analysis?, Mar 2010.
[6] Eleftherios Terry Papoutsakis. Equations and calculations for fermen- tations of butyric acid bacteria. Biotechnology and Bioengineering, 26(2):174–187.
6 Supplementary Materials
VHDL Code can be found in GitHub: https://github.com/mcoggins96/Modelling- Gene-Protein-Reaction-Associations-on-an-FPGA
7 Dedication
Frequently asked questions
What is the purpose of this research on modeling Gene-Protein-Reaction Associations (GPRAs) on an FPGA?
The research explores the potential of using FPGAs (Field-Programmable Gate Arrays) to model Gene-Protein-Reaction Associations (GPRAs) for genome-scale metabolic modeling. This is motivated by the increasing computational demands of large-scale metabolic models and the potential for FPGAs to offer faster computational speeds.
What are Gene-Protein-Reaction Associations (GPRAs)?
GPRAs describe the relationship between genes and the reactions they encode. They are often implemented as boolean rules, representing how genes control the activation or deactivation of metabolic reactions.
What is Flux Balance Analysis (FBA)?
Flux Balance Analysis (FBA) is a method used in genome-scale metabolic modeling that relies on mass balance constraints around metabolites in a network, and doesn't necessarily require specific kinetic data of individual reactions. It assumes a steady-state condition within the system and that the system will always be optimal.
Why were Lactate Dehydrogenase associated GPRAs chosen for this study?
GPRAs and reactions associated with Lactate Dehydrogenase were selected because many of the genes involved in Lactate Dehydrogenase are also involved in a number of other reactions. Furthermore, Lactate Dehydrogenase is a notable enzyme that has been studied for its potential role as a biomarker in tumours.
How was the GPRA model implemented on the Altera Cyclone II FPGA?
The GPRA model, consisting of 7 genes and 7 reactions, was implemented on an Altera Cyclone II FPGA using Quartus II software and the VHDL hardware description language. Genes were represented as input signals, and reactions were represented as output signals. Boolean combinations of genes were used to map GPRAs to the reactions. The model uses 7 combinational functions and 14 I/O pins.
What were the key results of the GPRA modeling on FPGA?
The simulation results showed that the reactions were modeled correctly without logical hazards, and all combinations of gene knock-in and knock-out were simulated in 989ns. The analysis and synthesis summary indicated that the model is compact, using only 7 combinational functions and 14 pins.
What is the significance of modeling GPRAs on an FPGA?
Modeling GPRAs on an FPGA represents a promising first step towards creating genome-scale metabolic models on hardware. FPGAs offer potential advantages in terms of speed and dedicated hardware resources compared to software-based models.
What are the future research directions based on this work?
Future research will focus on implementing Flux Balance Analysis (FBA) on FPGAs to create a fully-fledged genome-scale metabolic model. This will require further investigation into efficient hardware implementations of linear solvers and constraint-based optimization techniques.
Where can the VHDL code for this GPRA model be found?
The VHDL code for the GPRA model can be found on GitHub at: https://github.com/mcoggins96/Modelling-Gene-Protein-Reaction-Associations-on-an-FPGA
- Quote paper
- Macauley Coggins (Author), 2019, Genome-scale metabolic models. Modelling gene-protein-reaction associations on an FPGA, Munich, GRIN Verlag, https://www.hausarbeiten.de/document/490231