Bioinformatics is a recently found science that utilizes computer science, molecular biology, chemistry and mathematics in order to help figure out contemporary biological dilemmas. The interdisciplinary science was first introduced by Margaret Dayhoff, Walter M. Fitch, Russel F. Doolittle as well as other scientists who helped contribute to the field (36). The science was later coined "Bioinformatics" by Paulien Hogeweg and Ben Hesper (34). The efforts began in the 1960s while not being completely understood nor utilized to its full capability until the early 2000s (35). Bioinformatics has been in recent years the forefront of the sciences as it helps use computational techniques to aid in gene discovery, diagnosis, treatment, and cures (33). Bioinformatics also plays a role in storage of mass data in DataBanks, which are databases that hold information on different types of proteins, isoforms, nucleic acids, genomes and DNA. These DataBanks are significant as they allow for the newest information to become easily accessible and allow for scientists to have access to the latest research, enabling scientists to have the ability to build upon research, compare and contrast data all in one place (33).
Bioinformatics importance comes down to the many ways in which it can be used on a large scale. Bioinformatics is the considered the future of the sciences not only in molecular, cellular and fundamental biology but also chemistry and mathematics as they all come together to make the science possible. These techniques allowed bioinformatics to be the leading the front in future medicine (33) as it allows for new application in medicine, drug discovery, and is currently leading the path in molecular medicine (33). With the future being data driven, so is the sciences (36). Therefor with all this data there must be a way to compare, contrast and analyze the accumulation of data. Bioinformatics allows for collection, organization and curation of the high volume of data while also making it trouble-free to bring forth. These advances allow biological information to be easy to find in one place, this is beneficial in the case that when problems arise in biological information, bioinformatics has the ability to take on large scale, complex problems that no other individual science would be able to take on alone. All this information is stored in DataBanks, whether it be basic biological information, Nucleic Acid sequence, Amino Acid sequence, diseases, gene expression and function, protein function and structure, domains or Biochemical pathways (33). Not only can all this information be stored but due to the high volume of information, predictions can be made on unknown sequences, structures or functions which help in overcoming diseases (33). The ability to make predictions on unknown proteins sequences is essential in modern science as it allows for scientists and bioinformatists to predict functions that would otherwise be unknown without months of research, this research being prone to human error, systemic error and/or random error. Bioinformatics Genomics (33), also known as the science of comparisons is the reason behind the powerful methods and tools used in bioinformatics. The fact that an entire genome of one species can be compared to the entire genome of another species is nothing short of excellence on the behalf of the computational science. Bioinformatics also uses tools such as DNA micro arrays to detect gene expression on a large scale allowing for thousands of genes to be tested at once, not only showing expression but how much that gene is actually being expressed at the given time of the experiment. Proteins can also be computed, designed and predicted using bioinformatic tools. Sequences can be predicted based on function of a given protein as certain functions go hand and hand with the amino acid sequences allowing for 3D computational design of proteins (33).
Bioinformatics databanks can be used to exchange information across the globe. These databanks are free, easily accessible and open to the general public. There are different Databanks for different aspects of the genome. There are individual nucleic acid sequences databanks that allow for anyone to access data on gene function and expression, evolutionary relationships and the ability to find new discoveries whether it be in medicine or cellular biology (33). There are also amino acid sequence databanks which are useful in finding information on the protein of intrest(33). While there are numerous databanks which play a role in the field of bioinformatics some that must be highlighted are the Molecular Modeling Databanks, Mendelian Inheritance databanks, Chromosome, Genome, Motif, Function and Mutation databanks (33). All these databanks play a significant role in the integrity and longevity of bioinformatics as they allow for diversity of information in the field.
Bcl-2 is a well-studied protein due to it's role in multiple disorders , most noticeably being lymphomas due to it being most prevalent in lymphocytes with a noticeable concentration in nerve cells as well(11). It's role as an anti-apoptotic protein is linked to uncontrolled cellular growth of mainly lymphocytes(26) which has called upon much research invested in understanding it's function for therapeutic reasons. Bioinformatics has played a large role in the understanding of it's function as we have obtained information on its genomic sequence(3), gene location(2), transcript variants (4,6), protein isoforms (5,7), as well as it's structure and mutations (14,15). By knowing the fundamental characteristics of this protein we have been able to determine it's function and biochemical pathways it works on. This has allowed us to begin to experiment with therapies that target it's anti-apoptotic activity in an attempt to prevent lymphomas linked to its function(26).