Data
This section provides the data you need to carry out the projects.
Project B4: The Exciting World of Genes and Their Origin - Part II
Below you will find the complete nucleotide sequence that you will investigate in the project.
Saccharomyces cerevisiae S288C chromosome IV - region of interest
AACAATAAACCTTTGCCTAATTTCATATAACATCCAGTATTCATTTTTTTTAGTTGACGCTAAGGCATTCGCTTGTGAAATTAAAGAGCAGATCAGGAATAGTATCAATAGTCTAAACCATCAAGTTGCACAGAAACCACTATATATATATGGAAATATCTCGAATATTGCTTGTATGAACGATAGCCAAAACTGCCTACGACAGAGGGAAGAAAATAGTCATCTGAATCCTGGAAATGACTTCGGCCACCACCAGGGTGCAGAATGTACGATAAATCATAACAACATGCCACACCGCAATGCATACACAGAATCTACGAATGACACGGAAGCAAAGTCCATAGTGATGTGCGACGATCCTAACGCATACCAAATTTCCTACACAAATAATGAGCCGGCGGGAGATGGAGCTATAGAAACCACGTCCATTCTACTATCGCAACCGCTGCCGCTGCGATCGAATGTGATGTCTGTCTTGGTAGGCATATTTGTTGCCGTGGGGGGCTTCTTGTTTGGGTATGACACTGGACTTATAAACAGTATCACGGATATGCCGTATGTTAAAACCTACATTGCTCCGAACCATTCATATTTCACCACTAGCCAAATAGCCATACTCGTATCATTCCTCTCCCTAGGAACATTTTTCGGTGCGTTAATCGCTCCCTATATTTCAGATTCATATGGTAGGAAGCCAACAATTATGTTTAGTACCGCTGTTATCTTTTCCATCGGAAACTCATTACAGGTGGCATCCGGTGGCTTGGTGCTATTAATCGTCGGAAGAGTGATCTCAGGTATCGGGATCGGGATAATCTCTGCTGTGGTTCCTCTTTATCAAGCTGAAGCTGCGCAGAAGAACCTTAGAGGTGCCATCATTTCCAGTTATCAGTGGGCTATCACTATTGGGTTACTCGTGTCCAGTGCAGTATCGCAAGGAACTCATTCCAAAAATGGCCCGTCTTCATATAGAATACCAATTGGTTTGCAGTACGTTTGGTCAAGTATTTTAGCTGTGGGCATGATATTCCTTCCAGAGAGTCCAAGATATTACGTCTTGAAGGATGAACTCAATAAAGCTGCAAAATCGTTATCCTTTTTAAGAGGCCTCCCGATCGAAGATCCAAGACTCTTAGAGGAGCTTGTTGAAATAAAAGCCACTTACGATTATGAAGCATCGTTCGGCCCGTCAACACTTTTAGATTGTTTCAAAACAAGTGAAAATAGACCCAAACAGATTTTACGAATATTTACTGGTATCGCCATACAAGCTTTTCAACAGGCATCTGGTATCAATTTTATATTCTACTATGGAGTTAATTTTTTCAACAACACAGGGGTGGACAACTCTTACTTGGTTTCTTTTATCAGCTATGCCGTCAACGTCGCCTTCAGTATACCGGGTATGTATTTAGTGGATCGAATTGGTAGAAGACCAGTCCTTCTTGCTGGAGGTGTCATAATGGCAATAGCAAATTTAGTCATTGCCATCGTTGGTGTTTCCGAGGGAAAAACTGTTGTTGCTAGTAAAATTATGATTGCTTTTATATGCCTTTTCATTGCTGCATTTTCGGCGACATGGGGTGGTGTCGTGTGGGTGGTATCTGCTGAACTGTACCCACTTGGTGTCAGATCGAAATGTACCGCCATATGCGCTGCCGCAAATTGGCTAGTTAATTTCACCTGTGCCCTGATTACACCTTACATTGTTGATGTCGGATCACACACTTCTTCAATGGGGCCCAAAATATTCTTCATTTGGGGCGGCTTAAATGTCGTGGCCGTTATCGTTGTTTATTTCGCTGTTTATGAAACGAGGGGATTGACTTTGGAAGAGATTGACGAGTTATTTAGAAAGGCCCCAAATAGCGTCATTTCTAGCAAATGGAACAAAAAAATAAGGAAAAGGTGCTTAGCCTTTCCCATTTCACAACAAATAGAGATGAAAACTAATATCAAGAACGCTGGAAAGTTGGACAACAACAACAGTCCAATTGTAC
AGGATGACAGCCACAACATAATCGATGTGGATGGATTCTTGGAGAACCAAATACAGTCCAATGATCATATGATTGCGGCGGATAAAGGAAGTGGCTCGTTAGTAAACATCATCGATACTGCCCCCCTAACATCTACAGAGTTTAAACCCGTGGAACATCCGCCAGTAAATTACGTCGACTTGGGGAATGGTTTGGGTCTGAATACATACAATAGAGGTCCTCCTTCTATCATTTCTGACTCTACTGATGAGTTCTATGAGGAAAATGACTCTTCTTATTACAATAACAACACTGAACGAAATGGAGCTAACAGCGTCAATACATATATGGCTCAACTAATCAATAGCTCATCTACTACAAGCAACGACACATCGTTCTCTCCATCACACAATAGCAATGCAAGAACGTCCTCTAATTGGACGAGTGACCTCGCTAGTAAGCACAGCCAATACACTTCCCCCCAATAAAAACCAATAGCATCTTACGATCGTTCGAGGTCTTATAAACCGTTTTATATAAATTATTCTTATACACGCACTCTTGAGTTTCTTTTAACATTCCAACATGATAGACATAGAAAAGAATTATAGATAACACATAACATTACTCACAAGTAGCCCATGAGTTTGATTTTTCCTCGTTAATTCGGTGTCAAGTGGAATACAAAATGGGTCAACTGATCATAACTATTACCCGGATAATCTACATCATGTGGTACGTTAACGATGTATGGTTTCCTAAATCACCGGGTTCCAGATGCGGGTTGTTGTTGTTCGATCTTGCACCTCCACGTACTCCTTTTGAATAGTACACATATACCCTTACAGGAGATCGGACGTGTGCGATTATTGTTACCTCGTCACCTGTTATCGGTTGAACTGTTTTAGCTCTTTCCATTCCCTTTGTTAATAGTAGTTTAATTCTTCGGAAACTTTCTACGCAGACTCAAGTTGCAATAGGCTAAAGATAAGGCAGACACTTTGTATGTAACGAGTTTCATTAGGAGGGATAGACGCACGACACTATCTGGTACTACGTGTTATATCATACCCAGAAGGAAGGACTCTCTCCACTGAACGGTGGGATGAGTTCCTAAAGGCGCCCCTTTATTTGATCAGGAAGCCGTATTGATTATCTAATAGGGCCTAGTTATCCTAATTGTGGGGAGTCGAGCAGTACGGCTCTGATGTTTTTCGAACGAAGATAAGGAGTTGACATACAAAGTCAACAGAAGTTCTTCTTGTTAGCGTCTCTGTGCTCAATATCTCTCTTTTTTTCTTTAAGTAGTAATTACTAACATCAGCCAACCAATAGAGATAAAAAAAAAAGGAATTAAGATTTCATAGAGAAAAGATGGGTCTATACGCTTCTAAGTTATTCAGCAATCTTTTTGGCAACAAAGAGATGCGTATACTTATGGTTGGTCTAGATGGTGCCGGTAAGACCACCGTTTTGTACAAGTTGAAGTTGGGCGAAGTTATCACTACCATTCCAACCATTGGTTTCAACGTTGAGACTGTCCAATATAAGAACATTTCCTTCACTGTCTGGGACGTCGGTGGACAAGACAGGATTAGATCTTTATGGAGACACTACTACAGAAACACCGAAGGTGTTATTTTTGTCATCGATTCCAACGATAGATCGCGTATTGGTGAAGCCAGAGAAGTCATGCAGAGAATGCTGAATGAAGATGAATTGAGAAATGCTGTCTGGTTAGTCTTCGCTAACAAACAAGATTTGCCAGAAGCCATGTCTGCTGCTGAAATCACCGAAAAATTAGGTTTACATTCTATTAGAAACCGTCCATGGTTTATCCAGTCTACTTGTGCAACCTCGGGTGAAGGTCTGTACGAAGGTCTGGAGTGGTTAAGCAACAACTTGAAGAATCAATCCTAATCTAAATCTGTATAGAACGTTTAGTCATGCGGACCTTGTGTGTTTTGTTTCTAGATTGTTTTATTTTTATGATTGTTGAAGATATAAACCACTGTATAGTT
GTATAAGATAGGATAATGATGGTGCACTGAAAATAAACTTACTAGCTCTTTAATATTGCAACGGCTTGTAACGGGCGCCATGATGACATTCAGAATTATACCACTACTATATGAAAAAATGAAAAGAGGCCCTGCTTTGAACCCGTACATTTTATTCTATAATATTGCATCTGTGGTTTGCCTGACGGCAGCGAGTCCAACACAAAGTCTGGCATATGCTACGAATTTTCCACCATGGATTCAGCACCCAAACATTTGAATTTTTTTTCATGTCGATTGTGAAATTTTACTGAAGATGAGGGTAAATAGAGGCCTGCAATCGTCA
RNA-seq reads
>1
CCAGATGCGGGTTGTTGTTGTTCGATCTTGCACCT
>2
TTTTGAATAGTACACATATACCCTTACAGGAGATC
>3
TAAAAGCAATCATAATTTTACTAGCAACAACAGTTTTT
>4
ACTTCCCATTTCACAACAAATAGAGATGAAAACTA
>5
TATGGTTGGTCTAGATGGTGCCGGTAAGACCACCG
>6
TGGGACGTCGGTGGACAAGACAGGATTAGATCTTT
>7
TACCTCGTCACCTGTTATCGGTTGAACTGTT
>8
GGGTTCAAAGCAGGGCCTCTTTTCATTTTTTCATATA
>9
TCTGCTGCTGAAATCACCGAAAAATTAGGTTTACA
>10
TATAAGATAGGATAATGATGGTGCACTGAAAATAA
>11
CTTATACAACTATACAGTGGTTTATATCTTCAAC
>12
CGGCCCTGCTTTGAACCCGTACATTTTATTCTATA
>13
AATATTGCATCTGTGGTTTGCCTGACGGCAGCGAG
>14
GACTGGTCTTCTACCAATTCGATCCACTAAATACA
>15
GTGGTATAATTCTGAATGTCATCATGGCGCCCGTTACAA
>16
TTTCAAAACAAGTGAAAATAGACCCAAACAGATTT
>17
AGGATGAACTCAATAAAGCTGCAAAATCGTTATCC
>18
CGGTACATTTCGATCTGACACCAAGTGGGTACAG
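
One possible way to start working with the data above is to locate each read in the region of interest by exact string matching on both strands. The following is a minimal sketch, not the method prescribed by the project: the helper names `parse_fasta`, `reverse_complement` and `locate` are hypothetical, and it assumes that reads match the region exactly, without mismatches or splicing. The `excerpt` string is a short stretch copied from the region of interest, used together with read 1.

```python
def parse_fasta(text):
    """Parse FASTA-style text into a dict mapping read name -> sequence."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].strip()
            records[name] = ""
        elif name is not None:
            # Sequence lines belonging to the same record are concatenated.
            records[name] += line
    return records


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    table = str.maketrans("ACGTacgt", "TGCAtgca")
    return seq.translate(table)[::-1]


def locate(reference, read):
    """Return (strand, 1-based start) for every exact match of the read.

    Assumption: exact matches only. A read that spans an intron or
    contains a sequencing error is not found by this sketch.
    """
    hits = []
    for strand, query in (("+", read), ("-", reverse_complement(read))):
        start = reference.find(query)
        while start != -1:
            hits.append((strand, start + 1))
            start = reference.find(query, start + 1)
    return hits


# Short excerpt of the region of interest (for illustration only).
excerpt = "CACCGGGTTCCAGATGCGGGTTGTTGTTGTTCGATCTTGCACCTCCACGTACTCC"
reads = parse_fasta(">1\nCCAGATGCGGGTTGTTGTTGTTCGATCTTGCACCT\n")

for name, read in reads.items():
    print(name, locate(excerpt, read))
```

To use this on the full data set, join the sequence lines into a single string and pass it as `reference`. Positions on the "-" strand are reported as the leftmost coordinate of the match on the given (forward) sequence.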