Đề tài Tin Sinh học: Các phương pháp tìm kiếm dữ liệu sinh học và ứng dụng trong việc thực hiện Đề tài nghiên cứu

 1. Tin sinh học

 - Tin sinh học là ứng dụng công nghệ thông tin để quản lý dữ liệu sịnh học hoặc nghiên cứu các vấn đề sinh học.

 - Những lĩnh vực nghiên cứu chính của tin sinh học bao gồm bắt cặp trình tự (sequence alữignment), bắt cặp cấu trúc protein (protein structural alignment), dự đoán cấu trúc protein (protein structure prediction), dự đoán biểu hiện gene (gene expression) và tương tác protein - protein (protein-protein interactions), và mô hình hóa quá trình tiến hoá.

 - Các ứng dụng của tin sinh học:

 Nghiên cứu về chuỗi trình tự

 Nghiên cứu về bộ gene

 Nghiên cứu bằng sự tiến hoá của sinh học bằng máy tính

 Nghiên cứu đa dạng di truyền

 Nghiên cứu các đột biến của tế bào ung thư

 So sánh bộ gene

 

ppt42 trang | Chia sẻ: gaobeo18 | Lượt xem: 1258 | Lượt tải: 0download
Bạn đang xem trước 20 trang tài liệu Đề tài Tin Sinh học: Các phương pháp tìm kiếm dữ liệu sinh học và ứng dụng trong việc thực hiện Đề tài nghiên cứu, để xem tài liệu hoàn chỉnh bạn click vào nút TẢI VỀ ở trên
aa gcaatacctt ctctctataa cccttctagg 8101 cctcctaacg gcaaccacgc tcttaattat ttaagctcat caaatcttaa agagcccgaa8161 gtcttctaag agaggtccca aaagaaacga ccagagccaa aaccaaaact actaataaag 8221 ccaaacctcc aataagtaag taaaaacccc ccaagtcgta aagatggcac aaactaccca 8281 ggtccaataa gacatgatta gaaactatac tcacgccgga ttcaaggaaa tcttcaaaga 8341 taatgaacac ccaagatgac aacaaaacaa acagtatcat aacctccccc agatttctaa 8401 ttgtaggata ccgatccgcc gacaaagcac tagaatagac aaacactact aacatacctc 8461 ccatgtagat aagcaacaaa agcaaagcca agaaacttaa ccctagtgaa accaacaaga 8521 cacagccaaa caaagatctt atcatcagac caagagcagc ataataggga gacaaactat 8581 aaaacactaa agttctcccg agaagaagaa acaacataaa caagtaaaat accatttata8641 taaatgacag gaccacttcg aaagagccac ccacttttcc gaattataaa aggctcactt 8701 atagacttac cagccccgag caacctctca atctgatgaa aatttggttc actcctagga 8761 ctttgtctta tcgtacaatt aatcacagga acctttctgg cgatgcacta caccgcagac 8821 atttccttag cgttctcctc agtaagacac atctgtcgag acgtaaaata tggatgactc 8881 ctacgaaaaa tacacgccaa aagagcatcc ttctttttca tatgccttta ctgccacata 8941 ggtcgaggaa tttattatgg atcctacgta aaagaggaaa catgaaatat aggagtaatc 9001 ttatttctca ttacaatgat tacagcattc gtaggttacg tttttccttg aggtcagatg 9061 tccttttgag cagctaccgt tattaccaac ctcctctcag cggtaccata cctaggagaa 9121 accctcgtac agtgagtatg aggagggttc tcagtggaca atgccaccct aactcgattc 9181 tttacctttc actttttgtt tccattcatt atagccgccc tctccattat tcacctctcg 9241 tttctacacc agaaaggttc taacaatcca acagggctag acagaagata cgacaagaca 9301 ccattccacg tctatttctc aactaaggac ctagtaggct ttttatttct tcttgcagga 9361 atcctagcac tagcactcct agcccccacc gccctaaaag acccagaaaa cttcattccg 9421 gccaaacccc ttgtaacacc aacacacatt cagccagaat gatacttcct ctttgcttac 9481 gccatcctcc gctcaatccc taaaaagcta ggaggagtta tagcccttgt agcagccgtt 9541 ctcgtctgat ttttagtacc aatcctccac acttcctcta atcaagcctc tacgtttcgt 9601 cccctctccc aaatcacgtt ctgaattcta atagcaacat tcctcattct cacatgaata 9661 gggagacaac ccgtggagga gccatttata cttataggac aagtagcgtc aatactttac 9721 ttctctatat ttatagtctt cttcccaact atatcaatat tagaaaaaaa gcttctgcta 9781 cggtaaatta aagtagctta aggaaaaagc ttggcgttga aaatgccaga tcaaaggtta 9841 aactcctttc tttaatacaa gcttggttct agcttaaaat ttagctttcc tctatctgcc 9901 acatgcaagc tccgaagcga cccagcaaga tgaaaacact ttacaaaaac ttttcaacag 9961 ataagttctt acctaactac taatcataac ttgcggtcag cagtgcttaa ctttataaaa 10021 tttgggaaac caagaaatag acaagagaaa gaaggatggt taataacgtg ccagcagccg 10081 cggttacacg ttaagcctga gttaaattat accggtccaa agaaggaaag gctaaaaaga 10141 cttattttaa agagaactcc cagcagtagc aagctaaagc ccagaaaata aaataaaaaa 10201 gcccaaccaa aaaagcgaga aataaactgg aattagatac tccactatac tcgctggtaa 10261 actagacaga caccagagta gtacggtttg aactaaaact taaagaactt ggcggttttc 10321 tagccctttt cggaggagct tgtcattgaa ttgataaccc acaagggacc ccaccattct 10381 tagaacatca gcttgtatac catcgtcgtc agctcacctc caagaaagtg aacaaaaagg 10441 aataccccct acgtcagatc aaggtgcagc caatagaatg ggaatcgatg agctacttta 10501 cttgataaca gaggacttat cactgaaaag tgattctgaa agaggattcg acagtaattt 10561 tttaaagaaa tacccaatga aaccagctct agaatgcgca cacatcgccc gtcactctcg 10621 tcaaacgagg agaaaagtcg taacatagta ggtgtactgg aaagtgtacc tggtaaagcc 10681 tttatagttt agctaaaaca agggcttttc aagccctaga cccaggttaa caccctggta 10741 aaagtagaag ctgtaaaagt ttaggctaaa caaccagtct tgtaaactgg caaagaaagt 10801 taaactcttt cttacagcta cccaaaaact aacctaacaa gaacacccgg aaagcttcct 10861 taaactcttt aaaaataaaa accccccaac tcttaccact tacgattgac tgaaatccca 10921 agccaaaatt ttgacacttt ccctttctaa aaaagtgtca aaacctaaaa aaaagtcttc 10981 atttaacttt tacctttctt tcccctcccc tacaaaacgc cccctatgaa cccccccccc 11041 ctcttgttcg tcctctccct cttactccta ccgaaaaatt cataggactc caatcggggg 11101 aaaatctttc tctttcccta tttttcccct gttccgggct ccccacgctc tcactaccca 11161 aaccttatat gttcccaagg atgttgctcg gacaccttca ctcgcctgac cctctattaa 11221 ctgttttcac tatacctcgc aacgttttta ggggttctac acctccatta ccgtccccct 11281 aaaaacgctt gcttggtgtt caaacagtta atagagactc agtagttact caggttcgtt 11341 cttctttttt ttttgattct tttgggacta gattgcttat acctcctaac cggtttcaat 11401 aaagctgata ttgtatattg atctcacctt cttaaccgct cggtgtgacg ataaaaattg 11461 ataccgtcat ttatataaat ggtaaacctt ctacaaaggc tagtttatta aaatggctgc 11521 tttgggggca gcagatatag gaaaattcct atgcttttga aaaggtaaga aacgaactta 11581 catctaggga atcaaaatcc cttatttttc attaaaatac ttctcagagt ctaactgttc 11641 tcttgagttg aagctgaagc aagcatttgg ccgttaacca aaagaatgaa ggtttaactc 11701 ctttcaactc agaggggtaa gtttaaatag caaagaaagt aatgcaatag atttaggctc 11761 tataaccagg ggtgcaagtc ctcttttaaa ctcgccattt tcacaatgtg gtatccaaga 11821 agtaacttgg atcttctgct tgcaaagcgg atatttttgt taaactaaaa ccacggaaga 11881 cttaagttta ataaaactgt gagccttcaa agctcaaaac atagattaaa attctatagt 11941 ctttggaccg gtcttaaggt gtccaacata tttaattgca aattaaaagt tactggttag 12001 cgccagttaa gacttatcga ggcagtttga attgcacaaa catcggctcc gcgtaaagga 12061 gacacttaac tggtttagct atgcttcgaa ccttgctttc gagatctatg agattttaac 12121 tcatgtcttc ggtctgacag tccaacgttt ttcttaactt agatctccct cgcctttcct 12181 ggcaagatga ctgaaatagt agaggattgt aaattctctt atgtaagtgg aagccttact 12241 cttcgccaac aacggtgctg tcatttatat aaatgacaaa accaaacatg tcatgacaca 12301 tttttattat aactaatata cctaacttcc aattaggaga ccttggttca agcccaagaa 12361 aatgtaccag caaggtaagc taaacaagct tttgggctca tactccaaga atggtggtta 12421 aaatcctccc tttgcaaaag ccaacgaaaa accttagtag caaagtggtt aatgcagggg 12481 acctaagatc ccttaccaaa agttcaattc ttttttaagg ttgtgaaagt tattcttttt 12541 ctcctacaag ctttaatatt cctagtacct attctcttat cagtagcatt tttaactctg 12601 gtagaacgga aggtactggg ctatatgcaa ttccgtaagg gccccaaagt agttggccca 12661 tatggactac tccaaccctt tgctgacgcc ttaaagctat ttattatgga aaccctaaag 12721 ccttccacag cttcaccata cctattcttt ttctccccat ttctgtttct tgtgcttgcc 12781 ttaatccttt gatcgctgat cccagcccct gtcccgaccc taaaagttaa cctttcactt 12841 cttctcatcc ttggtatttc tagattagcc gtttacgccc ttctaggctc cgggtgagcc 12901 tctaaatcca agtactcact tctcggaggt attcgagcgg tagcacaaac catatcttat 12961 gaaataagaa tagggttaat tcttctatcc ctcattatat gatcaggttc attctctcta 13021 tcagaaatag tcaagaccca gaaatactgc tgaatgttat taccacattt tcccttattc 13081 ctaatgtgac tagtttctac cttagccgaa acgaaacgag cgccatttga cctaacagaa 13141 ggggaatcag aattggtctc aggttataac gtagagtatg ctggagggcc gttcgcactg 13201 tttttcatcg ccgaatacgc taaaataatc ttcatgaaag ttttaagaac cgttctattt 13261 ctaggagccg gaagaccatt taaagaaaca agaatcctag ggccttcatg tctagccgta 13321 aagacgatag gcttagtatt cttatttcta tgggtgcgag catcttaccc gcgatttcga 13381 tacgaccagc tcatgcactt aacctggaag aaatatctcc ctctttctct gggaatcttc 13441 tcaatggcac tagcatttct gatatcttcg aaagcctcat cttccgttat ctaagtgaaa 13501 agagtttact cctagcagga atacctgacc gataatcaga tagaagaggt taaagtcctc 13561 ttgaactcta tgaaacgtct cataattatt cttcttttga caagccttct tataggaacc 13621 agactggttc tactttctag acattggttc cccatttgac taggcctcga actaagaact 13681 ctctccataa ttcccttact aaacatgaaa ggacattctc gaagaacgga agccaccttg 13741 aagtactttt tagttcaagc tttcagagca gcgctccttc taaaaggagc tgtactcaac 13801 ctctgactct ctaaaagatg aagtctagcg gaaacctcat cccccctttg ctactacacc 13861 atatccacag ccctaattat taagctcgga ttagcaccgt gtcacttttg gttcccagac 13921 gttctaagag gtatttcatt tccgaaagta ataatcatcg catgttgaca aaaggtagca 13981 cccatgttcc tccttctatc cctgtcaaga tacatatcct cagagatact aattctctgc 14041 tctacactat cagtaattgt gggaggatga gggggattaa accagataag aactcgtaag 14101 atcttagcct actcttctat cagtcacctg gggtgagtga cttgtgcctc cttcttcctt 14161 cccgaagtat cttttttatt attcctgttc tacattatca aaaaaaccgc aattttactc 14221 atatgcaaaa acagatctct tttctccctt tcctcactaa gaaaggcaag aataatacca14281 acaaaaatca ttctcttttc ccttgccctt ttgtccctgg gaggactccc tcctctcggc 14341 ggatttataa aaaagatagt acctttaata atcttttcat ttaaaagaag aaaaatcatt 14401 atcccactat tcttaggagg cagccttctc aacctcttct tttatttacg gatcgtctac 14461 aaaactagcc tgaccctctt tccacagaga agaataatcc tgctgtcttc ccgaaaaaca 14521 tcctcccaga ccatatcctc tttcttgata agaattcttt ttcccctttg tttatttggc 14581 ctacttttat taccccctgt ttaatctatt tatataaaga aattctgatc atagatgaaa 14641 tttttaaagt aataacttcc aattctctca caaatcttct actataaaag taccacaagg 14701 gaaaactgaa ataaagtgaa aattaaaagg agtaaaagag tctttcgtac cttttgtatt 14761 atggtttaac aagatttttc aggaaaattc tccaaagccc gaaacctaga gagctaaccc 14821 tcttcctctt aatcaagaga atcctcccac tgttacaaga gtggggaaag aggaagggtt 14881 agaaatgaaa tgttaaacgc gctaggtgat agctggtttc tcaagaaaga agtttaagct 14941 tcccctcctt accccctcta ttatgaacat tctcaatttt taacaagagc acctttaaag 15001 gaaggaaaga aagagggaga caagttcctc tttcaagaag gaaacaacca agagaccagg 15061 aaaggacaat aagatcatcc aaggtttatt atttaagtag gcctaaaagc tgccatcatt 15121 tcggaaagcg ttaaagctca taaaacgaaa taaccaaaaa tatctgtgtt cccacccaaa 15181 cctatcaaaa tggtattgag aaaaattata atgttaaaac gagtaagtat acaaaacagc 15241 tctgggcaac ggaaaactaa cccgatcaac caagaatagt tttggaaacc caagaaaaaa 15301 aacccttgct atccctccaa cacaggagcc cccagaacag tggaaaaagg aaaaggaagg 15361 aactaggcaa acctagagga tgactgttta ccaaaaacat agccccttgg aactcataag 15421 gggttaggcc tgcccagtgg gaagcacttt cctaaacggc cgcggtatct tgaccgtgca 15481 aaggtagcat aatcacttgt ctcttaaatg gggacccgta tgaatggctt ttcatccttt 15541 aactgtctcc ctttttttcc acacaaactc ctatttacgt gaagaagcgt acttcagaga 15601 gaaagacgag aagaccctgt cgagctttag cctaccctac tcacttacac tcatccttcc 15661 ccagggaaaa acctctgtag gccaggcttt ggttggggca accatggaga aaaaatatcc 15721 tccagttttc taggaagaga atcaactctc cttccccatt tttgagaacc aatacttttg 15781 gaaaacggaa aaagttaccg cagggataac agcgttatct tttctaagag cccttattga 15841 cgaaaaggat tgcgacctcg atgttggatt ggggtaacca gagggtgcag cagctcttta 15901 aggttggact gttcgtccat taattcccta catgatctga gttcagaccg acgtgagtca 15961 ggtcagcttc tatcttctat taattctccc tagtacgaaa ggaccggaaa aaagtcttaa 16021 atagttttca cgaaaagaca aaaaacaaaa aaaaaaaaaa aaaaaaagcc ttctatcgga 16081 aacacaatgt gactaaaaac ttctgccttt ttcaatcatt aggtttttcc atacaaaaaa 16141 atttaaaaaa aaatatacaa ggggtccccc tttggttttt tttaattttt caggaaaaat 16201 aaaaagtttt ttaactttag ccaggtgtta atatcaaatg cttttaactt aagaaac // 	Trên đây là 3 ví dụ về việc sử dụng cơ sở dữ liệu NCBI tìm kiếm thông tin về cây dưa leo.	Với phương pháp này chúng ta còn có thể tìm nhiều thông tin khác về cây dưa leo và cả trên nhiều loài sinh vật khác.

File đính kèm:

  • pptTIN SINH HOC P23.ppt