Amino acid dipepetide frequency for Sea turtle tornovirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.817AlaAla: 2.817 ± 2.228
0.0AlaCys: 0.0 ± 0.0
1.408AlaAsp: 1.408 ± 1.114
2.817AlaGlu: 2.817 ± 1.473
2.817AlaPhe: 2.817 ± 1.226
7.042AlaGly: 7.042 ± 1.333
1.408AlaHis: 1.408 ± 0.914
2.817AlaIle: 2.817 ± 1.473
5.634AlaLys: 5.634 ± 2.452
5.634AlaLeu: 5.634 ± 2.946
4.225AlaMet: 4.225 ± 1.581
2.817AlaAsn: 2.817 ± 1.226
1.408AlaPro: 1.408 ± 0.914
2.817AlaGln: 2.817 ± 0.974
4.225AlaArg: 4.225 ± 3.342
1.408AlaSer: 1.408 ± 1.114
7.042AlaThr: 7.042 ± 1.581
0.0AlaVal: 0.0 ± 0.0
2.817AlaTrp: 2.817 ± 1.828
2.817AlaTyr: 2.817 ± 1.828
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.408CysLys: 1.408 ± 1.114
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.408CysArg: 1.408 ± 1.114
1.408CysSer: 1.408 ± 1.476
1.408CysThr: 1.408 ± 1.114
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.408CysTyr: 1.408 ± 0.914
0.0CysXaa: 0.0 ± 0.0
Asp
2.817AspAla: 2.817 ± 1.828
0.0AspCys: 0.0 ± 0.0
5.634AspAsp: 5.634 ± 4.456
0.0AspGlu: 0.0 ± 0.0
1.408AspPhe: 1.408 ± 1.114
5.634AspGly: 5.634 ± 3.098
1.408AspHis: 1.408 ± 0.914
1.408AspIle: 1.408 ± 1.114
0.0AspLys: 0.0 ± 0.0
7.042AspLeu: 7.042 ± 2.058
0.0AspMet: 0.0 ± 0.0
1.408AspAsn: 1.408 ± 1.114
4.225AspPro: 4.225 ± 1.882
0.0AspGln: 0.0 ± 0.0
1.408AspArg: 1.408 ± 1.114
1.408AspSer: 1.408 ± 1.114
7.042AspThr: 7.042 ± 4.128
1.408AspVal: 1.408 ± 0.914
2.817AspTrp: 2.817 ± 0.974
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
1.408GluAla: 1.408 ± 1.114
1.408GluCys: 1.408 ± 1.114
2.817GluAsp: 2.817 ± 2.228
1.408GluGlu: 1.408 ± 1.114
1.408GluPhe: 1.408 ± 1.476
2.817GluGly: 2.817 ± 2.228
0.0GluHis: 0.0 ± 0.0
0.0GluIle: 0.0 ± 0.0
5.634GluLys: 5.634 ± 2.946
1.408GluLeu: 1.408 ± 1.476
0.0GluMet: 0.0 ± 0.0
2.817GluAsn: 2.817 ± 1.828
1.408GluPro: 1.408 ± 1.476
1.408GluGln: 1.408 ± 1.476
8.451GluArg: 8.451 ± 2.611
4.225GluSer: 4.225 ± 2.73
0.0GluThr: 0.0 ± 0.0
0.0GluVal: 0.0 ± 0.0
1.408GluTrp: 1.408 ± 0.914
2.817GluTyr: 2.817 ± 2.228
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.0PheCys: 0.0 ± 0.0
1.408PheAsp: 1.408 ± 0.914
0.0PheGlu: 0.0 ± 0.0
4.225PhePhe: 4.225 ± 2.742
4.225PheGly: 4.225 ± 1.526
0.0PheHis: 0.0 ± 0.0
2.817PheIle: 2.817 ± 1.828
2.817PheLys: 2.817 ± 1.828
2.817PheLeu: 2.817 ± 2.228
0.0PheMet: 0.0 ± 0.0
4.225PheAsn: 4.225 ± 1.581
0.0PhePro: 0.0 ± 0.0
0.0PheGln: 0.0 ± 0.0
2.817PheArg: 2.817 ± 1.828
1.408PheSer: 1.408 ± 1.114
4.225PheThr: 4.225 ± 0.606
1.408PheVal: 1.408 ± 0.914
0.0PheTrp: 0.0 ± 0.0
4.225PheTyr: 4.225 ± 1.882
0.0PheXaa: 0.0 ± 0.0
Gly
7.042GlyAla: 7.042 ± 0.673
0.0GlyCys: 0.0 ± 0.0
5.634GlyAsp: 5.634 ± 4.456
5.634GlyGlu: 5.634 ± 4.456
4.225GlyPhe: 4.225 ± 2.742
9.859GlyGly: 9.859 ± 2.887
0.0GlyHis: 0.0 ± 0.0
2.817GlyIle: 2.817 ± 1.473
2.817GlyLys: 2.817 ± 0.974
5.634GlyLeu: 5.634 ± 2.274
0.0GlyMet: 0.0 ± 0.0
2.817GlyAsn: 2.817 ± 0.974
2.817GlyPro: 2.817 ± 1.473
0.0GlyGln: 0.0 ± 0.0
5.634GlyArg: 5.634 ± 2.452
4.225GlySer: 4.225 ± 0.606
5.634GlyThr: 5.634 ± 1.948
0.0GlyVal: 0.0 ± 0.0
4.225GlyTrp: 4.225 ± 2.742
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
2.817HisAsp: 2.817 ± 0.974
0.0HisGlu: 0.0 ± 0.0
1.408HisPhe: 1.408 ± 1.114
1.408HisGly: 1.408 ± 0.914
1.408HisHis: 1.408 ± 0.914
1.408HisIle: 1.408 ± 0.914
0.0HisLys: 0.0 ± 0.0
2.817HisLeu: 2.817 ± 2.951
0.0HisMet: 0.0 ± 0.0
1.408HisAsn: 1.408 ± 0.914
1.408HisPro: 1.408 ± 0.914
2.817HisGln: 2.817 ± 1.828
1.408HisArg: 1.408 ± 0.914
5.634HisSer: 5.634 ± 1.948
1.408HisThr: 1.408 ± 0.914
1.408HisVal: 1.408 ± 1.476
2.817HisTrp: 2.817 ± 1.828
1.408HisTyr: 1.408 ± 1.114
0.0HisXaa: 0.0 ± 0.0
Ile
1.408IleAla: 1.408 ± 1.476
1.408IleCys: 1.408 ± 1.114
1.408IleAsp: 1.408 ± 1.114
2.817IleGlu: 2.817 ± 0.974
1.408IlePhe: 1.408 ± 0.914
0.0IleGly: 0.0 ± 0.0
1.408IleHis: 1.408 ± 1.476
4.225IleIle: 4.225 ± 4.427
4.225IleLys: 4.225 ± 1.882
7.042IleLeu: 7.042 ± 5.444
0.0IleMet: 0.0 ± 0.0
2.817IleAsn: 2.817 ± 1.828
1.408IlePro: 1.408 ± 0.914
2.817IleGln: 2.817 ± 1.828
0.0IleArg: 0.0 ± 0.0
4.225IleSer: 4.225 ± 1.581
2.817IleThr: 2.817 ± 1.473
0.0IleVal: 0.0 ± 0.0
2.817IleTrp: 2.817 ± 1.473
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.225LysAla: 4.225 ± 2.73
1.408LysCys: 1.408 ± 0.914
4.225LysAsp: 4.225 ± 2.73
1.408LysGlu: 1.408 ± 1.114
1.408LysPhe: 1.408 ± 0.914
2.817LysGly: 2.817 ± 1.226
1.408LysHis: 1.408 ± 0.914
2.817LysIle: 2.817 ± 1.473
8.451LysLys: 8.451 ± 4.418
5.634LysLeu: 5.634 ± 1.309
0.0LysMet: 0.0 ± 0.807
0.0LysAsn: 0.0 ± 0.0
4.225LysPro: 4.225 ± 1.882
1.408LysGln: 1.408 ± 1.114
8.451LysArg: 8.451 ± 3.442
1.408LysSer: 1.408 ± 0.914
5.634LysThr: 5.634 ± 2.452
2.817LysVal: 2.817 ± 1.226
0.0LysTrp: 0.0 ± 0.0
1.408LysTyr: 1.408 ± 1.476
0.0LysXaa: 0.0 ± 0.0
Leu
7.042LeuAla: 7.042 ± 2.058
1.408LeuCys: 1.408 ± 1.476
4.225LeuAsp: 4.225 ± 1.882
5.634LeuGlu: 5.634 ± 4.134
0.0LeuPhe: 0.0 ± 0.0
1.408LeuGly: 1.408 ± 0.914
2.817LeuHis: 2.817 ± 1.828
5.634LeuIle: 5.634 ± 2.035
4.225LeuLys: 4.225 ± 2.555
9.859LeuLeu: 9.859 ± 3.854
2.817LeuMet: 2.817 ± 1.099
4.225LeuAsn: 4.225 ± 2.742
4.225LeuPro: 4.225 ± 0.606
5.634LeuGln: 5.634 ± 2.936
4.225LeuArg: 4.225 ± 2.73
7.042LeuSer: 7.042 ± 5.444
5.634LeuThr: 5.634 ± 2.274
2.817LeuVal: 2.817 ± 2.951
0.0LeuTrp: 0.0 ± 0.0
2.817LeuTyr: 2.817 ± 0.974
0.0LeuXaa: 0.0 ± 0.0
Met
2.817MetAla: 2.817 ± 0.974
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.408MetGly: 1.408 ± 0.914
1.408MetHis: 1.408 ± 0.914
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.408MetLeu: 1.408 ± 1.476
0.0MetMet: 0.0 ± 0.0
1.408MetAsn: 1.408 ± 0.914
0.0MetPro: 0.0 ± 0.0
1.408MetGln: 1.408 ± 0.914
0.0MetArg: 0.0 ± 0.0
0.0MetSer: 0.0 ± 0.0
1.408MetThr: 1.408 ± 1.476
2.817MetVal: 2.817 ± 1.828
1.408MetTrp: 1.408 ± 0.914
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.225AsnAla: 4.225 ± 1.526
0.0AsnCys: 0.0 ± 0.0
0.0AsnAsp: 0.0 ± 0.0
4.225AsnGlu: 4.225 ± 2.742
2.817AsnPhe: 2.817 ± 1.828
1.408AsnGly: 1.408 ± 0.914
1.408AsnHis: 1.408 ± 1.114
5.634AsnIle: 5.634 ± 2.274
5.634AsnLys: 5.634 ± 2.274
4.225AsnLeu: 4.225 ± 1.526
0.0AsnMet: 0.0 ± 0.0
0.0AsnAsn: 0.0 ± 0.0
7.042AsnPro: 7.042 ± 0.673
1.408AsnGln: 1.408 ± 0.914
1.408AsnArg: 1.408 ± 1.114
2.817AsnSer: 2.817 ± 1.226
0.0AsnThr: 0.0 ± 0.0
1.408AsnVal: 1.408 ± 0.914
0.0AsnTrp: 0.0 ± 0.0
1.408AsnTyr: 1.408 ± 1.114
0.0AsnXaa: 0.0 ± 0.0
Pro
4.225ProAla: 4.225 ± 2.555
0.0ProCys: 0.0 ± 0.0
1.408ProAsp: 1.408 ± 1.114
5.634ProGlu: 5.634 ± 4.134
0.0ProPhe: 0.0 ± 0.0
5.634ProGly: 5.634 ± 4.456
0.0ProHis: 0.0 ± 0.0
1.408ProIle: 1.408 ± 1.114
2.817ProLys: 2.817 ± 0.974
0.0ProLeu: 0.0 ± 0.0
0.0ProMet: 0.0 ± 0.0
0.0ProAsn: 0.0 ± 0.0
2.817ProPro: 2.817 ± 1.226
4.225ProGln: 4.225 ± 1.526
2.817ProArg: 2.817 ± 1.226
7.042ProSer: 7.042 ± 1.581
7.042ProThr: 7.042 ± 3.178
2.817ProVal: 2.817 ± 1.828
0.0ProTrp: 0.0 ± 0.0
2.817ProTyr: 2.817 ± 1.828
0.0ProXaa: 0.0 ± 0.0
Gln
1.408GlnAla: 1.408 ± 0.914
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
1.408GlnGlu: 1.408 ± 1.114
2.817GlnPhe: 2.817 ± 0.974
1.408GlnGly: 1.408 ± 1.114
2.817GlnHis: 2.817 ± 0.974
0.0GlnIle: 0.0 ± 0.0
2.817GlnLys: 2.817 ± 1.473
4.225GlnLeu: 4.225 ± 1.581
1.408GlnMet: 1.408 ± 0.914
7.042GlnAsn: 7.042 ± 2.391
1.408GlnPro: 1.408 ± 0.914
2.817GlnGln: 2.817 ± 2.228
2.817GlnArg: 2.817 ± 2.228
8.451GlnSer: 8.451 ± 2.922
4.225GlnThr: 4.225 ± 2.742
1.408GlnVal: 1.408 ± 1.114
1.408GlnTrp: 1.408 ± 0.914
1.408GlnTyr: 1.408 ± 0.914
0.0GlnXaa: 0.0 ± 0.0
Arg
7.042ArgAla: 7.042 ± 4.128
0.0ArgCys: 0.0 ± 0.0
4.225ArgAsp: 4.225 ± 2.73
7.042ArgGlu: 7.042 ± 3.519
2.817ArgPhe: 2.817 ± 0.974
4.225ArgGly: 4.225 ± 1.581
4.225ArgHis: 4.225 ± 1.526
2.817ArgIle: 2.817 ± 0.974
1.408ArgLys: 1.408 ± 1.114
4.225ArgLeu: 4.225 ± 1.581
0.0ArgMet: 0.0 ± 0.0
0.0ArgAsn: 0.0 ± 0.0
5.634ArgPro: 5.634 ± 1.948
1.408ArgGln: 1.408 ± 1.114
21.127ArgArg: 21.127 ± 12.034
8.451ArgSer: 8.451 ± 5.109
2.817ArgThr: 2.817 ± 1.473
4.225ArgVal: 4.225 ± 1.882
2.817ArgTrp: 2.817 ± 0.974
9.859ArgTyr: 9.859 ± 3.317
0.0ArgXaa: 0.0 ± 0.0
Ser
9.859SerAla: 9.859 ± 3.854
0.0SerCys: 0.0 ± 0.0
1.408SerAsp: 1.408 ± 1.114
0.0SerGlu: 0.0 ± 0.0
7.042SerPhe: 7.042 ± 2.783
5.634SerGly: 5.634 ± 2.035
2.817SerHis: 2.817 ± 0.974
2.817SerIle: 2.817 ± 1.226
2.817SerLys: 2.817 ± 2.951
4.225SerLeu: 4.225 ± 0.606
1.408SerMet: 1.408 ± 0.914
4.225SerAsn: 4.225 ± 1.581
2.817SerPro: 2.817 ± 2.951
9.859SerGln: 9.859 ± 1.717
5.634SerArg: 5.634 ± 2.936
9.859SerSer: 9.859 ± 3.528
7.042SerThr: 7.042 ± 3.503
5.634SerVal: 5.634 ± 2.035
2.817SerTrp: 2.817 ± 1.226
2.817SerTyr: 2.817 ± 1.226
0.0SerXaa: 0.0 ± 0.0
Thr
4.225ThrAla: 4.225 ± 2.742
1.408ThrCys: 1.408 ± 1.114
5.634ThrAsp: 5.634 ± 1.948
2.817ThrGlu: 2.817 ± 1.473
2.817ThrPhe: 2.817 ± 1.828
9.859ThrGly: 9.859 ± 2.887
1.408ThrHis: 1.408 ± 0.914
1.408ThrIle: 1.408 ± 1.476
1.408ThrLys: 1.408 ± 0.914
8.451ThrLeu: 8.451 ± 1.591
0.0ThrMet: 0.0 ± 0.0
8.451ThrAsn: 8.451 ± 0.93
1.408ThrPro: 1.408 ± 1.476
4.225ThrGln: 4.225 ± 1.526
5.634ThrArg: 5.634 ± 4.134
8.451ThrSer: 8.451 ± 3.02
8.451ThrThr: 8.451 ± 0.93
7.042ThrVal: 7.042 ± 2.678
0.0ThrTrp: 0.0 ± 0.0
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
0.0ValCys: 0.0 ± 0.0
2.817ValAsp: 2.817 ± 1.828
0.0ValGlu: 0.0 ± 0.0
0.0ValPhe: 0.0 ± 0.0
1.408ValGly: 1.408 ± 0.914
1.408ValHis: 1.408 ± 1.476
0.0ValIle: 0.0 ± 0.0
0.0ValLys: 0.0 ± 0.0
2.817ValLeu: 2.817 ± 1.473
0.0ValMet: 0.0 ± 0.0
0.0ValAsn: 0.0 ± 0.0
4.225ValPro: 4.225 ± 1.581
5.634ValGln: 5.634 ± 1.948
4.225ValArg: 4.225 ± 0.606
4.225ValSer: 4.225 ± 1.581
5.634ValThr: 5.634 ± 2.452
0.0ValVal: 0.0 ± 0.0
1.408ValTrp: 1.408 ± 0.914
1.408ValTyr: 1.408 ± 1.476
0.0ValXaa: 0.0 ± 0.0
Trp
1.408TrpAla: 1.408 ± 0.914
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
1.408TrpGly: 1.408 ± 0.914
2.817TrpHis: 2.817 ± 1.226
2.817TrpIle: 2.817 ± 1.226
1.408TrpLys: 1.408 ± 1.114
0.0TrpLeu: 0.0 ± 0.0
1.408TrpMet: 1.408 ± 0.914
0.0TrpAsn: 0.0 ± 0.0
4.225TrpPro: 4.225 ± 1.526
0.0TrpGln: 0.0 ± 0.0
4.225TrpArg: 4.225 ± 1.526
4.225TrpSer: 4.225 ± 2.742
2.817TrpThr: 2.817 ± 1.828
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
0.0TyrCys: 0.0 ± 0.0
0.0TyrAsp: 0.0 ± 0.0
0.0TyrGlu: 0.0 ± 0.0
0.0TyrPhe: 0.0 ± 0.0
1.408TyrGly: 1.408 ± 0.914
2.817TyrHis: 2.817 ± 0.974
1.408TyrIle: 1.408 ± 0.914
7.042TyrLys: 7.042 ± 3.519
4.225TyrLeu: 4.225 ± 0.606
2.817TyrMet: 2.817 ± 1.828
1.408TyrAsn: 1.408 ± 1.114
0.0TyrPro: 0.0 ± 0.0
1.408TyrGln: 1.408 ± 1.114
8.451TyrArg: 8.451 ± 5.485
2.817TyrSer: 2.817 ± 2.228
2.817TyrThr: 2.817 ± 0.974
0.0TyrVal: 0.0 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (711 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski