Amino acid dipeptide frequency for Torque teno midi virus 8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
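
As a rough illustration of what such a calculation can look like, here is a minimal Python sketch. It is not the code used to build this page: the function name `dipeptide_frequencies`, the use of overlapping pairs, the choice not to count pairs across protein boundaries, and the mapping of every non-standard letter to Xaa are all assumptions made for the example.

```python
from collections import Counter

# Three-letter codes for the 20 standard amino acids; anything else maps to Xaa.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        codes = [THREE_LETTER.get(res, "Xaa") for res in seq.upper()]
        # Overlapping windows within one protein: residues (0,1), (1,2), ...
        # Assumption: pairs are not counted across protein boundaries.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two made-up toy sequences (not from this proteome); "U" is treated as Xaa.
freqs = dipeptide_frequencies(["MAAK", "AKU"])
```

In this toy input there are five pairs in total, two of which are AlaLys, so `freqs["AlaLys"]` is 400‰.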

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each amino acid dipeptide would be present at a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
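
The expected value and the over/under comparison can be sketched in a few lines of Python. This is only an illustration: the helper names are hypothetical, and the page does not say whether the colouring uses the 400 or the 441 baseline or any tolerance band, so the plain comparison against the 400-dipeptide expectation is an assumption.

```python
def expected_per_mille(alphabet_size):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed, expected):
    """Label an observed frequency relative to the uniform expectation."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


expected_20 = expected_per_mille(20)  # 2.5 per mille, i.e. 0.25%
expected_21 = expected_per_mille(21)  # about 2.268 per mille when Xaa is included

# Three values copied from the table on this page (in per mille).
observed = {"AlaAla": 21.456, "AlaCys": 0.0, "GluGlu": 12.261}
labels = {pair: classify(value, expected_20) for pair, value in observed.items()}
```

Under this assumption AlaAla and GluGlu would count as overrepresented and AlaCys as underrepresented.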

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
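
A short Python sketch of that conversion, using the AlaAla value from the table. The pair total of 1305 used at the end is an inference rather than something the page states: the tabulated values look like integer multiples of about 0.766‰, which would fit one occurrence among 1305 overlapping pairs in the 1306 residues reported in the statistics line.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction."""
    return value * 1e-3


ala_ala_fraction = per_mille_to_fraction(21.456)  # about 0.021456
ala_ala_percent = ala_ala_fraction * 100          # about 2.1456 %

# Assumption: 1305 dipeptide positions in total (inferred, not stated on the page).
assumed_total_pairs = 1305
approx_count = round(ala_ala_fraction * assumed_total_pairs)
```

With that assumed total, the AlaAla entry would correspond to roughly 28 occurrences.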

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 21.456 ± 10.679
AlaCys: 0.0 ± 0.0
AlaAsp: 5.364 ± 2.82
AlaGlu: 9.195 ± 4.549
AlaPhe: 3.065 ± 1.061
AlaGly: 0.766 ± 0.467
AlaHis: 3.065 ± 2.268
AlaIle: 0.766 ± 0.467
AlaLys: 3.065 ± 0.514
AlaLeu: 1.533 ± 0.933
AlaMet: 1.533 ± 0.933
AlaAsn: 1.533 ± 0.933
AlaPro: 0.766 ± 0.467
AlaGln: 3.065 ± 1.468
AlaArg: 1.533 ± 0.933
AlaSer: 2.299 ± 1.626
AlaThr: 6.13 ± 0.701
AlaVal: 0.0 ± 0.0
AlaTrp: 0.766 ± 0.467
AlaTyr: 0.766 ± 0.467
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 2.299 ± 1.626
CysGlu: 0.766 ± 0.467
CysPhe: 0.0 ± 0.0
CysGly: 0.766 ± 0.467
CysHis: 2.299 ± 1.626
CysIle: 0.0 ± 0.0
CysLys: 4.598 ± 1.153
CysLeu: 3.065 ± 1.273
CysMet: 0.766 ± 0.825
CysAsn: 0.0 ± 0.0
CysPro: 1.533 ± 0.698
CysGln: 1.533 ± 0.698
CysArg: 0.766 ± 0.758
CysSer: 4.598 ± 3.252
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.766 ± 0.467
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.065 ± 1.201
AspCys: 0.0 ± 0.0
AspAsp: 1.533 ± 0.933
AspGlu: 1.533 ± 0.933
AspPhe: 5.364 ± 0.666
AspGly: 3.065 ± 2.268
AspHis: 0.0 ± 0.0
AspIle: 6.897 ± 4.879
AspLys: 0.766 ± 0.467
AspLeu: 3.831 ± 2.333
AspMet: 0.0 ± 0.0
AspAsn: 1.533 ± 0.734
AspPro: 0.766 ± 0.947
AspGln: 1.533 ± 0.933
AspArg: 2.299 ± 1.626
AspSer: 4.598 ± 0.584
AspThr: 3.831 ± 0.821
AspVal: 0.0 ± 0.0
AspTrp: 1.533 ± 0.933
AspTyr: 3.831 ± 0.821
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.0 ± 0.0
GluCys: 0.766 ± 0.758
GluAsp: 3.831 ± 1.834
GluGlu: 12.261 ± 5.726
GluPhe: 3.065 ± 2.268
GluGly: 2.299 ± 1.626
GluHis: 0.766 ± 0.467
GluIle: 1.533 ± 0.933
GluLys: 5.364 ± 1.053
GluLeu: 4.598 ± 1.42
GluMet: 0.766 ± 0.853
GluAsn: 1.533 ± 0.933
GluPro: 0.766 ± 0.467
GluGln: 1.533 ± 0.734
GluArg: 7.663 ± 4.899
GluSer: 4.598 ± 1.829
GluThr: 6.13 ± 2.276
GluVal: 0.766 ± 0.758
GluTrp: 1.533 ± 0.933
GluTyr: 0.766 ± 0.467
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.598 ± 0.584
PheCys: 0.766 ± 0.467
PheAsp: 0.0 ± 0.0
PheGlu: 3.065 ± 1.468
PhePhe: 2.299 ± 1.4
PheGly: 0.766 ± 0.467
PheHis: 0.0 ± 0.0
PheIle: 5.364 ± 0.666
PheLys: 3.065 ± 2.562
PheLeu: 3.065 ± 1.201
PheMet: 0.0 ± 0.0
PheAsn: 1.533 ± 0.734
PhePro: 3.831 ± 1.55
PheGln: 0.766 ± 0.467
PheArg: 1.533 ± 0.933
PheSer: 1.533 ± 0.933
PheThr: 2.299 ± 0.915
PheVal: 0.0 ± 0.0
PheTrp: 1.533 ± 0.933
PheTyr: 4.598 ± 1.859
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.598 ± 0.584
GlyCys: 0.766 ± 0.467
GlyAsp: 2.299 ± 1.626
GlyGlu: 1.533 ± 0.698
GlyPhe: 4.598 ± 1.568
GlyGly: 6.897 ± 1.3
GlyHis: 2.299 ± 1.626
GlyIle: 4.598 ± 1.153
GlyLys: 6.13 ± 0.797
GlyLeu: 5.364 ± 0.833
GlyMet: 0.0 ± 0.0
GlyAsn: 1.533 ± 0.933
GlyPro: 3.831 ± 0.821
GlyGln: 0.766 ± 0.467
GlyArg: 2.299 ± 0.784
GlySer: 0.0 ± 0.0
GlyThr: 3.065 ± 1.201
GlyVal: 0.0 ± 0.0
GlyTrp: 0.766 ± 0.467
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.533 ± 0.698
HisCys: 0.0 ± 0.0
HisAsp: 3.065 ± 1.201
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 2.299 ± 1.626
HisHis: 1.533 ± 0.933
HisIle: 0.0 ± 0.0
HisLys: 1.533 ± 1.516
HisLeu: 6.13 ± 3.099
HisMet: 0.766 ± 0.758
HisAsn: 0.0 ± 0.0
HisPro: 3.065 ± 1.468
HisGln: 3.831 ± 0.821
HisArg: 2.299 ± 1.629
HisSer: 3.831 ± 1.55
HisThr: 2.299 ± 1.4
HisVal: 0.0 ± 0.0
HisTrp: 0.766 ± 0.467
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.766 ± 0.467
IleCys: 0.0 ± 0.0
IleAsp: 1.533 ± 0.933
IleGlu: 0.766 ± 0.467
IlePhe: 3.831 ± 0.821
IleGly: 0.0 ± 0.0
IleHis: 1.533 ± 0.698
IleIle: 1.533 ± 0.933
IleLys: 8.429 ± 1.345
IleLeu: 7.663 ± 2.248
IleMet: 0.0 ± 0.0
IleAsn: 0.0 ± 0.0
IlePro: 3.831 ± 1.685
IleGln: 4.598 ± 0.584
IleArg: 0.0 ± 0.0
IleSer: 0.766 ± 0.467
IleThr: 3.831 ± 0.687
IleVal: 3.065 ± 1.061
IleTrp: 3.065 ± 1.201
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.065 ± 1.061
LysCys: 3.065 ± 0.514
LysAsp: 5.364 ± 2.271
LysGlu: 8.429 ± 4.47
LysPhe: 2.299 ± 0.701
LysGly: 3.831 ± 1.44
LysHis: 4.598 ± 1.056
LysIle: 2.299 ± 0.915
LysLys: 5.364 ± 2.236
LysLeu: 4.598 ± 2.095
LysMet: 0.0 ± 0.0
LysAsn: 3.065 ± 1.977
LysPro: 9.195 ± 0.795
LysGln: 6.13 ± 1.481
LysArg: 6.13 ± 1.625
LysSer: 1.533 ± 0.698
LysThr: 8.429 ± 1.581
LysVal: 2.299 ± 0.915
LysTrp: 0.766 ± 0.467
LysTyr: 6.897 ± 0.772
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.897 ± 3.579
LeuCys: 6.13 ± 3.955
LeuAsp: 0.766 ± 0.467
LeuGlu: 6.13 ± 3.099
LeuPhe: 2.299 ± 1.4
LeuGly: 1.533 ± 0.933
LeuHis: 1.533 ± 0.933
LeuIle: 2.299 ± 0.915
LeuLys: 5.364 ± 2.566
LeuLeu: 9.962 ± 0.782
LeuMet: 3.831 ± 1.55
LeuAsn: 1.533 ± 0.734
LeuPro: 2.299 ± 1.629
LeuGln: 6.13 ± 2.546
LeuArg: 1.533 ± 0.734
LeuSer: 6.897 ± 1.813
LeuThr: 6.897 ± 0.772
LeuVal: 3.065 ± 1.866
LeuTrp: 3.831 ± 0.821
LeuTyr: 2.299 ± 1.4
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.299 ± 1.626
MetCys: 0.0 ± 0.0
MetAsp: 0.766 ± 0.467
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 2.299 ± 1.4
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.533 ± 0.734
MetGln: 3.831 ± 0.821
MetArg: 0.0 ± 0.0
MetSer: 3.831 ± 0.821
MetThr: 1.533 ± 1.075
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 1.533 ± 0.698
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.533 ± 0.734
AsnCys: 0.0 ± 0.0
AsnAsp: 1.533 ± 0.933
AsnGlu: 2.299 ± 1.626
AsnPhe: 0.0 ± 0.0
AsnGly: 0.766 ± 0.947
AsnHis: 0.0 ± 0.0
AsnIle: 2.299 ± 1.4
AsnLys: 1.533 ± 0.698
AsnLeu: 3.065 ± 0.514
AsnMet: 0.766 ± 0.467
AsnAsn: 0.766 ± 0.467
AsnPro: 3.065 ± 1.866
AsnGln: 5.364 ± 0.833
AsnArg: 0.766 ± 0.467
AsnSer: 4.598 ± 3.358
AsnThr: 3.065 ± 1.273
AsnVal: 2.299 ± 1.4
AsnTrp: 0.766 ± 0.467
AsnTyr: 3.065 ± 1.061
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.598 ± 2.661
ProCys: 2.299 ± 1.626
ProAsp: 1.533 ± 0.933
ProGlu: 2.299 ± 1.4
ProPhe: 5.364 ± 1.053
ProGly: 3.831 ± 1.834
ProHis: 1.533 ± 0.933
ProIle: 2.299 ± 1.879
ProLys: 4.598 ± 2.12
ProLeu: 3.065 ± 1.061
ProMet: 0.0 ± 0.0
ProAsn: 1.533 ± 0.734
ProPro: 5.364 ± 2.271
ProGln: 3.831 ± 1.44
ProArg: 2.299 ± 1.629
ProSer: 1.533 ± 0.734
ProThr: 2.299 ± 0.701
ProVal: 3.065 ± 0.514
ProTrp: 1.533 ± 0.734
ProTyr: 6.13 ± 3.732
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.299 ± 0.784
GlnCys: 1.533 ± 0.698
GlnAsp: 5.364 ± 2.82
GlnGlu: 1.533 ± 0.933
GlnPhe: 0.0 ± 0.0
GlnGly: 4.598 ± 2.799
GlnHis: 3.065 ± 1.201
GlnIle: 0.766 ± 0.467
GlnLys: 5.364 ± 2.236
GlnLeu: 7.663 ± 3.121
GlnMet: 2.299 ± 1.4
GlnAsn: 2.299 ± 0.915
GlnPro: 3.831 ± 2.333
GlnGln: 9.962 ± 3.984
GlnArg: 3.065 ± 1.273
GlnSer: 3.065 ± 1.866
GlnThr: 4.598 ± 0.983
GlnVal: 1.533 ± 0.734
GlnTrp: 1.533 ± 0.933
GlnTyr: 3.065 ± 1.273
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.0 ± 0.0
ArgCys: 0.766 ± 0.467
ArgAsp: 3.831 ± 0.821
ArgGlu: 0.766 ± 0.947
ArgPhe: 0.766 ± 0.467
ArgGly: 3.065 ± 1.201
ArgHis: 0.766 ± 0.947
ArgIle: 2.299 ± 1.4
ArgLys: 9.195 ± 2.253
ArgLeu: 0.0 ± 0.0
ArgMet: 2.299 ± 1.061
ArgAsn: 3.065 ± 2.562
ArgPro: 1.533 ± 0.734
ArgGln: 3.065 ± 1.273
ArgArg: 16.858 ± 8.403
ArgSer: 3.065 ± 1.977
ArgThr: 0.766 ± 0.947
ArgVal: 1.533 ± 0.698
ArgTrp: 0.766 ± 0.467
ArgTyr: 2.299 ± 1.4
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.299 ± 1.4
SerCys: 3.065 ± 1.201
SerAsp: 2.299 ± 1.4
SerGlu: 3.831 ± 0.821
SerPhe: 2.299 ± 0.915
SerGly: 3.831 ± 1.55
SerHis: 6.897 ± 4.371
SerIle: 4.598 ± 0.584
SerLys: 6.897 ± 2.546
SerLeu: 3.065 ± 0.514
SerMet: 0.766 ± 0.688
SerAsn: 5.364 ± 2.236
SerPro: 1.533 ± 1.895
SerGln: 3.065 ± 1.273
SerArg: 2.299 ± 0.784
SerSer: 10.728 ± 9.661
SerThr: 4.598 ± 1.056
SerVal: 1.533 ± 0.933
SerTrp: 0.0 ± 0.0
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.831 ± 0.821
ThrCys: 1.533 ± 0.933
ThrAsp: 3.065 ± 1.866
ThrGlu: 3.065 ± 1.468
ThrPhe: 2.299 ± 1.4
ThrGly: 9.195 ± 2.307
ThrHis: 1.533 ± 0.933
ThrIle: 3.831 ± 0.821
ThrLys: 5.364 ± 0.833
ThrLeu: 5.364 ± 1.481
ThrMet: 0.0 ± 0.0
ThrAsn: 3.065 ± 1.061
ThrPro: 6.13 ± 3.177
ThrGln: 3.831 ± 1.685
ThrArg: 1.533 ± 1.075
ThrSer: 6.13 ± 3.217
ThrThr: 6.897 ± 1.619
ThrVal: 1.533 ± 0.933
ThrTrp: 0.766 ± 0.467
ThrTyr: 0.766 ± 0.467
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.533 ± 0.933
ValCys: 0.766 ± 0.467
ValAsp: 0.0 ± 0.0
ValGlu: 0.766 ± 0.467
ValPhe: 0.0 ± 0.0
ValGly: 0.766 ± 0.467
ValHis: 1.533 ± 1.516
ValIle: 1.533 ± 0.933
ValLys: 3.831 ± 0.687
ValLeu: 0.766 ± 0.467
ValMet: 0.0 ± 0.0
ValAsn: 1.533 ± 0.933
ValPro: 1.533 ± 0.933
ValGln: 2.299 ± 1.4
ValArg: 2.299 ± 0.784
ValSer: 2.299 ± 0.915
ValThr: 0.766 ± 0.467
ValVal: 1.533 ± 0.698
ValTrp: 0.766 ± 0.467
ValTyr: 0.766 ± 0.947
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.533 ± 0.933
TrpCys: 2.299 ± 1.626
TrpAsp: 0.766 ± 0.467
TrpGlu: 1.533 ± 0.933
TrpPhe: 0.766 ± 0.467
TrpGly: 0.766 ± 0.467
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 3.065 ± 1.061
TrpMet: 2.299 ± 1.626
TrpAsn: 0.766 ± 0.467
TrpPro: 1.533 ± 0.933
TrpGln: 0.766 ± 0.467
TrpArg: 0.0 ± 0.0
TrpSer: 0.766 ± 0.467
TrpThr: 1.533 ± 0.933
TrpVal: 0.766 ± 0.467
TrpTrp: 0.766 ± 0.467
TrpTyr: 1.533 ± 0.933
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.766 ± 0.467
TyrCys: 0.0 ± 0.0
TyrAsp: 0.766 ± 0.467
TyrGlu: 1.533 ± 0.933
TyrPhe: 2.299 ± 0.784
TyrGly: 2.299 ± 1.4
TyrHis: 0.0 ± 0.0
TyrIle: 2.299 ± 1.4
TyrLys: 6.897 ± 0.605
TyrLeu: 3.065 ± 1.866
TyrMet: 0.766 ± 0.947
TyrAsn: 6.897 ± 1.398
TyrPro: 3.065 ± 1.061
TyrGln: 1.533 ± 0.698
TyrArg: 1.533 ± 0.933
TyrSer: 2.299 ± 1.4
TyrThr: 0.766 ± 0.467
TyrVal: 2.299 ± 1.4
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.533 ± 0.933
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1306 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
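
One way such a protein-level bootstrap could work is sketched below in Python. This is an assumption-laden illustration, not the database's actual procedure: the function names, the fixed seed, the use of one-letter codes, and the choice of the population standard deviation across replicates as the reported error are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequencies(sequences):
    """Per-mille frequency of each overlapping residue pair (one-letter codes)."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_errors(sequences, replicates=100, seed=0):
    """Resample whole proteins with replacement; return {pair: spread in per mille}."""
    rng = random.Random(seed)
    replicate_freqs = [
        # Each replicate draws as many proteins as the original set, with replacement.
        pair_frequencies(rng.choices(sequences, k=len(sequences)))
        for _ in range(replicates)
    ]
    pairs = set().union(*replicate_freqs)
    return {
        # A pair missing from a replicate contributes a frequency of 0.
        pair: statistics.pstdev([freqs.get(pair, 0.0) for freqs in replicate_freqs])
        for pair in pairs
    }


# Made-up toy proteins, only to show the call.
errors = bootstrap_errors(["MAAK", "AKAA", "KKAM", "MAKA"])
```

Because whole proteins are resampled, a proteome with only a handful of proteins, such as the 4 proteins here, can give large errors relative to the frequencies themselves.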

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski