Amino acid dipeptide frequency for Torque teno mini virus 10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
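As a rough illustration of what such a calculation involves, the sketch below counts overlapping adjacent pairs in a set of protein sequences and reports them in per mille. It is a minimal sketch under stated assumptions, not the database's own code: the function name `dipeptide_per_mille` and the toy sequences are made up, sequences use one-letter codes (the table uses three-letter codes), and pairs are counted within each protein only. How the database handles protein boundaries is not stated on this page.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for overlapping adjacent pairs.

    Hypothetical helper for illustration; pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence, one residue at a time.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not taken from the proteome described here.
toy = ["MKRRK", "ARRT"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Because the pairs overlap, a protein of length n contributes n - 1 dipeptides, and the frequencies of all observed pairs sum to 1000‰.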

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
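The arithmetic behind the expected value can be sketched as follows: with 20 amino acids there are 20 × 20 = 400 ordered pairs, so a uniform expectation is 1/400 = 0.25%, or 2.5‰ in the units of the table. The helper names below are hypothetical, and the comparison against a single uniform threshold is an assumption; the exact rule used for the red and blue marking is not given on this page.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expected dipeptide frequency in per mille (1000 / alphabet_size^2)."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a value as over- or underrepresented relative to the uniform expectation.

    Assumption: a plain comparison against the uniform value, for illustration only.
    """
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


print(expected_per_mille())    # 20 standard amino acids
print(expected_per_mille(21))  # including Xaa as a 21st symbol
print(classify(13.171))        # ArgArg value from the table
print(classify(1.013))         # AlaGly value from the table
print(13.171 * 1e-3)           # per mille converted to a fraction
```

Under this assumption, ArgArg (13.171‰) is well above the uniform expectation, while any pair observed only once (1.013‰) falls below it.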

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.0AlaCys: 0.0 ± 0.0
2.026AlaAsp: 2.026 ± 1.431
2.026AlaGlu: 2.026 ± 1.451
0.0AlaPhe: 0.0 ± 0.0
1.013AlaGly: 1.013 ± 0.527
0.0AlaHis: 0.0 ± 0.0
2.026AlaIle: 2.026 ± 1.054
2.026AlaLys: 2.026 ± 1.396
1.013AlaLeu: 1.013 ± 1.643
0.0AlaMet: 0.0 ± 0.0
1.013AlaAsn: 1.013 ± 1.735
4.053AlaPro: 4.053 ± 1.739
3.04AlaGln: 3.04 ± 1.325
1.013AlaArg: 1.013 ± 1.663
0.0AlaSer: 0.0 ± 0.0
5.066AlaThr: 5.066 ± 3.03
1.013AlaVal: 1.013 ± 0.527
0.0AlaTrp: 0.0 ± 0.0
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.013CysAsp: 1.013 ± 0.527
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.013CysGly: 1.013 ± 1.643
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
2.026CysLys: 2.026 ± 2.516
5.066CysLeu: 5.066 ± 1.741
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.013CysPro: 1.013 ± 0.527
1.013CysGln: 1.013 ± 1.663
1.013CysArg: 1.013 ± 0.527
1.013CysSer: 1.013 ± 0.527
1.013CysThr: 1.013 ± 0.527
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.013CysTyr: 1.013 ± 0.527
0.0CysXaa: 0.0 ± 0.0
Asp
0.0AspAla: 0.0 ± 0.0
0.0AspCys: 0.0 ± 0.0
3.04AspAsp: 3.04 ± 3.003
2.026AspGlu: 2.026 ± 1.396
1.013AspPhe: 1.013 ± 1.643
3.04AspGly: 3.04 ± 3.003
0.0AspHis: 0.0 ± 0.0
5.066AspIle: 5.066 ± 1.741
2.026AspLys: 2.026 ± 1.431
1.013AspLeu: 1.013 ± 1.643
3.04AspMet: 3.04 ± 1.815
3.04AspAsn: 3.04 ± 1.581
4.053AspPro: 4.053 ± 1.739
0.0AspGln: 0.0 ± 0.0
2.026AspArg: 2.026 ± 1.396
2.026AspSer: 2.026 ± 1.054
7.092AspThr: 7.092 ± 2.675
4.053AspVal: 4.053 ± 1.454
2.026AspTrp: 2.026 ± 1.054
4.053AspTyr: 4.053 ± 2.107
0.0AspXaa: 0.0 ± 0.0
Glu
0.0GluAla: 0.0 ± 0.0
0.0GluCys: 0.0 ± 0.0
6.079GluAsp: 6.079 ± 2.649
8.105GluGlu: 8.105 ± 3.977
1.013GluPhe: 1.013 ± 0.527
5.066GluGly: 5.066 ± 2.984
0.0GluHis: 0.0 ± 0.0
0.0GluIle: 0.0 ± 0.0
6.079GluLys: 6.079 ± 1.391
6.079GluLeu: 6.079 ± 4.824
0.0GluMet: 0.0 ± 0.0
3.04GluAsn: 3.04 ± 2.098
4.053GluPro: 4.053 ± 2.515
1.013GluGln: 1.013 ± 0.527
6.079GluArg: 6.079 ± 2.65
5.066GluSer: 5.066 ± 3.028
6.079GluThr: 6.079 ± 1.409
0.0GluVal: 0.0 ± 0.0
1.013GluTrp: 1.013 ± 0.527
2.026GluTyr: 2.026 ± 1.431
0.0GluXaa: 0.0 ± 0.0
Phe
2.026PheAla: 2.026 ± 1.054
0.0PheCys: 0.0 ± 0.0
3.04PheAsp: 3.04 ± 1.325
2.026PheGlu: 2.026 ± 1.396
3.04PhePhe: 3.04 ± 1.325
1.013PheGly: 1.013 ± 0.527
2.026PheHis: 2.026 ± 1.054
0.0PheIle: 0.0 ± 0.0
3.04PheLys: 3.04 ± 1.325
3.04PheLeu: 3.04 ± 1.966
1.013PheMet: 1.013 ± 1.216
3.04PheAsn: 3.04 ± 1.325
4.053PhePro: 4.053 ± 1.589
5.066PheGln: 5.066 ± 2.634
3.04PheArg: 3.04 ± 1.581
1.013PheSer: 1.013 ± 1.643
4.053PheThr: 4.053 ± 1.401
0.0PheVal: 0.0 ± 0.0
1.013PheTrp: 1.013 ± 0.527
2.026PheTyr: 2.026 ± 3.47
0.0PheXaa: 0.0 ± 0.0
Gly
3.04GlyAla: 3.04 ± 2.098
2.026GlyCys: 2.026 ± 1.396
2.026GlyAsp: 2.026 ± 1.451
6.079GlyGlu: 6.079 ± 6.464
3.04GlyPhe: 3.04 ± 1.581
4.053GlyGly: 4.053 ± 2.107
1.013GlyHis: 1.013 ± 1.735
1.013GlyIle: 1.013 ± 1.663
4.053GlyLys: 4.053 ± 1.739
1.013GlyLeu: 1.013 ± 0.527
1.013GlyMet: 1.013 ± 1.363
6.079GlyAsn: 6.079 ± 1.391
3.04GlyPro: 3.04 ± 1.581
0.0GlyGln: 0.0 ± 0.0
1.013GlyArg: 1.013 ± 1.735
2.026GlySer: 2.026 ± 1.054
4.053GlyThr: 4.053 ± 2.863
1.013GlyVal: 1.013 ± 0.527
1.013GlyTrp: 1.013 ± 0.527
2.026GlyTyr: 2.026 ± 1.054
0.0GlyXaa: 0.0 ± 0.0
His
1.013HisAla: 1.013 ± 0.527
1.013HisCys: 1.013 ± 1.735
2.026HisAsp: 2.026 ± 1.396
2.026HisGlu: 2.026 ± 1.451
2.026HisPhe: 2.026 ± 1.054
1.013HisGly: 1.013 ± 0.527
2.026HisHis: 2.026 ± 3.325
0.0HisIle: 0.0 ± 0.0
2.026HisLys: 2.026 ± 1.431
3.04HisLeu: 3.04 ± 1.325
0.0HisMet: 0.0 ± 0.0
1.013HisAsn: 1.013 ± 0.527
2.026HisPro: 2.026 ± 2.401
1.013HisGln: 1.013 ± 1.663
3.04HisArg: 3.04 ± 1.374
1.013HisSer: 1.013 ± 1.663
0.0HisThr: 0.0 ± 0.0
0.0HisVal: 0.0 ± 0.0
1.013HisTrp: 1.013 ± 0.527
1.013HisTyr: 1.013 ± 0.527
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
5.066IleCys: 5.066 ± 2.634
3.04IleAsp: 3.04 ± 1.581
2.026IleGlu: 2.026 ± 1.054
5.066IlePhe: 5.066 ± 1.989
0.0IleGly: 0.0 ± 0.0
1.013IleHis: 1.013 ± 0.527
2.026IleIle: 2.026 ± 3.286
4.053IleLys: 4.053 ± 1.739
3.04IleLeu: 3.04 ± 1.971
2.026IleMet: 2.026 ± 0.972
2.026IleAsn: 2.026 ± 1.054
4.053IlePro: 4.053 ± 1.454
5.066IleGln: 5.066 ± 1.798
1.013IleArg: 1.013 ± 0.527
2.026IleSer: 2.026 ± 1.054
2.026IleThr: 2.026 ± 1.396
2.026IleVal: 2.026 ± 1.054
0.0IleTrp: 0.0 ± 0.0
2.026IleTyr: 2.026 ± 1.054
0.0IleXaa: 0.0 ± 0.0
Lys
3.04LysAla: 3.04 ± 2.098
1.013LysCys: 1.013 ± 1.643
7.092LysAsp: 7.092 ± 4.05
7.092LysGlu: 7.092 ± 4.228
3.04LysPhe: 3.04 ± 1.966
1.013LysGly: 1.013 ± 0.527
2.026LysHis: 2.026 ± 2.401
2.026LysIle: 2.026 ± 1.451
4.053LysLys: 4.053 ± 1.511
7.092LysLeu: 7.092 ± 2.607
2.026LysMet: 2.026 ± 1.431
7.092LysAsn: 7.092 ± 2.87
6.079LysPro: 6.079 ± 2.121
4.053LysGln: 4.053 ± 3.369
11.145LysArg: 11.145 ± 2.543
3.04LysSer: 3.04 ± 1.374
3.04LysThr: 3.04 ± 1.581
5.066LysVal: 5.066 ± 1.65
2.026LysTrp: 2.026 ± 1.054
4.053LysTyr: 4.053 ± 1.589
0.0LysXaa: 0.0 ± 0.0
Leu
0.0LeuAla: 0.0 ± 0.0
1.013LeuCys: 1.013 ± 0.527
1.013LeuAsp: 1.013 ± 1.643
4.053LeuGlu: 4.053 ± 1.454
5.066LeuPhe: 5.066 ± 1.483
4.053LeuGly: 4.053 ± 1.739
4.053LeuHis: 4.053 ± 1.642
6.079LeuIle: 6.079 ± 2.8
7.092LeuLys: 7.092 ± 3.688
5.066LeuLeu: 5.066 ± 1.741
4.053LeuMet: 4.053 ± 1.642
2.026LeuAsn: 2.026 ± 1.396
3.04LeuPro: 3.04 ± 1.966
6.079LeuGln: 6.079 ± 1.409
2.026LeuArg: 2.026 ± 1.451
5.066LeuSer: 5.066 ± 1.318
2.026LeuThr: 2.026 ± 1.054
4.053LeuVal: 4.053 ± 3.34
2.026LeuTrp: 2.026 ± 1.054
2.026LeuTyr: 2.026 ± 1.054
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
1.013MetCys: 1.013 ± 1.643
0.0MetAsp: 0.0 ± 0.0
1.013MetGlu: 1.013 ± 0.527
0.0MetPhe: 0.0 ± 0.0
1.013MetGly: 1.013 ± 1.735
1.013MetHis: 1.013 ± 0.527
3.04MetIle: 3.04 ± 1.374
2.026MetLys: 2.026 ± 3.325
2.026MetLeu: 2.026 ± 1.054
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
3.04MetPro: 3.04 ± 1.325
2.026MetGln: 2.026 ± 1.054
0.0MetArg: 0.0 ± 0.0
1.013MetSer: 1.013 ± 1.643
1.013MetThr: 1.013 ± 0.527
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.013MetTyr: 1.013 ± 1.663
0.0MetXaa: 0.0 ± 0.0
Asn
2.026AsnAla: 2.026 ± 1.396
0.0AsnCys: 0.0 ± 0.0
1.013AsnAsp: 1.013 ± 0.527
1.013AsnGlu: 1.013 ± 1.663
4.053AsnPhe: 4.053 ± 1.401
2.026AsnGly: 2.026 ± 1.431
2.026AsnHis: 2.026 ± 1.431
7.092AsnIle: 7.092 ± 2.731
8.105AsnLys: 8.105 ± 4.037
2.026AsnLeu: 2.026 ± 1.054
0.0AsnMet: 0.0 ± 0.0
4.053AsnAsn: 4.053 ± 1.739
9.119AsnPro: 9.119 ± 2.371
2.026AsnGln: 2.026 ± 1.451
3.04AsnArg: 3.04 ± 2.098
6.079AsnSer: 6.079 ± 2.746
5.066AsnThr: 5.066 ± 1.318
2.026AsnVal: 2.026 ± 1.054
1.013AsnTrp: 1.013 ± 0.527
4.053AsnTyr: 4.053 ± 2.107
0.0AsnXaa: 0.0 ± 0.0
Pro
2.026ProAla: 2.026 ± 1.451
1.013ProCys: 1.013 ± 0.527
3.04ProAsp: 3.04 ± 1.325
7.092ProGlu: 7.092 ± 1.491
4.053ProPhe: 4.053 ± 2.902
6.079ProGly: 6.079 ± 1.463
1.013ProHis: 1.013 ± 0.527
3.04ProIle: 3.04 ± 1.581
7.092ProLys: 7.092 ± 2.544
5.066ProLeu: 5.066 ± 1.65
1.013ProMet: 1.013 ± 0.527
2.026ProAsn: 2.026 ± 1.451
5.066ProPro: 5.066 ± 2.634
2.026ProGln: 2.026 ± 1.451
3.04ProArg: 3.04 ± 1.374
5.066ProSer: 5.066 ± 2.634
7.092ProThr: 7.092 ± 2.619
5.066ProVal: 5.066 ± 2.984
2.026ProTrp: 2.026 ± 3.47
7.092ProTyr: 7.092 ± 3.688
0.0ProXaa: 0.0 ± 0.0
Gln
6.079GlnAla: 6.079 ± 4.353
0.0GlnCys: 0.0 ± 0.0
1.013GlnAsp: 1.013 ± 0.527
3.04GlnGlu: 3.04 ± 1.374
1.013GlnPhe: 1.013 ± 0.527
2.026GlnGly: 2.026 ± 1.054
2.026GlnHis: 2.026 ± 1.054
3.04GlnIle: 3.04 ± 1.581
4.053GlnLys: 4.053 ± 1.511
4.053GlnLeu: 4.053 ± 2.515
1.013GlnMet: 1.013 ± 0.527
7.092GlnAsn: 7.092 ± 2.544
2.026GlnPro: 2.026 ± 1.054
2.026GlnGln: 2.026 ± 1.451
2.026GlnArg: 2.026 ± 1.431
2.026GlnSer: 2.026 ± 1.054
7.092GlnThr: 7.092 ± 4.165
1.013GlnVal: 1.013 ± 0.527
1.013GlnTrp: 1.013 ± 1.643
2.026GlnTyr: 2.026 ± 1.431
0.0GlnXaa: 0.0 ± 0.0
Arg
1.013ArgAla: 1.013 ± 1.663
2.026ArgCys: 2.026 ± 1.054
2.026ArgAsp: 2.026 ± 1.054
1.013ArgGlu: 1.013 ± 1.643
2.026ArgPhe: 2.026 ± 1.451
5.066ArgGly: 5.066 ± 4.594
1.013ArgHis: 1.013 ± 1.663
4.053ArgIle: 4.053 ± 1.511
10.132ArgLys: 10.132 ± 3.596
3.04ArgLeu: 3.04 ± 1.325
1.013ArgMet: 1.013 ± 0.527
3.04ArgAsn: 3.04 ± 1.325
5.066ArgPro: 5.066 ± 1.318
5.066ArgGln: 5.066 ± 1.318
13.171ArgArg: 13.171 ± 3.927
2.026ArgSer: 2.026 ± 1.431
6.079ArgThr: 6.079 ± 1.226
1.013ArgVal: 1.013 ± 0.527
1.013ArgTrp: 1.013 ± 0.527
4.053ArgTyr: 4.053 ± 2.107
0.0ArgXaa: 0.0 ± 0.0
Ser
2.026SerAla: 2.026 ± 1.054
0.0SerCys: 0.0 ± 0.0
1.013SerAsp: 1.013 ± 1.663
4.053SerGlu: 4.053 ± 1.642
1.013SerPhe: 1.013 ± 0.527
1.013SerGly: 1.013 ± 0.527
2.026SerHis: 2.026 ± 1.396
0.0SerIle: 0.0 ± 0.0
5.066SerLys: 5.066 ± 5.628
4.053SerLeu: 4.053 ± 1.511
1.013SerMet: 1.013 ± 1.663
7.092SerAsn: 7.092 ± 2.84
4.053SerPro: 4.053 ± 2.107
4.053SerGln: 4.053 ± 1.454
3.04SerArg: 3.04 ± 1.581
10.132SerSer: 10.132 ± 7.157
3.04SerThr: 3.04 ± 1.374
1.013SerVal: 1.013 ± 0.527
1.013SerTrp: 1.013 ± 0.527
2.026SerTyr: 2.026 ± 1.054
0.0SerXaa: 0.0 ± 0.0
Thr
1.013ThrAla: 1.013 ± 0.527
1.013ThrCys: 1.013 ± 1.663
5.066ThrAsp: 5.066 ± 1.483
5.066ThrGlu: 5.066 ± 3.211
3.04ThrPhe: 3.04 ± 1.325
4.053ThrGly: 4.053 ± 1.642
2.026ThrHis: 2.026 ± 1.451
2.026ThrIle: 2.026 ± 1.054
4.053ThrLys: 4.053 ± 1.401
7.092ThrLeu: 7.092 ± 2.607
0.0ThrMet: 0.0 ± 0.0
7.092ThrAsn: 7.092 ± 1.352
8.105ThrPro: 8.105 ± 2.801
6.079ThrGln: 6.079 ± 2.65
5.066ThrArg: 5.066 ± 2.97
5.066ThrSer: 5.066 ± 2.756
6.079ThrThr: 6.079 ± 4.675
2.026ThrVal: 2.026 ± 1.054
3.04ThrTrp: 3.04 ± 1.581
1.013ThrTyr: 1.013 ± 0.527
0.0ThrXaa: 0.0 ± 0.0
Val
1.013ValAla: 1.013 ± 1.663
0.0ValCys: 0.0 ± 0.0
0.0ValAsp: 0.0 ± 0.0
0.0ValGlu: 0.0 ± 0.0
3.04ValPhe: 3.04 ± 1.581
1.013ValGly: 1.013 ± 0.527
2.026ValHis: 2.026 ± 1.431
2.026ValIle: 2.026 ± 1.054
4.053ValLys: 4.053 ± 1.454
2.026ValLeu: 2.026 ± 1.396
0.0ValMet: 0.0 ± 0.0
3.04ValAsn: 3.04 ± 1.325
3.04ValPro: 3.04 ± 1.581
4.053ValGln: 4.053 ± 1.401
3.04ValArg: 3.04 ± 1.581
1.013ValSer: 1.013 ± 0.527
2.026ValThr: 2.026 ± 1.431
1.013ValVal: 1.013 ± 0.527
1.013ValTrp: 1.013 ± 0.527
1.013ValTyr: 1.013 ± 1.643
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.013TrpAsp: 1.013 ± 0.527
3.04TrpGlu: 3.04 ± 1.325
0.0TrpPhe: 0.0 ± 0.0
3.04TrpGly: 3.04 ± 1.581
1.013TrpHis: 1.013 ± 0.527
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.013TrpLeu: 1.013 ± 1.735
0.0TrpMet: 0.0 ± 0.0
2.026TrpAsn: 2.026 ± 1.054
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
6.079TrpArg: 6.079 ± 2.121
0.0TrpSer: 0.0 ± 0.0
2.026TrpThr: 2.026 ± 1.054
1.013TrpVal: 1.013 ± 0.527
0.0TrpTrp: 0.0 ± 0.0
1.013TrpTyr: 1.013 ± 0.527
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.013TyrAla: 1.013 ± 1.663
0.0TyrCys: 0.0 ± 0.0
4.053TyrAsp: 4.053 ± 1.401
0.0TyrGlu: 0.0 ± 0.0
3.04TyrPhe: 3.04 ± 1.581
2.026TyrGly: 2.026 ± 1.396
0.0TyrHis: 0.0 ± 0.0
5.066TyrIle: 5.066 ± 2.634
4.053TyrLys: 4.053 ± 2.863
4.053TyrLeu: 4.053 ± 1.401
1.013TyrMet: 1.013 ± 0.527
2.026TyrAsn: 2.026 ± 1.054
4.053TyrPro: 4.053 ± 1.401
0.0TyrGln: 0.0 ± 0.0
3.04TyrArg: 3.04 ± 1.374
2.026TyrSer: 2.026 ± 1.054
4.053TyrThr: 4.053 ± 2.107
3.04TyrVal: 3.04 ± 1.581
1.013TyrTrp: 1.013 ± 0.527
4.053TyrTyr: 4.053 ± 1.454
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (988 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
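One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the frequency for each resampled set, and take the spread of those estimates as the error. The sketch below follows that reading. It is an assumption-laden illustration rather than the database's procedure: the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation are all choices made here.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping adjacent pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair  # True counts as 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences in one-letter code, not the proteome described here.
toy = ["MKRRK", "ARRT", "GGSG", "MTTPL"]
print(pair_per_mille(toy, "RR"), bootstrap_error(toy, "RR"))
```

With only 4 proteins, as in this proteome, each resample can differ a lot from the original set, which is consistent with the large errors relative to the means in the table.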

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski