Amino acid dipeptide frequency for Torque teno mini virus 11

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
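As an illustration of the idea, here is a minimal Python sketch that counts overlapping residue pairs and reports them in per mille. The function name dipeptide_per_mille and the toy sequences are hypothetical, one-letter codes are used instead of the three-letter codes of the table, and counting pairs only within each protein is an assumption; the page does not state how Proteome-pI performs the count.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs inside one protein; pairs never span two proteins
        # (an assumption made for this sketch).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not the viral proteome.
toy = ["MKKLS", "AKK"]
freqs = dipeptide_per_mille(toy)
print(round(freqs["KK"], 1))  # 2 of 6 pairs are KK -> 333.3
```

The frequencies of all observed pairs sum to 1000 per mille, which is a quick sanity check on any such table.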

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
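The uniform baseline and the over/under marking can be sketched as follows. The function names are hypothetical, and both the choice of baseline (400 versus 441 dipeptides) and the strict comparison used for the marking are assumptions, since the page does not say which threshold drives the colouring.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency if every dipeptide were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille, alphabet_size=20):
    """Label a value as over- or under-represented against the uniform baseline."""
    expected = uniform_expectation_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"   # would be marked red
    if observed_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


print(uniform_expectation_per_mille(20))  # 2.5, i.e. 0.25%
print(classify(13.641))  # LysLys value from the table -> over
print(classify(1.049))   # AlaGlu value from the table -> under
```

With Xaa included (alphabet_size=21) the baseline drops to roughly 2.27‰, which does not change the label of any value shown in the table.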

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.296 ± 5.962
AlaCys: 2.099 ± 1.137
AlaAsp: 2.099 ± 1.137
AlaGlu: 1.049 ± 0.568
AlaPhe: 1.049 ± 0.568
AlaGly: 2.099 ± 1.989
AlaHis: 1.049 ± 1.496
AlaIle: 2.099 ± 1.989
AlaLys: 3.148 ± 0.947
AlaLeu: 3.148 ± 1.784
AlaMet: 1.049 ± 0.568
AlaAsn: 4.197 ± 1.746
AlaPro: 2.099 ± 1.115
AlaGln: 4.197 ± 2.231
AlaArg: 1.049 ± 0.568
AlaSer: 1.049 ± 1.496
AlaThr: 5.247 ± 1.989
AlaVal: 0.0 ± 0.0
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.049 ± 0.568
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.049 ± 0.568
CysCys: 0.0 ± 0.0
CysAsp: 1.049 ± 2.319
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 2.099 ± 1.989
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 2.099 ± 1.989
CysMet: 0.0 ± 0.0
CysAsn: 1.049 ± 0.568
CysPro: 1.049 ± 0.568
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 3.148 ± 1.705
CysThr: 0.0 ± 0.0
CysVal: 1.049 ± 0.568
CysTrp: 1.049 ± 0.568
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.099 ± 4.637
AspCys: 0.0 ± 0.0
AspAsp: 3.148 ± 4.283
AspGlu: 6.296 ± 0.897
AspPhe: 2.099 ± 1.137
AspGly: 2.099 ± 4.637
AspHis: 1.049 ± 0.568
AspIle: 1.049 ± 0.568
AspLys: 5.247 ± 1.266
AspLeu: 4.197 ± 5.613
AspMet: 1.049 ± 2.319
AspAsn: 3.148 ± 1.705
AspPro: 4.197 ± 1.093
AspGln: 0.0 ± 0.0
AspArg: 0.0 ± 0.0
AspSer: 4.197 ± 2.273
AspThr: 8.395 ± 3.492
AspVal: 3.148 ± 1.705
AspTrp: 1.049 ± 0.568
AspTyr: 2.099 ± 1.137
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.049 ± 0.568
GluCys: 0.0 ± 0.0
GluAsp: 5.247 ± 1.266
GluGlu: 0.0 ± 0.0
GluPhe: 1.049 ± 0.568
GluGly: 3.148 ± 0.947
GluHis: 1.049 ± 0.568
GluIle: 2.099 ± 1.989
GluLys: 3.148 ± 0.947
GluLeu: 3.148 ± 4.283
GluMet: 0.0 ± 1.154
GluAsn: 2.099 ± 1.137
GluPro: 1.049 ± 2.319
GluGln: 0.0 ± 0.0
GluArg: 1.049 ± 1.496
GluSer: 3.148 ± 1.705
GluThr: 6.296 ± 3.568
GluVal: 0.0 ± 0.0
GluTrp: 1.049 ± 1.496
GluTyr: 2.099 ± 1.137
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 1.049 ± 2.319
PheAsp: 3.148 ± 0.947
PheGlu: 1.049 ± 2.319
PhePhe: 1.049 ± 0.568
PheGly: 0.0 ± 0.0
PheHis: 0.0 ± 0.0
PheIle: 1.049 ± 1.496
PheLys: 5.247 ± 1.989
PheLeu: 4.197 ± 1.746
PheMet: 0.0 ± 0.0
PheAsn: 2.099 ± 2.806
PhePro: 3.148 ± 0.947
PheGln: 5.247 ± 1.462
PheArg: 3.148 ± 0.947
PheSer: 2.099 ± 1.137
PheThr: 2.099 ± 1.989
PheVal: 3.148 ± 1.705
PheTrp: 1.049 ± 0.568
PheTyr: 4.197 ± 1.093
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.049 ± 2.319
GlyCys: 2.099 ± 1.989
GlyAsp: 5.247 ± 6.263
GlyGlu: 2.099 ± 1.989
GlyPhe: 1.049 ± 0.568
GlyGly: 4.197 ± 2.273
GlyHis: 0.0 ± 0.0
GlyIle: 0.0 ± 0.0
GlyLys: 3.148 ± 0.947
GlyLeu: 4.197 ± 1.748
GlyMet: 3.148 ± 1.32
GlyAsn: 3.148 ± 1.705
GlyPro: 5.247 ± 1.266
GlyGln: 0.0 ± 0.0
GlyArg: 1.049 ± 0.568
GlySer: 3.148 ± 1.705
GlyThr: 3.148 ± 2.577
GlyVal: 2.099 ± 1.989
GlyTrp: 2.099 ± 1.137
GlyTyr: 3.148 ± 1.705
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.049 ± 0.568
HisCys: 1.049 ± 0.568
HisAsp: 1.049 ± 2.319
HisGlu: 0.0 ± 0.0
HisPhe: 1.049 ± 0.568
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 2.099 ± 1.989
HisLys: 1.049 ± 1.496
HisLeu: 1.049 ± 0.568
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.049 ± 0.568
HisGln: 1.049 ± 1.496
HisArg: 2.099 ± 1.137
HisSer: 2.099 ± 1.115
HisThr: 1.049 ± 0.568
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 2.099 ± 1.115
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.049 ± 1.496
IleCys: 1.049 ± 0.568
IleAsp: 2.099 ± 1.115
IleGlu: 4.197 ± 1.748
IlePhe: 6.296 ± 3.573
IleGly: 0.0 ± 0.0
IleHis: 1.049 ± 1.496
IleIle: 4.197 ± 1.093
IleLys: 4.197 ± 1.748
IleLeu: 2.099 ± 1.989
IleMet: 1.049 ± 0.568
IleAsn: 1.049 ± 1.496
IlePro: 5.247 ± 1.989
IleGln: 4.197 ± 1.746
IleArg: 1.049 ± 0.568
IleSer: 6.296 ± 3.41
IleThr: 7.345 ± 1.964
IleVal: 1.049 ± 0.568
IleTrp: 1.049 ± 0.568
IleTyr: 1.049 ± 0.568
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.197 ± 1.746
LysCys: 2.099 ± 1.989
LysAsp: 5.247 ± 1.462
LysGlu: 4.197 ± 2.231
LysPhe: 3.148 ± 0.947
LysGly: 4.197 ± 1.748
LysHis: 4.197 ± 1.748
LysIle: 7.345 ± 1.585
LysLys: 13.641 ± 3.596
LysLeu: 8.395 ± 2.186
LysMet: 1.049 ± 0.568
LysAsn: 5.247 ± 1.462
LysPro: 6.296 ± 5.962
LysGln: 4.197 ± 3.286
LysArg: 6.296 ± 1.893
LysSer: 5.247 ± 2.841
LysThr: 3.148 ± 0.947
LysVal: 2.099 ± 1.137
LysTrp: 1.049 ± 0.568
LysTyr: 3.148 ± 1.705
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.099 ± 1.115
LeuCys: 0.0 ± 0.0
LeuAsp: 3.148 ± 4.283
LeuGlu: 3.148 ± 2.267
LeuPhe: 5.247 ± 5.07
LeuGly: 5.247 ± 1.462
LeuHis: 1.049 ± 0.568
LeuIle: 2.099 ± 1.137
LeuLys: 10.493 ± 1.941
LeuLeu: 2.099 ± 1.989
LeuMet: 2.099 ± 1.115
LeuAsn: 7.345 ± 0.807
LeuPro: 3.148 ± 1.705
LeuGln: 4.197 ± 1.746
LeuArg: 3.148 ± 0.947
LeuSer: 7.345 ± 5.889
LeuThr: 5.247 ± 1.89
LeuVal: 2.099 ± 1.137
LeuTrp: 4.197 ± 2.273
LeuTyr: 1.049 ± 0.568
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 0.0 ± 0.0
MetAsp: 1.049 ± 2.319
MetGlu: 1.049 ± 0.568
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 1.049 ± 1.496
MetLys: 2.099 ± 1.137
MetLeu: 2.099 ± 1.137
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 2.099 ± 1.137
MetGln: 0.0 ± 0.0
MetArg: 2.099 ± 1.115
MetSer: 2.099 ± 1.989
MetThr: 0.0 ± 0.0
MetVal: 1.049 ± 0.568
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.197 ± 2.273
AsnCys: 1.049 ± 0.568
AsnAsp: 1.049 ± 1.496
AsnGlu: 0.0 ± 0.0
AsnPhe: 3.148 ± 2.577
AsnGly: 3.148 ± 2.267
AsnHis: 1.049 ± 2.319
AsnIle: 4.197 ± 2.273
AsnLys: 6.296 ± 3.573
AsnLeu: 2.099 ± 2.806
AsnMet: 0.0 ± 0.0
AsnAsn: 3.148 ± 1.705
AsnPro: 4.197 ± 2.273
AsnGln: 3.148 ± 1.705
AsnArg: 2.099 ± 1.115
AsnSer: 3.148 ± 1.784
AsnThr: 9.444 ± 2.518
AsnVal: 0.0 ± 0.0
AsnTrp: 3.148 ± 1.705
AsnTyr: 7.345 ± 3.978
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.197 ± 1.093
ProCys: 1.049 ± 0.568
ProAsp: 1.049 ± 0.568
ProGlu: 2.099 ± 1.137
ProPhe: 2.099 ± 1.115
ProGly: 2.099 ± 1.137
ProHis: 1.049 ± 0.568
ProIle: 4.197 ± 2.231
ProLys: 5.247 ± 1.462
ProLeu: 7.345 ± 2.442
ProMet: 1.049 ± 0.568
ProAsn: 2.099 ± 2.806
ProPro: 6.296 ± 2.151
ProGln: 5.247 ± 4.575
ProArg: 6.296 ± 0.897
ProSer: 4.197 ± 1.093
ProThr: 6.296 ± 1.893
ProVal: 5.247 ± 7.48
ProTrp: 1.049 ± 1.496
ProTyr: 3.148 ± 1.705
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.247 ± 3.679
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 2.099 ± 1.989
GlnPhe: 2.099 ± 2.806
GlnGly: 3.148 ± 1.784
GlnHis: 1.049 ± 0.568
GlnIle: 1.049 ± 1.496
GlnLys: 3.148 ± 0.947
GlnLeu: 2.099 ± 2.992
GlnMet: 1.049 ± 0.568
GlnAsn: 5.247 ± 2.718
GlnPro: 5.247 ± 1.462
GlnGln: 3.148 ± 1.705
GlnArg: 3.148 ± 0.947
GlnSer: 2.099 ± 1.137
GlnThr: 4.197 ± 2.231
GlnVal: 1.049 ± 0.568
GlnTrp: 2.099 ± 1.137
GlnTyr: 2.099 ± 1.137
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.099 ± 1.137
ArgCys: 1.049 ± 0.568
ArgAsp: 2.099 ± 1.115
ArgGlu: 1.049 ± 0.568
ArgPhe: 1.049 ± 0.568
ArgGly: 0.0 ± 0.0
ArgHis: 3.148 ± 0.947
ArgIle: 1.049 ± 1.496
ArgLys: 6.296 ± 1.931
ArgLeu: 4.197 ± 1.093
ArgMet: 1.049 ± 0.568
ArgAsn: 3.148 ± 1.705
ArgPro: 4.197 ± 2.231
ArgGln: 1.049 ± 0.568
ArgArg: 13.641 ± 4.366
ArgSer: 4.197 ± 4.064
ArgThr: 2.099 ± 1.137
ArgVal: 2.099 ± 1.137
ArgTrp: 2.099 ± 1.137
ArgTyr: 4.197 ± 1.748
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.049 ± 0.568
SerCys: 0.0 ± 0.0
SerAsp: 5.247 ± 3.736
SerGlu: 1.049 ± 0.568
SerPhe: 3.148 ± 1.705
SerGly: 1.049 ± 0.568
SerHis: 0.0 ± 0.0
SerIle: 10.493 ± 2.544
SerLys: 7.345 ± 2.442
SerLeu: 8.395 ± 4.546
SerMet: 0.0 ± 0.0
SerAsn: 8.395 ± 2.974
SerPro: 3.148 ± 2.577
SerGln: 5.247 ± 1.462
SerArg: 2.099 ± 1.115
SerSer: 14.69 ± 7.956
SerThr: 2.099 ± 1.115
SerVal: 0.0 ± 0.0
SerTrp: 2.099 ± 1.115
SerTyr: 1.049 ± 2.319
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.296 ± 1.893
ThrCys: 0.0 ± 0.0
ThrAsp: 5.247 ± 2.841
ThrGlu: 3.148 ± 1.784
ThrPhe: 1.049 ± 1.496
ThrGly: 10.493 ± 2.544
ThrHis: 0.0 ± 0.0
ThrIle: 6.296 ± 1.931
ThrLys: 6.296 ± 2.151
ThrLeu: 7.345 ± 1.585
ThrMet: 1.049 ± 0.519
ThrAsn: 5.247 ± 1.89
ThrPro: 5.247 ± 3.679
ThrGln: 3.148 ± 2.577
ThrArg: 2.099 ± 1.137
ThrSer: 5.247 ± 1.989
ThrThr: 10.493 ± 1.941
ThrVal: 0.0 ± 0.0
ThrTrp: 1.049 ± 0.568
ThrTyr: 1.049 ± 1.496
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.0 ± 0.0
ValCys: 1.049 ± 0.568
ValAsp: 3.148 ± 1.705
ValGlu: 1.049 ± 0.568
ValPhe: 1.049 ± 0.568
ValGly: 0.0 ± 0.0
ValHis: 1.049 ± 1.496
ValIle: 2.099 ± 1.137
ValLys: 4.197 ± 2.231
ValLeu: 2.099 ± 1.989
ValMet: 0.0 ± 0.0
ValAsn: 2.099 ± 1.137
ValPro: 1.049 ± 1.496
ValGln: 1.049 ± 0.568
ValArg: 3.148 ± 1.705
ValSer: 1.049 ± 0.568
ValThr: 1.049 ± 1.496
ValVal: 1.049 ± 0.568
ValTrp: 0.0 ± 0.0
ValTyr: 3.148 ± 1.705
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.049 ± 0.568
TrpCys: 0.0 ± 0.0
TrpAsp: 1.049 ± 0.568
TrpGlu: 4.197 ± 2.231
TrpPhe: 1.049 ± 0.568
TrpGly: 4.197 ± 2.273
TrpHis: 1.049 ± 0.568
TrpIle: 2.099 ± 1.137
TrpLys: 1.049 ± 0.568
TrpLeu: 1.049 ± 1.496
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 2.099 ± 1.137
TrpGln: 1.049 ± 0.568
TrpArg: 4.197 ± 2.273
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 1.049 ± 0.568
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.049 ± 0.568
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 0.0 ± 0.0
TyrAsp: 3.148 ± 0.947
TyrGlu: 0.0 ± 0.0
TyrPhe: 6.296 ± 1.931
TyrGly: 2.099 ± 1.989
TyrHis: 0.0 ± 0.0
TyrIle: 1.049 ± 0.568
TyrLys: 3.148 ± 1.784
TyrLeu: 3.148 ± 1.705
TyrMet: 0.0 ± 0.0
TyrAsn: 3.148 ± 1.705
TyrPro: 5.247 ± 1.462
TyrGln: 3.148 ± 1.705
TyrArg: 2.099 ± 1.137
TyrSer: 2.099 ± 1.137
TyrThr: 3.148 ± 1.705
TyrVal: 3.148 ± 1.705
TyrTrp: 2.099 ± 1.137
TyrTyr: 3.148 ± 1.705
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (954 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
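A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. This is only an illustration under stated assumptions. The function names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the error are all hypothetical; the page states only that bootstrapping with 100 replicates at the protein level was used.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_per_mille(sequences):
    """Per-mille frequency of each overlapping residue pair (one-letter codes)."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Spread of one dipeptide frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    # Assumption: the reported error is the sample standard deviation.
    return stdev(estimates)


# Hypothetical toy proteome, not the three viral proteins.
toy = ["MKKLSKK", "MARRT", "MSSLK"]
print(bootstrap_error(toy, "KK"))
```

With only three proteins, as in this proteome, many replicates omit one or two proteins entirely, which is consistent with errors in the table that are comparable to, or larger than, the values themselves.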

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski