Amino acid dipepetide frequency for Torque teno mini virus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.688AlaAla: 2.688 ± 1.151
1.344AlaCys: 1.344 ± 0.576
1.344AlaAsp: 1.344 ± 0.576
4.032AlaGlu: 4.032 ± 7.736
1.344AlaPhe: 1.344 ± 0.576
1.344AlaGly: 1.344 ± 4.156
2.688AlaHis: 2.688 ± 3.58
0.0AlaIle: 0.0 ± 0.0
8.065AlaLys: 8.065 ± 3.453
2.688AlaLeu: 2.688 ± 3.58
0.0AlaMet: 0.0 ± 0.0
0.0AlaAsn: 0.0 ± 0.0
1.344AlaPro: 1.344 ± 0.576
0.0AlaGln: 0.0 ± 0.0
0.0AlaArg: 0.0 ± 0.0
1.344AlaSer: 1.344 ± 0.576
2.688AlaThr: 2.688 ± 1.151
0.0AlaVal: 0.0 ± 0.0
0.0AlaTrp: 0.0 ± 0.0
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.344CysAsp: 1.344 ± 4.156
1.344CysGlu: 1.344 ± 4.156
0.0CysPhe: 0.0 ± 0.0
1.344CysGly: 1.344 ± 0.576
2.688CysHis: 2.688 ± 8.312
2.688CysIle: 2.688 ± 1.151
4.032CysLys: 4.032 ± 1.727
1.344CysLeu: 1.344 ± 0.576
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.344CysPro: 1.344 ± 0.576
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.344CysSer: 1.344 ± 0.576
1.344CysThr: 1.344 ± 0.576
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.344CysTyr: 1.344 ± 0.576
0.0CysXaa: 0.0 ± 0.0
Asp
0.0AspAla: 0.0 ± 0.0
0.0AspCys: 0.0 ± 0.0
1.344AspAsp: 1.344 ± 4.156
1.344AspGlu: 1.344 ± 0.576
2.688AspPhe: 2.688 ± 3.58
2.688AspGly: 2.688 ± 3.58
2.688AspHis: 2.688 ± 3.58
1.344AspIle: 1.344 ± 0.576
1.344AspLys: 1.344 ± 0.576
6.72AspLeu: 6.72 ± 6.585
0.0AspMet: 0.0 ± 0.0
1.344AspAsn: 1.344 ± 0.576
5.376AspPro: 5.376 ± 2.302
1.344AspGln: 1.344 ± 0.576
2.688AspArg: 2.688 ± 3.58
4.032AspSer: 4.032 ± 3.005
4.032AspThr: 4.032 ± 1.727
1.344AspVal: 1.344 ± 0.576
2.688AspTrp: 2.688 ± 1.151
2.688AspTyr: 2.688 ± 1.151
0.0AspXaa: 0.0 ± 0.0
Glu
5.376GluAla: 5.376 ± 2.302
0.0GluCys: 0.0 ± 0.0
5.376GluAsp: 5.376 ± 2.429
8.065GluGlu: 8.065 ± 1.278
0.0GluPhe: 0.0 ± 0.0
5.376GluGly: 5.376 ± 7.161
2.688GluHis: 2.688 ± 1.151
4.032GluIle: 4.032 ± 3.005
1.344GluLys: 1.344 ± 0.576
4.032GluLeu: 4.032 ± 3.005
2.688GluMet: 2.688 ± 0.979
5.376GluAsn: 5.376 ± 2.429
2.688GluPro: 2.688 ± 3.58
0.0GluGln: 0.0 ± 0.0
1.344GluArg: 1.344 ± 0.576
4.032GluSer: 4.032 ± 1.727
2.688GluThr: 2.688 ± 3.58
0.0GluVal: 0.0 ± 0.0
0.0GluTrp: 0.0 ± 0.0
2.688GluTyr: 2.688 ± 1.151
0.0GluXaa: 0.0 ± 0.0
Phe
2.688PheAla: 2.688 ± 3.58
0.0PheCys: 0.0 ± 0.0
2.688PheAsp: 2.688 ± 1.151
4.032PheGlu: 4.032 ± 3.005
0.0PhePhe: 0.0 ± 0.0
2.688PheGly: 2.688 ± 3.58
2.688PheHis: 2.688 ± 1.151
2.688PheIle: 2.688 ± 1.151
4.032PheLys: 4.032 ± 3.005
0.0PheLeu: 0.0 ± 0.0
0.0PheMet: 0.0 ± 0.0
2.688PheAsn: 2.688 ± 1.151
0.0PhePro: 0.0 ± 0.0
5.376PheGln: 5.376 ± 2.302
1.344PheArg: 1.344 ± 0.576
2.688PheSer: 2.688 ± 1.151
2.688PheThr: 2.688 ± 3.58
0.0PheVal: 0.0 ± 0.0
1.344PheTrp: 1.344 ± 0.576
4.032PheTyr: 4.032 ± 3.005
0.0PheXaa: 0.0 ± 0.0
Gly
0.0GlyAla: 0.0 ± 0.0
2.688GlyCys: 2.688 ± 1.151
5.376GlyAsp: 5.376 ± 11.892
4.032GlyGlu: 4.032 ± 3.005
5.376GlyPhe: 5.376 ± 2.429
5.376GlyGly: 5.376 ± 2.302
0.0GlyHis: 0.0 ± 0.0
0.0GlyIle: 0.0 ± 0.0
2.688GlyLys: 2.688 ± 3.58
1.344GlyLeu: 1.344 ± 0.576
0.0GlyMet: 0.0 ± 2.117
6.72GlyAsn: 6.72 ± 1.854
2.688GlyPro: 2.688 ± 1.151
4.032GlyGln: 4.032 ± 1.727
0.0GlyArg: 0.0 ± 0.0
0.0GlySer: 0.0 ± 0.0
6.72GlyThr: 6.72 ± 2.878
0.0GlyVal: 0.0 ± 0.0
2.688GlyTrp: 2.688 ± 1.151
2.688GlyTyr: 2.688 ± 1.151
0.0GlyXaa: 0.0 ± 0.0
His
1.344HisAla: 1.344 ± 0.576
1.344HisCys: 1.344 ± 4.156
1.344HisAsp: 1.344 ± 4.156
1.344HisGlu: 1.344 ± 0.576
1.344HisPhe: 1.344 ± 0.576
1.344HisGly: 1.344 ± 0.576
0.0HisHis: 0.0 ± 0.0
1.344HisIle: 1.344 ± 0.576
1.344HisLys: 1.344 ± 0.576
4.032HisLeu: 4.032 ± 3.005
0.0HisMet: 0.0 ± 0.0
1.344HisAsn: 1.344 ± 0.576
5.376HisPro: 5.376 ± 2.429
1.344HisGln: 1.344 ± 0.576
2.688HisArg: 2.688 ± 1.151
2.688HisSer: 2.688 ± 3.58
2.688HisThr: 2.688 ± 8.312
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.344HisTyr: 1.344 ± 0.576
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
0.0IleCys: 0.0 ± 0.0
5.376IleAsp: 5.376 ± 2.302
0.0IleGlu: 0.0 ± 0.0
2.688IlePhe: 2.688 ± 3.58
1.344IleGly: 1.344 ± 0.576
1.344IleHis: 1.344 ± 0.576
2.688IleIle: 2.688 ± 3.58
4.032IleLys: 4.032 ± 1.727
6.72IleLeu: 6.72 ± 2.878
2.688IleMet: 2.688 ± 1.151
2.688IleAsn: 2.688 ± 1.151
1.344IlePro: 1.344 ± 0.576
5.376IleGln: 5.376 ± 2.302
2.688IleArg: 2.688 ± 1.151
4.032IleSer: 4.032 ± 1.727
2.688IleThr: 2.688 ± 1.151
2.688IleVal: 2.688 ± 1.151
0.0IleTrp: 0.0 ± 0.0
4.032IleTyr: 4.032 ± 1.727
0.0IleXaa: 0.0 ± 0.0
Lys
1.344LysAla: 1.344 ± 4.156
0.0LysCys: 0.0 ± 0.0
4.032LysAsp: 4.032 ± 1.727
0.0LysGlu: 0.0 ± 0.0
4.032LysPhe: 4.032 ± 1.727
5.376LysGly: 5.376 ± 2.429
1.344LysHis: 1.344 ± 4.156
6.72LysIle: 6.72 ± 2.878
5.376LysLys: 5.376 ± 2.302
10.753LysLeu: 10.753 ± 0.127
0.0LysMet: 0.0 ± 0.0
5.376LysAsn: 5.376 ± 2.302
2.688LysPro: 2.688 ± 3.58
5.376LysGln: 5.376 ± 2.302
2.688LysArg: 2.688 ± 1.151
2.688LysSer: 2.688 ± 3.58
4.032LysThr: 4.032 ± 1.727
2.688LysVal: 2.688 ± 1.151
1.344LysTrp: 1.344 ± 0.576
6.72LysTyr: 6.72 ± 2.878
0.0LysXaa: 0.0 ± 0.0
Leu
1.344LeuAla: 1.344 ± 0.576
4.032LeuCys: 4.032 ± 12.468
2.688LeuAsp: 2.688 ± 3.58
8.065LeuGlu: 8.065 ± 3.453
4.032LeuPhe: 4.032 ± 3.005
2.688LeuGly: 2.688 ± 1.151
4.032LeuHis: 4.032 ± 1.727
4.032LeuIle: 4.032 ± 1.727
8.065LeuLys: 8.065 ± 6.01
5.376LeuLeu: 5.376 ± 7.161
1.344LeuMet: 1.344 ± 0.576
5.376LeuAsn: 5.376 ± 2.429
4.032LeuPro: 4.032 ± 1.727
10.753LeuGln: 10.753 ± 0.127
1.344LeuArg: 1.344 ± 0.576
0.0LeuSer: 0.0 ± 0.0
5.376LeuThr: 5.376 ± 2.302
4.032LeuVal: 4.032 ± 1.727
4.032LeuTrp: 4.032 ± 1.727
6.72LeuTyr: 6.72 ± 2.878
0.0LeuXaa: 0.0 ± 0.0
Met
1.344MetAla: 1.344 ± 0.576
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
2.688MetGlu: 2.688 ± 1.151
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
2.688MetLeu: 2.688 ± 1.151
2.688MetMet: 2.688 ± 1.151
1.344MetAsn: 1.344 ± 0.576
2.688MetPro: 2.688 ± 1.151
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
1.344MetSer: 1.344 ± 4.156
1.344MetThr: 1.344 ± 0.576
2.688MetVal: 2.688 ± 1.151
0.0MetTrp: 0.0 ± 0.0
2.688MetTyr: 2.688 ± 1.151
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
4.032AsnCys: 4.032 ± 1.727
0.0AsnAsp: 0.0 ± 0.0
1.344AsnGlu: 1.344 ± 4.156
4.032AsnPhe: 4.032 ± 3.005
2.688AsnGly: 2.688 ± 1.151
1.344AsnHis: 1.344 ± 0.576
1.344AsnIle: 1.344 ± 0.576
2.688AsnLys: 2.688 ± 1.151
5.376AsnLeu: 5.376 ± 2.302
0.0AsnMet: 0.0 ± 0.0
9.409AsnAsn: 9.409 ± 4.029
2.688AsnPro: 2.688 ± 1.151
0.0AsnGln: 0.0 ± 0.0
1.344AsnArg: 1.344 ± 0.576
1.344AsnSer: 1.344 ± 4.156
9.409AsnThr: 9.409 ± 4.029
0.0AsnVal: 0.0 ± 0.0
6.72AsnTrp: 6.72 ± 2.878
1.344AsnTyr: 1.344 ± 0.576
0.0AsnXaa: 0.0 ± 0.0
Pro
0.0ProAla: 0.0 ± 0.0
2.688ProCys: 2.688 ± 1.151
0.0ProAsp: 0.0 ± 0.0
2.688ProGlu: 2.688 ± 3.58
2.688ProPhe: 2.688 ± 1.151
2.688ProGly: 2.688 ± 1.151
0.0ProHis: 0.0 ± 0.0
2.688ProIle: 2.688 ± 1.151
5.376ProLys: 5.376 ± 2.429
8.065ProLeu: 8.065 ± 1.278
4.032ProMet: 4.032 ± 1.727
0.0ProAsn: 0.0 ± 0.0
5.376ProPro: 5.376 ± 2.302
1.344ProGln: 1.344 ± 0.576
4.032ProArg: 4.032 ± 1.727
4.032ProSer: 4.032 ± 1.727
9.409ProThr: 9.409 ± 0.703
0.0ProVal: 0.0 ± 0.0
0.0ProTrp: 0.0 ± 0.0
6.72ProTyr: 6.72 ± 2.878
0.0ProXaa: 0.0 ± 0.0
Gln
0.0GlnAla: 0.0 ± 0.0
1.344GlnCys: 1.344 ± 0.576
2.688GlnAsp: 2.688 ± 1.151
2.688GlnGlu: 2.688 ± 1.151
1.344GlnPhe: 1.344 ± 0.576
1.344GlnGly: 1.344 ± 0.576
1.344GlnHis: 1.344 ± 0.576
1.344GlnIle: 1.344 ± 0.576
6.72GlnLys: 6.72 ± 2.878
6.72GlnLeu: 6.72 ± 1.854
2.688GlnMet: 2.688 ± 1.151
2.688GlnAsn: 2.688 ± 1.151
5.376GlnPro: 5.376 ± 2.302
5.376GlnGln: 5.376 ± 2.302
4.032GlnArg: 4.032 ± 1.727
5.376GlnSer: 5.376 ± 2.302
8.065GlnThr: 8.065 ± 3.453
0.0GlnVal: 0.0 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
5.376GlnTyr: 5.376 ± 2.302
0.0GlnXaa: 0.0 ± 0.0
Arg
1.344ArgAla: 1.344 ± 0.576
0.0ArgCys: 0.0 ± 0.0
1.344ArgAsp: 1.344 ± 0.576
1.344ArgGlu: 1.344 ± 0.576
1.344ArgPhe: 1.344 ± 0.576
2.688ArgGly: 2.688 ± 1.151
1.344ArgHis: 1.344 ± 0.576
5.376ArgIle: 5.376 ± 2.302
1.344ArgLys: 1.344 ± 0.576
2.688ArgLeu: 2.688 ± 3.58
0.0ArgMet: 0.0 ± 0.0
2.688ArgAsn: 2.688 ± 1.151
2.688ArgPro: 2.688 ± 1.151
5.376ArgGln: 5.376 ± 2.302
16.129ArgArg: 16.129 ± 6.906
0.0ArgSer: 0.0 ± 0.0
2.688ArgThr: 2.688 ± 1.151
1.344ArgVal: 1.344 ± 0.576
1.344ArgTrp: 1.344 ± 0.576
4.032ArgTyr: 4.032 ± 1.727
0.0ArgXaa: 0.0 ± 0.0
Ser
1.344SerAla: 1.344 ± 0.576
0.0SerCys: 0.0 ± 0.0
0.0SerAsp: 0.0 ± 0.0
5.376SerGlu: 5.376 ± 2.302
2.688SerPhe: 2.688 ± 1.151
2.688SerGly: 2.688 ± 3.58
2.688SerHis: 2.688 ± 8.312
4.032SerIle: 4.032 ± 1.727
4.032SerLys: 4.032 ± 3.005
4.032SerLeu: 4.032 ± 1.727
0.0SerMet: 0.0 ± 0.0
2.688SerAsn: 2.688 ± 1.151
5.376SerPro: 5.376 ± 2.302
2.688SerGln: 2.688 ± 1.151
2.688SerArg: 2.688 ± 1.151
1.344SerSer: 1.344 ± 0.576
1.344SerThr: 1.344 ± 0.576
2.688SerVal: 2.688 ± 3.58
0.0SerTrp: 0.0 ± 0.0
1.344SerTyr: 1.344 ± 0.576
0.0SerXaa: 0.0 ± 0.0
Thr
8.065ThrAla: 8.065 ± 6.01
1.344ThrCys: 1.344 ± 0.576
6.72ThrAsp: 6.72 ± 2.878
6.72ThrGlu: 6.72 ± 1.854
2.688ThrPhe: 2.688 ± 3.58
6.72ThrGly: 6.72 ± 1.854
2.688ThrHis: 2.688 ± 1.151
5.376ThrIle: 5.376 ± 2.302
9.409ThrLys: 9.409 ± 4.029
4.032ThrLeu: 4.032 ± 1.727
4.032ThrMet: 4.032 ± 1.727
1.344ThrAsn: 1.344 ± 0.576
5.376ThrPro: 5.376 ± 2.429
2.688ThrGln: 2.688 ± 1.151
0.0ThrArg: 0.0 ± 0.0
4.032ThrSer: 4.032 ± 1.727
13.441ThrThr: 13.441 ± 3.708
2.688ThrVal: 2.688 ± 1.151
0.0ThrTrp: 0.0 ± 0.0
5.376ThrTyr: 5.376 ± 2.302
0.0ThrXaa: 0.0 ± 0.0
Val
2.688ValAla: 2.688 ± 3.58
0.0ValCys: 0.0 ± 0.0
1.344ValAsp: 1.344 ± 0.576
1.344ValGlu: 1.344 ± 0.576
0.0ValPhe: 0.0 ± 0.0
1.344ValGly: 1.344 ± 0.576
0.0ValHis: 0.0 ± 0.0
2.688ValIle: 2.688 ± 1.151
1.344ValLys: 1.344 ± 0.576
0.0ValLeu: 0.0 ± 0.0
0.0ValMet: 0.0 ± 0.0
0.0ValAsn: 0.0 ± 0.0
2.688ValPro: 2.688 ± 1.151
1.344ValGln: 1.344 ± 0.576
4.032ValArg: 4.032 ± 1.727
1.344ValSer: 1.344 ± 0.576
1.344ValThr: 1.344 ± 0.576
0.0ValVal: 0.0 ± 0.0
0.0ValTrp: 0.0 ± 0.0
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.344TrpAsp: 1.344 ± 0.576
0.0TrpGlu: 0.0 ± 0.0
1.344TrpPhe: 1.344 ± 0.576
2.688TrpGly: 2.688 ± 1.151
1.344TrpHis: 1.344 ± 0.576
1.344TrpIle: 1.344 ± 0.576
0.0TrpLys: 0.0 ± 0.0
1.344TrpLeu: 1.344 ± 0.576
0.0TrpMet: 0.0 ± 0.0
1.344TrpAsn: 1.344 ± 0.576
0.0TrpPro: 0.0 ± 0.0
2.688TrpGln: 2.688 ± 1.151
2.688TrpArg: 2.688 ± 1.151
0.0TrpSer: 0.0 ± 0.0
2.688TrpThr: 2.688 ± 1.151
0.0TrpVal: 0.0 ± 0.0
1.344TrpTrp: 1.344 ± 0.576
1.344TrpTyr: 1.344 ± 0.576
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.344TyrAla: 1.344 ± 0.576
1.344TyrCys: 1.344 ± 0.576
1.344TyrAsp: 1.344 ± 0.576
1.344TyrGlu: 1.344 ± 0.576
4.032TyrPhe: 4.032 ± 1.727
1.344TyrGly: 1.344 ± 0.576
1.344TyrHis: 1.344 ± 0.576
2.688TyrIle: 2.688 ± 1.151
1.344TyrLys: 1.344 ± 0.576
9.409TyrLeu: 9.409 ± 4.029
0.0TyrMet: 0.0 ± 0.0
2.688TyrAsn: 2.688 ± 1.151
2.688TyrPro: 2.688 ± 1.151
9.409TyrGln: 9.409 ± 4.029
5.376TyrArg: 5.376 ± 2.302
5.376TyrSer: 5.376 ± 2.302
8.065TyrThr: 8.065 ± 1.278
1.344TyrVal: 1.344 ± 0.576
0.0TyrTrp: 0.0 ± 0.0
5.376TyrTyr: 5.376 ± 2.302
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (745 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski