Amino acid dipepetide frequency for Torque teno virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.264AlaAla: 8.264 ± 3.787
1.181AlaCys: 1.181 ± 0.537
2.361AlaAsp: 2.361 ± 2.698
5.903AlaGlu: 5.903 ± 8.631
1.181AlaPhe: 1.181 ± 0.537
1.181AlaGly: 1.181 ± 0.537
3.542AlaHis: 3.542 ± 2.162
3.542AlaIle: 3.542 ± 1.61
2.361AlaLys: 2.361 ± 1.073
7.084AlaLeu: 7.084 ± 4.324
1.181AlaMet: 1.181 ± 3.235
0.0AlaAsn: 0.0 ± 0.0
5.903AlaPro: 5.903 ± 12.403
1.181AlaGln: 1.181 ± 0.537
7.084AlaArg: 7.084 ± 3.219
4.723AlaSer: 4.723 ± 1.625
5.903AlaThr: 5.903 ± 2.683
1.181AlaVal: 1.181 ± 0.537
0.0AlaTrp: 0.0 ± 0.0
4.723AlaTyr: 4.723 ± 2.146
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
2.361CysAsp: 2.361 ± 1.073
0.0CysGlu: 0.0 ± 0.0
2.361CysPhe: 2.361 ± 2.698
3.542CysGly: 3.542 ± 5.933
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.181CysLys: 1.181 ± 0.537
2.361CysLeu: 2.361 ± 1.073
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.181CysPro: 1.181 ± 0.537
0.0CysGln: 0.0 ± 0.0
1.181CysArg: 1.181 ± 0.537
2.361CysSer: 2.361 ± 1.073
0.0CysThr: 0.0 ± 0.0
1.181CysVal: 1.181 ± 0.537
0.0CysTrp: 0.0 ± 0.0
2.361CysTyr: 2.361 ± 1.073
0.0CysXaa: 0.0 ± 0.0
Asp
4.723AspAla: 4.723 ± 12.939
1.181AspCys: 1.181 ± 0.537
2.361AspAsp: 2.361 ± 2.698
3.542AspGlu: 3.542 ± 1.61
3.542AspPhe: 3.542 ± 1.61
3.542AspGly: 3.542 ± 5.933
1.181AspHis: 1.181 ± 3.235
3.542AspIle: 3.542 ± 1.61
4.723AspLys: 4.723 ± 2.146
7.084AspLeu: 7.084 ± 4.324
1.181AspMet: 1.181 ± 0.537
1.181AspAsn: 1.181 ± 0.537
2.361AspPro: 2.361 ± 1.073
0.0AspGln: 0.0 ± 0.0
0.0AspArg: 0.0 ± 0.0
3.542AspSer: 3.542 ± 1.61
7.084AspThr: 7.084 ± 3.219
5.903AspVal: 5.903 ± 1.089
1.181AspTrp: 1.181 ± 0.537
2.361AspTyr: 2.361 ± 1.073
0.0AspXaa: 0.0 ± 0.0
Glu
2.361GluAla: 2.361 ± 1.073
0.0GluCys: 0.0 ± 0.0
4.723GluAsp: 4.723 ± 1.625
4.723GluGlu: 4.723 ± 2.146
2.361GluPhe: 2.361 ± 1.073
3.542GluGly: 3.542 ± 5.933
0.0GluHis: 0.0 ± 0.0
1.181GluIle: 1.181 ± 0.537
2.361GluLys: 2.361 ± 1.073
3.542GluLeu: 3.542 ± 2.162
1.181GluMet: 1.181 ± 1.152
1.181GluAsn: 1.181 ± 0.537
1.181GluPro: 1.181 ± 0.537
2.361GluGln: 2.361 ± 1.073
2.361GluArg: 2.361 ± 6.47
3.542GluSer: 3.542 ± 2.162
3.542GluThr: 3.542 ± 1.61
1.181GluVal: 1.181 ± 0.537
0.0GluTrp: 0.0 ± 0.0
2.361GluTyr: 2.361 ± 1.073
0.0GluXaa: 0.0 ± 0.0
Phe
2.361PheAla: 2.361 ± 1.073
0.0PheCys: 0.0 ± 0.0
1.181PheAsp: 1.181 ± 0.537
1.181PheGlu: 1.181 ± 0.537
1.181PhePhe: 1.181 ± 0.537
7.084PheGly: 7.084 ± 0.552
0.0PheHis: 0.0 ± 0.0
3.542PheIle: 3.542 ± 2.162
3.542PheLys: 3.542 ± 1.61
1.181PheLeu: 1.181 ± 0.537
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
1.181PhePro: 1.181 ± 0.537
3.542PheGln: 3.542 ± 1.61
3.542PheArg: 3.542 ± 2.162
1.181PheSer: 1.181 ± 0.537
3.542PheThr: 3.542 ± 1.61
1.181PheVal: 1.181 ± 0.537
1.181PheTrp: 1.181 ± 0.537
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.723GlyAla: 4.723 ± 1.625
1.181GlyCys: 1.181 ± 3.235
10.626GlyAsp: 10.626 ± 14.028
1.181GlyGlu: 1.181 ± 0.537
2.361GlyPhe: 2.361 ± 1.073
11.806GlyGly: 11.806 ± 17.263
2.361GlyHis: 2.361 ± 1.073
1.181GlyIle: 1.181 ± 0.537
0.0GlyLys: 0.0 ± 0.0
4.723GlyLeu: 4.723 ± 1.625
3.542GlyMet: 3.542 ± 1.61
3.542GlyAsn: 3.542 ± 2.162
4.723GlyPro: 4.723 ± 1.625
0.0GlyGln: 0.0 ± 0.0
10.626GlyArg: 10.626 ± 2.714
3.542GlySer: 3.542 ± 1.61
0.0GlyThr: 0.0 ± 0.0
1.181GlyVal: 1.181 ± 0.537
0.0GlyTrp: 0.0 ± 0.0
2.361GlyTyr: 2.361 ± 1.073
0.0GlyXaa: 0.0 ± 0.0
His
2.361HisAla: 2.361 ± 2.698
2.361HisCys: 2.361 ± 1.073
1.181HisAsp: 1.181 ± 3.235
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
1.181HisLys: 1.181 ± 0.537
3.542HisLeu: 3.542 ± 2.162
0.0HisMet: 0.0 ± 0.0
1.181HisAsn: 1.181 ± 0.537
2.361HisPro: 2.361 ± 1.073
3.542HisGln: 3.542 ± 1.61
3.542HisArg: 3.542 ± 1.61
3.542HisSer: 3.542 ± 1.61
2.361HisThr: 2.361 ± 1.073
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.361IleAla: 2.361 ± 1.073
1.181IleCys: 1.181 ± 0.537
2.361IleAsp: 2.361 ± 2.698
2.361IleGlu: 2.361 ± 1.073
1.181IlePhe: 1.181 ± 0.537
2.361IleGly: 2.361 ± 1.073
2.361IleHis: 2.361 ± 1.073
1.181IleIle: 1.181 ± 0.537
1.181IleLys: 1.181 ± 0.537
4.723IleLeu: 4.723 ± 2.146
1.181IleMet: 1.181 ± 0.537
2.361IleAsn: 2.361 ± 1.073
3.542IlePro: 3.542 ± 1.61
0.0IleGln: 0.0 ± 0.0
4.723IleArg: 4.723 ± 1.625
0.0IleSer: 0.0 ± 0.0
5.903IleThr: 5.903 ± 2.683
3.542IleVal: 3.542 ± 1.61
1.181IleTrp: 1.181 ± 0.537
2.361IleTyr: 2.361 ± 1.073
0.0IleXaa: 0.0 ± 0.0
Lys
3.542LysAla: 3.542 ± 1.61
0.0LysCys: 0.0 ± 0.0
1.181LysAsp: 1.181 ± 0.537
3.542LysGlu: 3.542 ± 1.61
4.723LysPhe: 4.723 ± 2.146
4.723LysGly: 4.723 ± 2.146
3.542LysHis: 3.542 ± 1.61
3.542LysIle: 3.542 ± 1.61
3.542LysLys: 3.542 ± 1.61
2.361LysLeu: 2.361 ± 1.073
0.0LysMet: 0.0 ± 0.0
3.542LysAsn: 3.542 ± 1.61
3.542LysPro: 3.542 ± 1.61
1.181LysGln: 1.181 ± 0.537
4.723LysArg: 4.723 ± 2.146
3.542LysSer: 3.542 ± 1.61
3.542LysThr: 3.542 ± 1.61
1.181LysVal: 1.181 ± 0.537
3.542LysTrp: 3.542 ± 1.61
2.361LysTyr: 2.361 ± 1.073
0.0LysXaa: 0.0 ± 0.0
Leu
8.264LeuAla: 8.264 ± 7.558
1.181LeuCys: 1.181 ± 0.537
3.542LeuAsp: 3.542 ± 1.61
5.903LeuGlu: 5.903 ± 4.86
3.542LeuPhe: 3.542 ± 1.61
5.903LeuGly: 5.903 ± 2.683
0.0LeuHis: 0.0 ± 0.0
4.723LeuIle: 4.723 ± 2.146
5.903LeuLys: 5.903 ± 2.683
3.542LeuLeu: 3.542 ± 2.162
1.181LeuMet: 1.181 ± 0.537
4.723LeuAsn: 4.723 ± 1.625
4.723LeuPro: 4.723 ± 5.397
5.903LeuGln: 5.903 ± 2.683
3.542LeuArg: 3.542 ± 1.61
3.542LeuSer: 3.542 ± 1.61
2.361LeuThr: 2.361 ± 1.073
3.542LeuVal: 3.542 ± 1.61
2.361LeuTrp: 2.361 ± 2.698
3.542LeuTyr: 3.542 ± 1.61
0.0LeuXaa: 0.0 ± 0.0
Met
1.181MetAla: 1.181 ± 0.537
2.361MetCys: 2.361 ± 2.698
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
4.723MetLeu: 4.723 ± 2.146
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
2.361MetPro: 2.361 ± 1.073
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
1.181MetSer: 1.181 ± 3.235
2.361MetThr: 2.361 ± 1.073
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.181MetTyr: 1.181 ± 0.537
0.0MetXaa: 0.0 ± 0.0
Asn
1.181AsnAla: 1.181 ± 0.537
2.361AsnCys: 2.361 ± 1.073
5.903AsnAsp: 5.903 ± 1.089
1.181AsnGlu: 1.181 ± 0.537
4.723AsnPhe: 4.723 ± 1.625
0.0AsnGly: 0.0 ± 0.0
0.0AsnHis: 0.0 ± 0.0
2.361AsnIle: 2.361 ± 1.073
0.0AsnLys: 0.0 ± 0.0
0.0AsnLeu: 0.0 ± 0.0
0.0AsnMet: 0.0 ± 0.0
1.181AsnAsn: 1.181 ± 0.537
4.723AsnPro: 4.723 ± 1.625
1.181AsnGln: 1.181 ± 0.537
0.0AsnArg: 0.0 ± 0.0
1.181AsnSer: 1.181 ± 0.537
8.264AsnThr: 8.264 ± 3.756
0.0AsnVal: 0.0 ± 0.0
1.181AsnTrp: 1.181 ± 3.235
1.181AsnTyr: 1.181 ± 0.537
0.0AsnXaa: 0.0 ± 0.0
Pro
3.542ProAla: 3.542 ± 9.705
1.181ProCys: 1.181 ± 0.537
2.361ProAsp: 2.361 ± 1.073
5.903ProGlu: 5.903 ± 4.86
3.542ProPhe: 3.542 ± 1.61
5.903ProGly: 5.903 ± 1.089
1.181ProHis: 1.181 ± 0.537
4.723ProIle: 4.723 ± 1.625
3.542ProLys: 3.542 ± 1.61
5.903ProLeu: 5.903 ± 1.089
1.181ProMet: 1.181 ± 0.537
1.181ProAsn: 1.181 ± 3.235
9.445ProPro: 9.445 ± 14.565
4.723ProGln: 4.723 ± 1.625
4.723ProArg: 4.723 ± 5.397
2.361ProSer: 2.361 ± 1.073
2.361ProThr: 2.361 ± 1.073
0.0ProVal: 0.0 ± 0.0
2.361ProTrp: 2.361 ± 2.698
4.723ProTyr: 4.723 ± 2.146
0.0ProXaa: 0.0 ± 0.0
Gln
1.181GlnAla: 1.181 ± 0.537
0.0GlnCys: 0.0 ± 0.0
1.181GlnAsp: 1.181 ± 0.537
4.723GlnGlu: 4.723 ± 2.146
1.181GlnPhe: 1.181 ± 0.537
1.181GlnGly: 1.181 ± 0.537
0.0GlnHis: 0.0 ± 0.0
1.181GlnIle: 1.181 ± 0.537
1.181GlnLys: 1.181 ± 0.537
3.542GlnLeu: 3.542 ± 1.61
1.181GlnMet: 1.181 ± 0.497
0.0GlnAsn: 0.0 ± 0.0
4.723GlnPro: 4.723 ± 1.625
7.084GlnGln: 7.084 ± 3.219
4.723GlnArg: 4.723 ± 2.146
1.181GlnSer: 1.181 ± 0.537
3.542GlnThr: 3.542 ± 1.61
7.084GlnVal: 7.084 ± 3.219
1.181GlnTrp: 1.181 ± 0.537
1.181GlnTyr: 1.181 ± 0.537
0.0GlnXaa: 0.0 ± 0.0
Arg
8.264ArgAla: 8.264 ± 0.016
2.361ArgCys: 2.361 ± 1.073
3.542ArgAsp: 3.542 ± 1.61
2.361ArgGlu: 2.361 ± 1.073
1.181ArgPhe: 1.181 ± 3.235
5.903ArgGly: 5.903 ± 4.86
3.542ArgHis: 3.542 ± 1.61
3.542ArgIle: 3.542 ± 1.61
5.903ArgLys: 5.903 ± 2.683
5.903ArgLeu: 5.903 ± 2.683
2.361ArgMet: 2.361 ± 1.073
2.361ArgAsn: 2.361 ± 2.698
8.264ArgPro: 8.264 ± 11.33
2.361ArgGln: 2.361 ± 1.073
35.419ArgArg: 35.419 ± 12.325
4.723ArgSer: 4.723 ± 5.397
0.0ArgThr: 0.0 ± 0.0
1.181ArgVal: 1.181 ± 0.537
4.723ArgTrp: 4.723 ± 2.146
1.181ArgTyr: 1.181 ± 0.537
0.0ArgXaa: 0.0 ± 0.0
Ser
1.181SerAla: 1.181 ± 0.537
1.181SerCys: 1.181 ± 3.235
3.542SerAsp: 3.542 ± 1.61
0.0SerGlu: 0.0 ± 0.0
0.0SerPhe: 0.0 ± 0.0
3.542SerGly: 3.542 ± 2.162
1.181SerHis: 1.181 ± 3.235
3.542SerIle: 3.542 ± 1.61
10.626SerLys: 10.626 ± 4.829
4.723SerLeu: 4.723 ± 1.625
0.0SerMet: 0.0 ± 0.0
3.542SerAsn: 3.542 ± 1.61
3.542SerPro: 3.542 ± 1.61
1.181SerGln: 1.181 ± 0.537
1.181SerArg: 1.181 ± 3.235
4.723SerSer: 4.723 ± 2.146
2.361SerThr: 2.361 ± 1.073
2.361SerVal: 2.361 ± 1.073
2.361SerTrp: 2.361 ± 1.073
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
5.903ThrAla: 5.903 ± 2.683
0.0ThrCys: 0.0 ± 0.0
5.903ThrAsp: 5.903 ± 2.683
1.181ThrGlu: 1.181 ± 0.537
3.542ThrPhe: 3.542 ± 1.61
5.903ThrGly: 5.903 ± 1.089
3.542ThrHis: 3.542 ± 1.61
2.361ThrIle: 2.361 ± 1.073
3.542ThrLys: 3.542 ± 1.61
4.723ThrLeu: 4.723 ± 2.146
0.0ThrMet: 0.0 ± 0.0
4.723ThrAsn: 4.723 ± 2.146
1.181ThrPro: 1.181 ± 0.537
5.903ThrGln: 5.903 ± 2.683
4.723ThrArg: 4.723 ± 2.146
2.361ThrSer: 2.361 ± 1.073
1.181ThrThr: 1.181 ± 0.537
1.181ThrVal: 1.181 ± 0.537
1.181ThrTrp: 1.181 ± 0.537
2.361ThrTyr: 2.361 ± 1.073
0.0ThrXaa: 0.0 ± 0.0
Val
1.181ValAla: 1.181 ± 0.537
1.181ValCys: 1.181 ± 0.537
3.542ValAsp: 3.542 ± 1.61
0.0ValGlu: 0.0 ± 0.0
0.0ValPhe: 0.0 ± 0.0
1.181ValGly: 1.181 ± 0.537
0.0ValHis: 0.0 ± 0.0
3.542ValIle: 3.542 ± 1.61
2.361ValLys: 2.361 ± 1.073
3.542ValLeu: 3.542 ± 1.61
0.0ValMet: 0.0 ± 0.0
1.181ValAsn: 1.181 ± 0.537
3.542ValPro: 3.542 ± 1.61
4.723ValGln: 4.723 ± 2.146
4.723ValArg: 4.723 ± 2.146
1.181ValSer: 1.181 ± 0.537
2.361ValThr: 2.361 ± 2.698
3.542ValVal: 3.542 ± 1.61
1.181ValTrp: 1.181 ± 0.537
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.181TrpAla: 1.181 ± 0.537
1.181TrpCys: 1.181 ± 0.537
1.181TrpAsp: 1.181 ± 0.537
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
1.181TrpGly: 1.181 ± 0.537
1.181TrpHis: 1.181 ± 0.537
1.181TrpIle: 1.181 ± 0.537
0.0TrpLys: 0.0 ± 0.0
2.361TrpLeu: 2.361 ± 1.073
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.181TrpGln: 1.181 ± 0.537
8.264TrpArg: 8.264 ± 3.787
0.0TrpSer: 0.0 ± 0.0
1.181TrpThr: 1.181 ± 0.537
1.181TrpVal: 1.181 ± 0.537
2.361TrpTrp: 2.361 ± 1.073
3.542TrpTyr: 3.542 ± 2.162
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.723TyrAla: 4.723 ± 2.146
0.0TyrCys: 0.0 ± 0.0
1.181TyrAsp: 1.181 ± 0.537
1.181TyrGlu: 1.181 ± 3.235
0.0TyrPhe: 0.0 ± 0.0
1.181TyrGly: 1.181 ± 0.537
3.542TyrHis: 3.542 ± 1.61
1.181TyrIle: 1.181 ± 0.537
4.723TyrLys: 4.723 ± 2.146
2.361TyrLeu: 2.361 ± 1.073
0.0TyrMet: 0.0 ± 0.0
4.723TyrAsn: 4.723 ± 2.146
2.361TyrPro: 2.361 ± 1.073
1.181TyrGln: 1.181 ± 0.537
0.0TyrArg: 0.0 ± 0.0
2.361TyrSer: 2.361 ± 1.073
3.542TyrThr: 3.542 ± 1.61
2.361TyrVal: 2.361 ± 1.073
1.181TyrTrp: 1.181 ± 0.537
1.181TyrTyr: 1.181 ± 0.537
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (848 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski