Amino acid dipeptide frequency for Torque teno mini virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
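As a rough illustration of the counting described above, the following Python sketch tallies overlapping residue pairs within each protein and reports them in per mille. It is only a sketch under stated assumptions: the function names (`dipeptide_per_mille`, `classify`) are hypothetical, and whether Proteome-pI counts pairs in exactly this way (overlapping, never spanning two proteins) is an assumption, not something stated on this page.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (non-standard or unknown)

# Expectation if all 400 standard dipeptides were equally likely: 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: pairs overlap (positions i and i+1) and are counted
    within each protein only, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            # Anything outside the 20 standard residues is treated as Xaa.
            first = first if first in AMINO_ACIDS else "X"
            second = second if second in AMINO_ACIDS else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    pairs = [a + b for a, b in product(ALPHABET, repeat=2)]
    if total == 0:
        return {pair: 0.0 for pair in pairs}
    return {pair: 1000.0 * counts[pair] / total for pair in pairs}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > UNIFORM_EXPECTATION:
        return "over"
    if freq_per_mille < UNIFORM_EXPECTATION:
        return "under"
    return "expected"
```

With this reading, a value such as 17.778‰ would be labelled "over" and 1.111‰ "under", which matches the red and blue marking only if the page uses the same uniform baseline.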

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
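The unit conversion can be written out explicitly. This is a minimal sketch; the helper names `per_mille_to_fraction` and `per_mille_to_percent` are hypothetical and used only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 percent)."""
    return value * 0.1
```

For example, the uniform expectation of 2.5‰ corresponds to a fraction of 0.0025, that is 0.25%.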

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.111AlaAla: 1.111 ± 2.507
0.0AlaCys: 0.0 ± 0.0
2.222AlaAsp: 2.222 ± 5.013
1.111AlaGlu: 1.111 ± 0.599
1.111AlaPhe: 1.111 ± 0.599
0.0AlaGly: 0.0 ± 0.0
1.111AlaHis: 1.111 ± 2.507
0.0AlaIle: 0.0 ± 0.0
1.111AlaLys: 1.111 ± 0.599
2.222AlaLeu: 2.222 ± 1.198
1.111AlaMet: 1.111 ± 0.599
1.111AlaAsn: 1.111 ± 0.599
3.333AlaPro: 3.333 ± 1.689
3.333AlaGln: 3.333 ± 1.229
2.222AlaArg: 2.222 ± 1.427
1.111AlaSer: 1.111 ± 1.811
0.0AlaThr: 0.0 ± 0.0
0.0AlaVal: 0.0 ± 0.0
1.111AlaTrp: 1.111 ± 0.599
2.222AlaTyr: 2.222 ± 1.198
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.111CysAsp: 1.111 ± 2.507
1.111CysGlu: 1.111 ± 0.599
1.111CysPhe: 1.111 ± 0.599
2.222CysGly: 2.222 ± 2.052
1.111CysHis: 1.111 ± 0.599
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
1.111CysLeu: 1.111 ± 2.507
0.0CysMet: 0.0 ± 0.0
3.333CysAsn: 3.333 ± 1.796
1.111CysPro: 1.111 ± 0.599
0.0CysGln: 0.0 ± 0.0
2.222CysArg: 2.222 ± 1.198
2.222CysSer: 2.222 ± 1.427
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
1.111CysTrp: 1.111 ± 0.599
1.111CysTyr: 1.111 ± 0.599
0.0CysXaa: 0.0 ± 0.0
Asp
3.333AspAla: 3.333 ± 5.667
0.0AspCys: 0.0 ± 0.0
2.222AspAsp: 2.222 ± 3.379
4.444AspGlu: 4.444 ± 4.103
2.222AspPhe: 2.222 ± 2.052
4.444AspGly: 4.444 ± 4.103
0.0AspHis: 0.0 ± 0.0
1.111AspIle: 1.111 ± 0.599
4.444AspLys: 4.444 ± 1.304
3.333AspLeu: 3.333 ± 4.542
2.222AspMet: 2.222 ± 1.128
3.333AspAsn: 3.333 ± 4.542
2.222AspPro: 2.222 ± 1.198
5.556AspGln: 5.556 ± 1.593
2.222AspArg: 2.222 ± 2.052
2.222AspSer: 2.222 ± 1.198
4.444AspThr: 4.444 ± 2.186
1.111AspVal: 1.111 ± 1.811
1.111AspTrp: 1.111 ± 0.599
2.222AspTyr: 2.222 ± 1.198
0.0AspXaa: 0.0 ± 0.0
Glu
2.222GluAla: 2.222 ± 1.198
1.111GluCys: 1.111 ± 0.599
7.778GluAsp: 7.778 ± 6.474
6.667GluGlu: 6.667 ± 1.755
0.0GluPhe: 0.0 ± 0.0
2.222GluGly: 2.222 ± 2.052
0.0GluHis: 0.0 ± 0.0
3.333GluIle: 3.333 ± 1.689
3.333GluLys: 3.333 ± 1.689
0.0GluLeu: 0.0 ± 0.0
1.111GluMet: 1.111 ± 0.543
1.111GluAsn: 1.111 ± 2.507
2.222GluPro: 2.222 ± 2.052
0.0GluGln: 0.0 ± 0.0
2.222GluArg: 2.222 ± 1.198
3.333GluSer: 3.333 ± 3.206
3.333GluThr: 3.333 ± 1.796
1.111GluVal: 1.111 ± 0.599
2.222GluTrp: 2.222 ± 1.198
3.333GluTyr: 3.333 ± 1.229
0.0GluXaa: 0.0 ± 0.0
Phe
2.222PheAla: 2.222 ± 1.198
1.111PheCys: 1.111 ± 2.507
2.222PheAsp: 2.222 ± 1.198
1.111PheGlu: 1.111 ± 2.507
4.444PhePhe: 4.444 ± 1.304
1.111PheGly: 1.111 ± 2.507
2.222PheHis: 2.222 ± 2.052
2.222PheIle: 2.222 ± 1.198
3.333PheLys: 3.333 ± 1.796
3.333PheLeu: 3.333 ± 3.206
2.222PheMet: 2.222 ± 1.427
1.111PheAsn: 1.111 ± 0.599
3.333PhePro: 3.333 ± 1.229
3.333PheGln: 3.333 ± 1.796
0.0PheArg: 0.0 ± 0.0
2.222PheSer: 2.222 ± 1.198
2.222PheThr: 2.222 ± 2.052
1.111PheVal: 1.111 ± 0.599
3.333PheTrp: 3.333 ± 1.689
2.222PheTyr: 2.222 ± 1.198
0.0PheXaa: 0.0 ± 0.0
Gly
1.111GlyAla: 1.111 ± 0.599
3.333GlyCys: 3.333 ± 1.689
3.333GlyAsp: 3.333 ± 7.52
2.222GlyGlu: 2.222 ± 3.379
1.111GlyPhe: 1.111 ± 0.599
5.556GlyGly: 5.556 ± 2.994
1.111GlyHis: 1.111 ± 2.507
1.111GlyIle: 1.111 ± 0.599
1.111GlyLys: 1.111 ± 0.599
3.333GlyLeu: 3.333 ± 1.796
0.0GlyMet: 0.0 ± 0.0
3.333GlyAsn: 3.333 ± 1.796
2.222GlyPro: 2.222 ± 1.198
0.0GlyGln: 0.0 ± 0.0
2.222GlyArg: 2.222 ± 1.198
1.111GlySer: 1.111 ± 0.599
6.667GlyThr: 6.667 ± 1.755
1.111GlyVal: 1.111 ± 0.599
1.111GlyTrp: 1.111 ± 0.599
3.333GlyTyr: 3.333 ± 1.796
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
1.111HisAsp: 1.111 ± 2.507
1.111HisGlu: 1.111 ± 0.599
2.222HisPhe: 2.222 ± 2.052
0.0HisGly: 0.0 ± 0.0
1.111HisHis: 1.111 ± 1.811
1.111HisIle: 1.111 ± 0.599
3.333HisLys: 3.333 ± 1.796
5.556HisLeu: 5.556 ± 1.593
1.111HisMet: 1.111 ± 0.599
1.111HisAsn: 1.111 ± 2.507
0.0HisPro: 0.0 ± 0.0
2.222HisGln: 2.222 ± 1.427
5.556HisArg: 5.556 ± 3.642
0.0HisSer: 0.0 ± 0.0
3.333HisThr: 3.333 ± 2.782
0.0HisVal: 0.0 ± 0.0
2.222HisTrp: 2.222 ± 1.198
3.333HisTyr: 3.333 ± 1.229
0.0HisXaa: 0.0 ± 0.0
Ile
1.111IleAla: 1.111 ± 0.599
4.444IleCys: 4.444 ± 2.395
2.222IleAsp: 2.222 ± 2.052
2.222IleGlu: 2.222 ± 2.052
1.111IlePhe: 1.111 ± 0.599
1.111IleGly: 1.111 ± 0.599
2.222IleHis: 2.222 ± 2.052
2.222IleIle: 2.222 ± 2.052
7.778IleLys: 7.778 ± 2.143
4.444IleLeu: 4.444 ± 2.395
1.111IleMet: 1.111 ± 0.599
4.444IleAsn: 4.444 ± 1.487
4.444IlePro: 4.444 ± 2.395
5.556IleGln: 5.556 ± 1.615
2.222IleArg: 2.222 ± 1.198
4.444IleSer: 4.444 ± 2.395
1.111IleThr: 1.111 ± 0.599
2.222IleVal: 2.222 ± 1.198
1.111IleTrp: 1.111 ± 2.507
2.222IleTyr: 2.222 ± 1.198
0.0IleXaa: 0.0 ± 0.0
Lys
2.222LysAla: 2.222 ± 1.198
2.222LysCys: 2.222 ± 2.052
6.667LysAsp: 6.667 ± 2.458
4.444LysGlu: 4.444 ± 5.1
2.222LysPhe: 2.222 ± 1.427
3.333LysGly: 3.333 ± 1.796
4.444LysHis: 4.444 ± 6.431
4.444LysIle: 4.444 ± 2.395
17.778LysLys: 17.778 ± 7.605
14.444LysLeu: 14.444 ± 1.813
0.0LysMet: 0.0 ± 0.0
2.222LysAsn: 2.222 ± 1.198
3.333LysPro: 3.333 ± 1.689
5.556LysGln: 5.556 ± 1.615
3.333LysArg: 3.333 ± 1.796
4.444LysSer: 4.444 ± 5.008
7.778LysThr: 7.778 ± 2.463
0.0LysVal: 0.0 ± 0.0
1.111LysTrp: 1.111 ± 0.599
1.111LysTyr: 1.111 ± 0.599
0.0LysXaa: 0.0 ± 0.0
Leu
2.222LeuAla: 2.222 ± 2.052
2.222LeuCys: 2.222 ± 1.427
3.333LeuAsp: 3.333 ± 1.689
0.0LeuGlu: 0.0 ± 0.0
3.333LeuPhe: 3.333 ± 2.782
2.222LeuGly: 2.222 ± 1.198
2.222LeuHis: 2.222 ± 1.198
4.444LeuIle: 4.444 ± 1.487
11.111LeuLys: 11.111 ± 5.191
7.778LeuLeu: 7.778 ± 2.564
0.0LeuMet: 0.0 ± 0.0
8.889LeuAsn: 8.889 ± 0.333
5.556LeuPro: 5.556 ± 1.513
7.778LeuGln: 7.778 ± 3.461
0.0LeuArg: 0.0 ± 0.0
5.556LeuSer: 5.556 ± 1.615
4.444LeuThr: 4.444 ± 2.395
0.0LeuVal: 0.0 ± 0.0
0.0LeuTrp: 0.0 ± 0.0
4.444LeuTyr: 4.444 ± 2.395
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
1.111MetAsp: 1.111 ± 1.811
1.111MetGlu: 1.111 ± 0.599
1.111MetPhe: 1.111 ± 0.599
0.0MetGly: 0.0 ± 0.0
1.111MetHis: 1.111 ± 0.599
1.111MetIle: 1.111 ± 0.599
0.0MetLys: 0.0 ± 0.0
2.222MetLeu: 2.222 ± 1.427
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.111MetPro: 1.111 ± 0.599
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
2.222MetSer: 2.222 ± 2.052
2.222MetThr: 2.222 ± 1.427
1.111MetVal: 1.111 ± 0.599
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.222AsnAla: 2.222 ± 2.052
0.0AsnCys: 0.0 ± 0.0
1.111AsnAsp: 1.111 ± 0.599
2.222AsnGlu: 2.222 ± 2.052
3.333AsnPhe: 3.333 ± 1.689
2.222AsnGly: 2.222 ± 1.198
1.111AsnHis: 1.111 ± 0.599
3.333AsnIle: 3.333 ± 4.542
5.556AsnLys: 5.556 ± 1.615
3.333AsnLeu: 3.333 ± 1.229
0.0AsnMet: 0.0 ± 0.0
8.889AsnAsn: 8.889 ± 2.612
7.778AsnPro: 7.778 ± 4.191
4.444AsnGln: 4.444 ± 1.304
0.0AsnArg: 0.0 ± 0.0
4.444AsnSer: 4.444 ± 1.304
7.778AsnThr: 7.778 ± 2.564
3.333AsnVal: 3.333 ± 1.796
3.333AsnTrp: 3.333 ± 1.689
5.556AsnTyr: 5.556 ± 2.994
0.0AsnXaa: 0.0 ± 0.0
Pro
3.333ProAla: 3.333 ± 1.796
2.222ProCys: 2.222 ± 1.198
1.111ProAsp: 1.111 ± 0.599
2.222ProGlu: 2.222 ± 1.198
3.333ProPhe: 3.333 ± 1.796
1.111ProGly: 1.111 ± 0.599
2.222ProHis: 2.222 ± 1.198
6.667ProIle: 6.667 ± 1.755
3.333ProLys: 3.333 ± 1.796
3.333ProLeu: 3.333 ± 1.229
2.222ProMet: 2.222 ± 1.198
2.222ProAsn: 2.222 ± 1.198
3.333ProPro: 3.333 ± 1.796
1.111ProGln: 1.111 ± 2.507
1.111ProArg: 1.111 ± 0.599
3.333ProSer: 3.333 ± 1.229
8.889ProThr: 8.889 ± 2.974
2.222ProVal: 2.222 ± 1.427
1.111ProTrp: 1.111 ± 0.599
4.444ProTyr: 4.444 ± 1.487
0.0ProXaa: 0.0 ± 0.0
Gln
0.0GlnAla: 0.0 ± 0.0
0.0GlnCys: 0.0 ± 0.0
2.222GlnAsp: 2.222 ± 1.198
3.333GlnGlu: 3.333 ± 1.796
1.111GlnPhe: 1.111 ± 0.599
4.444GlnGly: 4.444 ± 2.395
3.333GlnHis: 3.333 ± 2.782
6.667GlnIle: 6.667 ± 3.593
7.778GlnLys: 7.778 ± 7.824
4.444GlnLeu: 4.444 ± 1.487
1.111GlnMet: 1.111 ± 1.811
6.667GlnAsn: 6.667 ± 2.458
3.333GlnPro: 3.333 ± 1.796
3.333GlnGln: 3.333 ± 1.796
3.333GlnArg: 3.333 ± 1.796
1.111GlnSer: 1.111 ± 0.599
3.333GlnThr: 3.333 ± 3.206
2.222GlnVal: 2.222 ± 1.198
1.111GlnTrp: 1.111 ± 1.811
3.333GlnTyr: 3.333 ± 1.229
0.0GlnXaa: 0.0 ± 0.0
Arg
1.111ArgAla: 1.111 ± 1.811
0.0ArgCys: 0.0 ± 0.0
0.0ArgAsp: 0.0 ± 0.0
1.111ArgGlu: 1.111 ± 1.811
3.333ArgPhe: 3.333 ± 1.796
0.0ArgGly: 0.0 ± 0.0
3.333ArgHis: 3.333 ± 1.229
4.444ArgIle: 4.444 ± 1.304
5.556ArgLys: 5.556 ± 1.615
5.556ArgLeu: 5.556 ± 1.513
0.0ArgMet: 0.0 ± 1.836
0.0ArgAsn: 0.0 ± 0.0
4.444ArgPro: 4.444 ± 2.395
5.556ArgGln: 5.556 ± 1.513
18.889ArgArg: 18.889 ± 8.333
1.111ArgSer: 1.111 ± 1.811
0.0ArgThr: 0.0 ± 0.0
0.0ArgVal: 0.0 ± 0.0
2.222ArgTrp: 2.222 ± 1.198
3.333ArgTyr: 3.333 ± 1.796
0.0ArgXaa: 0.0 ± 0.0
Ser
0.0SerAla: 0.0 ± 0.0
0.0SerCys: 0.0 ± 0.0
2.222SerAsp: 2.222 ± 3.623
4.444SerGlu: 4.444 ± 1.304
3.333SerPhe: 3.333 ± 1.689
4.444SerGly: 4.444 ± 1.304
4.444SerHis: 4.444 ± 1.304
4.444SerIle: 4.444 ± 2.395
3.333SerLys: 3.333 ± 1.229
2.222SerLeu: 2.222 ± 1.427
0.0SerMet: 0.0 ± 0.0
7.778SerAsn: 7.778 ± 2.564
1.111SerPro: 1.111 ± 1.811
3.333SerGln: 3.333 ± 3.206
2.222SerArg: 2.222 ± 3.623
13.333SerSer: 13.333 ± 15.024
4.444SerThr: 4.444 ± 2.855
2.222SerVal: 2.222 ± 1.198
0.0SerTrp: 0.0 ± 0.0
1.111SerTyr: 1.111 ± 0.599
0.0SerXaa: 0.0 ± 0.0
Thr
2.222ThrAla: 2.222 ± 1.427
0.0ThrCys: 0.0 ± 0.0
8.889ThrAsp: 8.889 ± 2.974
4.444ThrGlu: 4.444 ± 2.395
4.444ThrPhe: 4.444 ± 4.223
2.222ThrGly: 2.222 ± 2.052
1.111ThrHis: 1.111 ± 0.599
3.333ThrIle: 3.333 ± 1.796
3.333ThrLys: 3.333 ± 1.229
2.222ThrLeu: 2.222 ± 1.427
0.0ThrMet: 0.0 ± 0.0
3.333ThrAsn: 3.333 ± 1.229
5.556ThrPro: 5.556 ± 1.593
3.333ThrGln: 3.333 ± 1.229
3.333ThrArg: 3.333 ± 2.782
6.667ThrSer: 6.667 ± 2.458
10.0ThrThr: 10.0 ± 7.8
3.333ThrVal: 3.333 ± 1.796
2.222ThrTrp: 2.222 ± 1.198
4.444ThrTyr: 4.444 ± 1.304
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
0.0ValCys: 0.0 ± 0.0
0.0ValAsp: 0.0 ± 0.0
0.0ValGlu: 0.0 ± 0.0
3.333ValPhe: 3.333 ± 1.796
1.111ValGly: 1.111 ± 0.599
2.222ValHis: 2.222 ± 1.198
2.222ValIle: 2.222 ± 1.198
4.444ValLys: 4.444 ± 2.855
0.0ValLeu: 0.0 ± 0.0
0.0ValMet: 0.0 ± 0.0
2.222ValAsn: 2.222 ± 1.198
1.111ValPro: 1.111 ± 0.599
1.111ValGln: 1.111 ± 0.599
2.222ValArg: 2.222 ± 1.198
0.0ValSer: 0.0 ± 0.0
1.111ValThr: 1.111 ± 1.811
1.111ValVal: 1.111 ± 0.599
1.111ValTrp: 1.111 ± 0.599
1.111ValTyr: 1.111 ± 0.599
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.111TrpAsp: 1.111 ± 0.599
1.111TrpGlu: 1.111 ± 0.599
0.0TrpPhe: 0.0 ± 0.0
3.333TrpGly: 3.333 ± 1.796
0.0TrpHis: 0.0 ± 0.0
1.111TrpIle: 1.111 ± 2.507
3.333TrpLys: 3.333 ± 5.667
3.333TrpLeu: 3.333 ± 1.796
1.111TrpMet: 1.111 ± 0.599
1.111TrpAsn: 1.111 ± 0.599
0.0TrpPro: 0.0 ± 0.0
2.222TrpGln: 2.222 ± 1.198
3.333TrpArg: 3.333 ± 1.796
2.222TrpSer: 2.222 ± 1.198
1.111TrpThr: 1.111 ± 0.599
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
2.222TrpTyr: 2.222 ± 1.198
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.111TyrAla: 1.111 ± 0.599
2.222TyrCys: 2.222 ± 1.198
2.222TyrAsp: 2.222 ± 1.198
2.222TyrGlu: 2.222 ± 1.198
2.222TyrPhe: 2.222 ± 1.198
3.333TyrGly: 3.333 ± 1.689
0.0TyrHis: 0.0 ± 0.0
4.444TyrIle: 4.444 ± 2.395
1.111TyrLys: 1.111 ± 0.599
3.333TyrLeu: 3.333 ± 1.796
0.0TyrMet: 0.0 ± 0.0
7.778TyrAsn: 7.778 ± 4.191
3.333TyrPro: 3.333 ± 1.796
4.444TyrGln: 4.444 ± 2.395
4.444TyrArg: 4.444 ± 2.395
3.333TyrSer: 3.333 ± 3.206
2.222TyrThr: 2.222 ± 1.427
2.222TyrVal: 2.222 ± 1.427
1.111TyrTrp: 1.111 ± 0.599
2.222TyrTyr: 2.222 ± 1.198
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (901 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
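One plausible reading of protein-level bootstrapping is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The function names, the use of the population standard deviation as the error measure, and the fixed seed are all assumptions made for illustration; the page does not specify these details.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Spread of the frequency across resampled proteomes.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, and the error is the standard deviation of the estimates.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.pstdev(estimates)
```

With only 3 proteins in this proteome, each resample can differ considerably from the original set, which would be consistent with errors that are sometimes larger than the reported value itself.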

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski