Amino acid dipeptide frequency for Torque teno mini virus 1 (isolate TLMV-CBD279)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 amino acid dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
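The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's actual pipeline: the function name `dipeptide_per_mille`, the use of overlapping windows within each protein, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); anything else is counted as X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Pool overlapping dipeptide counts over all proteins; return per-mille frequencies."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; a window never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation over the 20 x 20 = 400 standard dipeptides:
# 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2

if __name__ == "__main__":
    toy = ["MKKLA", "AKKB"]  # hypothetical sequences, not the viral proteome
    for pair, freq in sorted(dipeptide_per_mille(toy).items()):
        label = "over" if freq > EXPECTED_PER_MILLE else "under"
        print(pair, round(freq, 3), label)
```

In the toy input, the non-standard letter B is mapped to X, so the second sequence contributes the dipeptide KX rather than KB.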

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
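As a small worked example of the unit conversion, the sketch below turns a per-mille value into a fraction and a percentage, and expresses the 0.25% uniform expectation in per mille so that it can be compared directly with the table. The helper names are illustrative only; the sample value is the ArgArg entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value / 10.0


arg_arg = 15.873  # ArgArg value from the table, in per mille

print(per_mille_to_fraction(arg_arg))  # fraction of all dipeptides
print(per_mille_to_percent(arg_arg))   # the same value in percent

# The uniform expectation of 0.25% corresponds to 2.5 per mille.
uniform_percent = 0.25
uniform_per_mille = uniform_percent * 10.0
```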

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency in ‰ ± the estimated error.

1st/2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.669 ± 1.405 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.268 ± 1.229 | 1.134 ± 0.615 | 1.134 ± 2.411 | 2.268 ± 1.229 | 1.134 ± 2.411 | 1.134 ± 0.615 | 2.268 ± 2.114 | 0.0 ± 0.0 | 2.268 ± 3.606 | 4.535 ± 2.458 | 1.134 ± 2.556 | 1.134 ± 0.615 | 1.134 ± 0.615 | 0.0 ± 0.0 | 1.134 ± 2.411 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Cys | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.268 ± 2.114 | 1.134 ± 0.615 | 0.0 ± 0.0 | 1.134 ± 0.615 | 2.268 ± 2.114 | 0.0 ± 0.0 | 1.134 ± 2.411 | 0.0 ± 0.0 | 1.134 ± 2.556 | 1.134 ± 0.615 | 1.134 ± 0.615 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.268 ± 1.229 | 0.0 ± 0.0
Asp | 2.268 ± 4.821 | 0.0 ± 0.0 | 2.268 ± 2.114 | 3.401 ± 1.844 | 5.669 ± 1.812 | 2.268 ± 2.114 | 1.134 ± 0.615 | 4.535 ± 2.012 | 3.401 ± 1.844 | 6.803 ± 6.833 | 0.0 ± 0.0 | 1.134 ± 0.615 | 1.134 ± 0.615 | 6.803 ± 3.687 | 0.0 ± 0.0 | 2.268 ± 2.064 | 0.0 ± 0.0 | 1.134 ± 0.615 | 0.0 ± 0.0 | 2.268 ± 2.114 | 0.0 ± 0.0
Glu | 2.268 ± 2.064 | 1.134 ± 0.615 | 4.535 ± 4.227 | 9.07 ± 5.966 | 0.0 ± 0.0 | 3.401 ± 1.97 | 0.0 ± 0.0 | 1.134 ± 0.615 | 3.401 ± 1.97 | 5.669 ± 3.073 | 0.0 ± 0.0 | 2.268 ± 2.114 | 2.268 ± 1.229 | 4.535 ± 2.402 | 1.134 ± 2.556 | 5.669 ± 3.692 | 9.07 ± 4.917 | 1.134 ± 0.615 | 1.134 ± 0.615 | 2.268 ± 1.229 | 0.0 ± 0.0
Phe | 1.134 ± 2.411 | 0.0 ± 0.0 | 2.268 ± 2.114 | 2.268 ± 2.114 | 2.268 ± 1.229 | 1.134 ± 0.615 | 0.0 ± 0.0 | 2.268 ± 1.229 | 4.535 ± 2.458 | 4.535 ± 2.402 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.268 ± 1.229 | 2.268 ± 1.229 | 5.669 ± 1.405 | 2.268 ± 1.229 | 4.535 ± 1.407 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.401 ± 1.656 | 0.0 ± 0.0
Gly | 0.0 ± 0.0 | 1.134 ± 2.411 | 1.134 ± 2.411 | 3.401 ± 5.576 | 0.0 ± 0.0 | 3.401 ± 1.844 | 1.134 ± 0.615 | 1.134 ± 0.615 | 2.268 ± 1.229 | 3.401 ± 1.97 | 0.0 ± 1.996 | 6.803 ± 1.65 | 3.401 ± 1.844 | 1.134 ± 0.615 | 3.401 ± 1.656 | 3.401 ± 1.844 | 5.669 ± 2.229 | 0.0 ± 0.0 | 1.134 ± 0.615 | 0.0 ± 0.0 | 0.0 ± 0.0
His | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.134 ± 2.411 | 1.134 ± 0.615 | 1.134 ± 2.556 | 1.134 ± 2.411 | 1.134 ± 2.556 | 1.134 ± 0.615 | 3.401 ± 1.656 | 5.669 ± 1.812 | 0.0 ± 0.0 | 2.268 ± 1.229 | 3.401 ± 1.844 | 1.134 ± 2.556 | 0.0 ± 0.0 | 1.134 ± 2.556 | 0.0 ± 0.0 | 1.134 ± 0.615 | 1.134 ± 0.615 | 1.134 ± 0.615 | 0.0 ± 0.0
Ile | 1.134 ± 0.615 | 0.0 ± 0.0 | 2.268 ± 1.229 | 5.669 ± 3.073 | 1.134 ± 2.556 | 0.0 ± 0.0 | 1.134 ± 0.615 | 3.401 ± 1.844 | 1.134 ± 2.411 | 6.803 ± 2.578 | 0.0 ± 0.0 | 4.535 ± 2.012 | 1.134 ± 0.615 | 2.268 ± 1.229 | 5.669 ± 3.073 | 3.401 ± 1.844 | 5.669 ± 3.073 | 3.401 ± 1.844 | 1.134 ± 2.411 | 2.268 ± 1.229 | 0.0 ± 0.0
Lys | 3.401 ± 1.844 | 1.134 ± 0.615 | 6.803 ± 1.247 | 2.268 ± 1.229 | 2.268 ± 1.229 | 3.401 ± 1.97 | 3.401 ± 5.767 | 3.401 ± 1.844 | 12.472 ± 8.5 | 6.803 ± 3.687 | 0.0 ± 0.0 | 2.268 ± 3.606 | 3.401 ± 1.844 | 6.803 ± 1.65 | 7.937 ± 3.012 | 5.669 ± 3.692 | 6.803 ± 6.193 | 1.134 ± 0.615 | 3.401 ± 1.844 | 6.803 ± 1.65 | 0.0 ± 0.0
Leu | 0.0 ± 0.0 | 2.268 ± 2.114 | 3.401 ± 1.97 | 6.803 ± 4.026 | 9.07 ± 4.023 | 3.401 ± 1.844 | 1.134 ± 0.615 | 4.535 ± 2.458 | 10.204 ± 0.95 | 4.535 ± 2.458 | 3.401 ± 2.054 | 3.401 ± 1.844 | 7.937 ± 3.013 | 10.204 ± 10.788 | 3.401 ± 1.844 | 2.268 ± 1.229 | 9.07 ± 0.604 | 1.134 ± 0.615 | 1.134 ± 0.615 | 4.535 ± 2.458 | 0.0 ± 0.0
Met | 0.0 ± 0.0 | 1.134 ± 2.411 | 0.0 ± 0.0 | 1.134 ± 0.615 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.134 ± 0.615 | 2.268 ± 1.229 | 1.134 ± 0.615 | 0.0 ± 0.0 | 1.134 ± 2.411 | 3.401 ± 1.656 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.134 ± 0.615 | 0.0 ± 0.0
Asn | 1.134 ± 2.411 | 0.0 ± 0.0 | 1.134 ± 2.411 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.268 ± 2.114 | 2.268 ± 1.229 | 2.268 ± 1.229 | 4.535 ± 1.407 | 2.268 ± 1.229 | 1.134 ± 0.615 | 3.401 ± 1.97 | 2.268 ± 1.229 | 4.535 ± 2.402 | 0.0 ± 0.0 | 3.401 ± 4.606 | 7.937 ± 3.572 | 2.268 ± 1.229 | 3.401 ± 1.844 | 3.401 ± 1.844 | 0.0 ± 0.0
Pro | 6.803 ± 2.578 | 1.134 ± 0.615 | 0.0 ± 0.0 | 2.268 ± 1.229 | 3.401 ± 1.844 | 2.268 ± 1.229 | 0.0 ± 0.0 | 3.401 ± 1.844 | 6.803 ± 3.687 | 6.803 ± 3.687 | 1.134 ± 0.615 | 1.134 ± 0.615 | 3.401 ± 1.844 | 2.268 ± 1.229 | 1.134 ± 2.556 | 5.669 ± 1.812 | 5.669 ± 3.073 | 2.268 ± 2.064 | 1.134 ± 0.615 | 2.268 ± 1.229 | 0.0 ± 0.0
Gln | 0.0 ± 0.0 | 1.134 ± 2.556 | 5.669 ± 3.073 | 3.401 ± 1.844 | 0.0 ± 0.0 | 2.268 ± 2.064 | 3.401 ± 4.606 | 5.669 ± 3.073 | 2.268 ± 5.113 | 9.07 ± 5.826 | 2.268 ± 1.229 | 1.134 ± 2.556 | 3.401 ± 1.844 | 2.268 ± 2.064 | 2.268 ± 1.229 | 3.401 ± 1.656 | 5.669 ± 4.553 | 1.134 ± 0.615 | 3.401 ± 3.001 | 2.268 ± 1.229 | 0.0 ± 0.0
Arg | 0.0 ± 0.0 | 1.134 ± 0.615 | 2.268 ± 5.113 | 2.268 ± 2.064 | 4.535 ± 2.458 | 2.268 ± 1.229 | 2.268 ± 1.229 | 1.134 ± 0.615 | 9.07 ± 2.815 | 3.401 ± 1.844 | 0.0 ± 0.0 | 1.134 ± 0.615 | 4.535 ± 1.407 | 1.134 ± 0.615 | 15.873 ± 5.992 | 3.401 ± 1.656 | 4.535 ± 4.128 | 2.268 ± 2.064 | 2.268 ± 1.229 | 1.134 ± 0.615 | 0.0 ± 0.0
Ser | 1.134 ± 2.556 | 1.134 ± 0.615 | 1.134 ± 0.615 | 9.07 ± 2.815 | 2.268 ± 2.114 | 3.401 ± 1.656 | 0.0 ± 0.0 | 1.134 ± 0.615 | 5.669 ± 4.553 | 6.803 ± 3.687 | 2.268 ± 1.659 | 4.535 ± 1.407 | 5.669 ± 3.692 | 2.268 ± 2.064 | 5.669 ± 6.664 | 11.338 ± 20.167 | 6.803 ± 1.247 | 0.0 ± 0.0 | 1.134 ± 0.615 | 2.268 ± 1.229 | 0.0 ± 0.0
Thr | 1.134 ± 0.615 | 1.134 ± 0.615 | 9.07 ± 4.917 | 4.535 ± 2.012 | 3.401 ± 1.656 | 4.535 ± 1.407 | 3.401 ± 5.576 | 5.669 ± 1.812 | 12.472 ± 4.4 | 5.669 ± 3.692 | 1.134 ± 0.615 | 4.535 ± 2.458 | 4.535 ± 2.012 | 5.669 ± 1.812 | 2.268 ± 1.229 | 10.204 ± 4.969 | 7.937 ± 3.348 | 1.134 ± 0.615 | 1.134 ± 0.615 | 2.268 ± 1.229 | 0.0 ± 0.0
Val | 1.134 ± 0.615 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.134 ± 0.615 | 2.268 ± 1.229 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.268 ± 1.229 | 2.268 ± 1.229 | 2.268 ± 1.229 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.401 ± 1.844 | 1.134 ± 2.556 | 2.268 ± 1.229 | 1.134 ± 0.615 | 3.401 ± 1.656 | 1.134 ± 0.615 | 0.0 ± 0.0 | 2.268 ± 2.114 | 0.0 ± 0.0
Trp | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.401 ± 1.844 | 2.268 ± 1.229 | 2.268 ± 2.114 | 2.268 ± 2.064 | 2.268 ± 1.229 | 0.0 ± 0.0 | 1.134 ± 2.411 | 0.0 ± 0.0 | 1.134 ± 0.615 | 1.134 ± 0.615 | 1.134 ± 0.615 | 2.268 ± 1.229 | 1.134 ± 0.615 | 1.134 ± 0.615 | 2.268 ± 1.229 | 0.0 ± 0.0
Tyr | 1.134 ± 0.615 | 1.134 ± 0.615 | 3.401 ± 1.97 | 0.0 ± 0.0 | 2.268 ± 1.229 | 1.134 ± 0.615 | 1.134 ± 0.615 | 5.669 ± 3.073 | 1.134 ± 0.615 | 4.535 ± 2.458 | 0.0 ± 0.0 | 3.401 ± 1.844 | 0.0 ± 0.0 | 1.134 ± 0.615 | 4.535 ± 1.407 | 4.535 ± 2.402 | 3.401 ± 1.844 | 4.535 ± 2.458 | 1.134 ± 0.615 | 4.535 ± 2.458 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 3 proteins (883 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
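A protein-level bootstrap of this kind might look like the sketch below. The source states only that bootstrapping with 100 resamples was done at the protein level; resampling whole proteins with replacement and reporting the standard deviation of the resampled frequencies are assumptions, as are the function names and the toy sequences.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide, pooled over overlapping windows."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKKLA", "AKKA", "GGGKK"]  # hypothetical sequences
    print(pair_per_mille(toy, "KK"), "+/-", bootstrap_error(toy, "KK"))
```

Because whole proteins are the resampling unit, a proteome with very few proteins yields only a handful of distinct resamples.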

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski