Amino acid dipeptide frequency for Torque teno mini virus 9

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
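As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs in a list of protein sequences and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the choice not to count pairs across protein boundaries are assumptions made for this example; the exact procedure used by Proteome-pI is not described on this page.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the column order used by the table on this page.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins."""
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard residues is mapped to Xaa.
        residues = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping pairs (0,1), (1,2), ...; assumed not to span two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }

freqs = dipeptide_per_mille(["MKLLA", "AKLX"])  # made-up toy sequences
print(freqs["LysLeu"])
```

The result always contains all 441 pairs, so dipeptides that never occur appear with a frequency of 0.0, as in the table.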

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each one should be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
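The comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, and the rule shown (a plain comparison with 1/400, i.e. 2.5‰) is an assumption; the page does not state the exact threshold used for the red and blue marking.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)

def classify(freq_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # would be marked in red
    if freq_per_mille < expected:
        return "under"  # would be marked in blue
    return "expected"

expected = expected_per_mille()  # 1000 / 400, i.e. 0.25%
# Three values copied from the table on this page.
sample = {"ArgArg": 17.837, "AlaCys": 1.115, "AlaPhe": 0.0}
labels = {name: classify(value, expected) for name, value in sample.items()}
print(expected, labels)
```

Calling `expected_per_mille(21)` gives the corresponding baseline when Xaa is counted as a 21st residue type.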

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
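For readers who prefer fractions or percentages, the conversion is a one-liner. The function names below are hypothetical and serve only to make the unit handling explicit.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0

leu_leu = 13.378  # LeuLeu entry from the table, in per mille
print(per_mille_to_fraction(leu_leu))
print(per_mille_to_percent(leu_leu))
```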

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.459 ± 1.95
AlaCys: 1.115 ± 0.552
AlaAsp: 2.23 ± 2.477
AlaGlu: 2.23 ± 1.849
AlaPhe: 0.0 ± 0.0
AlaGly: 2.23 ± 5.733
AlaHis: 0.0 ± 0.0
AlaIle: 3.344 ± 1.657
AlaLys: 3.344 ± 1.657
AlaLeu: 2.23 ± 2.477
AlaMet: 0.0 ± 0.0
AlaAsn: 1.115 ± 0.552
AlaPro: 5.574 ± 2.761
AlaGln: 0.0 ± 0.0
AlaArg: 0.0 ± 0.0
AlaSer: 1.115 ± 0.552
AlaThr: 3.344 ± 1.657
AlaVal: 1.115 ± 0.552
AlaTrp: 1.115 ± 0.552
AlaTyr: 1.115 ± 0.552
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.115 ± 2.867
CysAsp: 1.115 ± 2.867
CysGlu: 1.115 ± 0.552
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.115 ± 0.552
CysLys: 2.23 ± 2.477
CysLeu: 2.23 ± 2.477
CysMet: 0.0 ± 0.0
CysAsn: 1.115 ± 0.552
CysPro: 2.23 ± 1.105
CysGln: 2.23 ± 1.849
CysArg: 2.23 ± 1.849
CysSer: 1.115 ± 0.552
CysThr: 2.23 ± 2.477
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.23 ± 5.733
AspCys: 1.115 ± 2.867
AspAsp: 3.344 ± 1.666
AspGlu: 3.344 ± 1.657
AspPhe: 2.23 ± 2.477
AspGly: 3.344 ± 5.329
AspHis: 0.0 ± 0.0
AspIle: 4.459 ± 1.95
AspLys: 1.115 ± 0.552
AspLeu: 2.23 ± 2.477
AspMet: 1.115 ± 1.586
AspAsn: 2.23 ± 1.105
AspPro: 2.23 ± 2.477
AspGln: 3.344 ± 1.657
AspArg: 2.23 ± 1.105
AspSer: 1.115 ± 2.161
AspThr: 2.23 ± 1.105
AspVal: 1.115 ± 0.552
AspTrp: 1.115 ± 0.552
AspTyr: 2.23 ± 1.105
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.0 ± 0.0
GluCys: 1.115 ± 2.867
GluAsp: 2.23 ± 2.477
GluGlu: 6.689 ± 6.057
GluPhe: 2.23 ± 1.105
GluGly: 3.344 ± 2.16
GluHis: 1.115 ± 0.552
GluIle: 1.115 ± 0.552
GluLys: 3.344 ± 3.028
GluLeu: 6.689 ± 3.332
GluMet: 1.115 ± 0.532
GluAsn: 3.344 ± 2.16
GluPro: 2.23 ± 2.477
GluGln: 3.344 ± 1.657
GluArg: 2.23 ± 1.105
GluSer: 4.459 ± 3.698
GluThr: 3.344 ± 1.657
GluVal: 1.115 ± 0.552
GluTrp: 0.0 ± 0.0
GluTyr: 1.115 ± 0.552
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.23 ± 1.105
PheCys: 2.23 ± 3.579
PheAsp: 1.115 ± 0.552
PheGlu: 1.115 ± 0.552
PhePhe: 2.23 ± 1.105
PheGly: 2.23 ± 5.733
PheHis: 3.344 ± 1.657
PheIle: 1.115 ± 0.552
PheLys: 3.344 ± 1.657
PheLeu: 1.115 ± 0.552
PheMet: 1.115 ± 0.552
PheAsn: 1.115 ± 2.867
PhePro: 0.0 ± 0.0
PheGln: 2.23 ± 1.105
PheArg: 2.23 ± 1.105
PheSer: 1.115 ± 0.552
PheThr: 4.459 ± 1.95
PheVal: 2.23 ± 1.105
PheTrp: 1.115 ± 0.552
PheTyr: 1.115 ± 0.552
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.344 ± 5.329
GlyCys: 1.115 ± 0.552
GlyAsp: 4.459 ± 4.954
GlyGlu: 4.459 ± 4.954
GlyPhe: 5.574 ± 1.885
GlyGly: 4.459 ± 2.209
GlyHis: 2.23 ± 2.477
GlyIle: 0.0 ± 0.0
GlyLys: 2.23 ± 1.105
GlyLeu: 0.0 ± 0.0
GlyMet: 0.0 ± 2.056
GlyAsn: 4.459 ± 2.209
GlyPro: 1.115 ± 0.552
GlyGln: 3.344 ± 1.657
GlyArg: 0.0 ± 0.0
GlySer: 0.0 ± 0.0
GlyThr: 7.804 ± 0.846
GlyVal: 1.115 ± 2.161
GlyTrp: 2.23 ± 1.105
GlyTyr: 1.115 ± 0.552
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 1.115 ± 2.867
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 3.344 ± 2.16
HisHis: 1.115 ± 2.161
HisIle: 0.0 ± 0.0
HisLys: 4.459 ± 1.656
HisLeu: 1.115 ± 2.161
HisMet: 1.115 ± 0.552
HisAsn: 1.115 ± 0.552
HisPro: 1.115 ± 0.552
HisGln: 1.115 ± 0.552
HisArg: 4.459 ± 3.698
HisSer: 2.23 ± 1.849
HisThr: 1.115 ± 2.161
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 2.23 ± 1.105
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.344 ± 1.657
IleCys: 1.115 ± 0.552
IleAsp: 1.115 ± 0.552
IleGlu: 0.0 ± 0.0
IlePhe: 3.344 ± 2.16
IleGly: 2.23 ± 2.477
IleHis: 0.0 ± 0.0
IleIle: 3.344 ± 2.16
IleLys: 1.115 ± 0.552
IleLeu: 2.23 ± 1.105
IleMet: 1.115 ± 0.552
IleAsn: 3.344 ± 2.16
IlePro: 4.459 ± 2.478
IleGln: 1.115 ± 0.552
IleArg: 2.23 ± 1.105
IleSer: 1.115 ± 0.552
IleThr: 10.033 ± 3.425
IleVal: 1.115 ± 2.867
IleTrp: 0.0 ± 0.0
IleTyr: 2.23 ± 1.105
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.23 ± 1.105
LysCys: 1.115 ± 2.867
LysAsp: 5.574 ± 1.885
LysGlu: 6.689 ± 3.332
LysPhe: 3.344 ± 2.16
LysGly: 4.459 ± 2.209
LysHis: 2.23 ± 4.322
LysIle: 2.23 ± 1.105
LysLys: 5.574 ± 5.093
LysLeu: 10.033 ± 0.385
LysMet: 0.0 ± 0.0
LysAsn: 3.344 ± 3.028
LysPro: 3.344 ± 2.16
LysGln: 2.23 ± 4.322
LysArg: 5.574 ± 2.761
LysSer: 3.344 ± 2.16
LysThr: 7.804 ± 5.313
LysVal: 2.23 ± 1.105
LysTrp: 2.23 ± 1.105
LysTyr: 4.459 ± 2.209
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.115 ± 0.552
LeuCys: 1.115 ± 0.552
LeuAsp: 6.689 ± 1.978
LeuGlu: 2.23 ± 2.477
LeuPhe: 1.115 ± 2.867
LeuGly: 3.344 ± 1.657
LeuHis: 2.23 ± 1.105
LeuIle: 3.344 ± 2.16
LeuLys: 10.033 ± 4.407
LeuLeu: 13.378 ± 1.968
LeuMet: 3.344 ± 1.657
LeuAsn: 4.459 ± 1.656
LeuPro: 5.574 ± 2.761
LeuGln: 8.919 ± 5.164
LeuArg: 5.574 ± 1.93
LeuSer: 6.689 ± 1.978
LeuThr: 6.689 ± 2.123
LeuVal: 2.23 ± 1.105
LeuTrp: 3.344 ± 1.657
LeuTyr: 5.574 ± 2.761
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.115 ± 0.552
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 1.115 ± 0.552
MetPhe: 1.115 ± 2.161
MetGly: 1.115 ± 0.552
MetHis: 0.0 ± 0.0
MetIle: 1.115 ± 0.552
MetLys: 2.23 ± 1.105
MetLeu: 1.115 ± 0.552
MetMet: 0.0 ± 0.0
MetAsn: 3.344 ± 3.028
MetPro: 3.344 ± 1.657
MetGln: 1.115 ± 0.552
MetArg: 0.0 ± 0.0
MetSer: 3.344 ± 2.16
MetThr: 1.115 ± 0.552
MetVal: 2.23 ± 1.105
MetTrp: 0.0 ± 0.0
MetTyr: 1.115 ± 0.552
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.344 ± 1.666
AsnCys: 0.0 ± 0.0
AsnAsp: 1.115 ± 0.552
AsnGlu: 2.23 ± 1.105
AsnPhe: 2.23 ± 1.105
AsnGly: 3.344 ± 1.657
AsnHis: 0.0 ± 0.0
AsnIle: 3.344 ± 1.657
AsnLys: 4.459 ± 2.209
AsnLeu: 8.919 ± 6.767
AsnMet: 2.23 ± 1.105
AsnAsn: 4.459 ± 1.656
AsnPro: 3.344 ± 1.657
AsnGln: 2.23 ± 3.579
AsnArg: 2.23 ± 1.105
AsnSer: 4.459 ± 3.698
AsnThr: 4.459 ± 5.6
AsnVal: 3.344 ± 1.657
AsnTrp: 2.23 ± 1.105
AsnTyr: 1.115 ± 2.161
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.23 ± 1.105
ProCys: 1.115 ± 0.552
ProAsp: 3.344 ± 2.16
ProGlu: 0.0 ± 0.0
ProPhe: 3.344 ± 1.657
ProGly: 3.344 ± 1.666
ProHis: 2.23 ± 1.105
ProIle: 4.459 ± 3.698
ProLys: 2.23 ± 1.105
ProLeu: 11.148 ± 3.383
ProMet: 2.23 ± 1.105
ProAsn: 3.344 ± 2.16
ProPro: 4.459 ± 2.209
ProGln: 1.115 ± 0.552
ProArg: 2.23 ± 1.849
ProSer: 4.459 ± 1.656
ProThr: 4.459 ± 1.95
ProVal: 0.0 ± 0.0
ProTrp: 1.115 ± 0.552
ProTyr: 4.459 ± 2.209
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.115 ± 0.552
GlnCys: 1.115 ± 0.552
GlnAsp: 2.23 ± 1.105
GlnGlu: 3.344 ± 2.16
GlnPhe: 0.0 ± 0.0
GlnGly: 1.115 ± 0.552
GlnHis: 4.459 ± 3.698
GlnIle: 3.344 ± 1.657
GlnLys: 6.689 ± 3.603
GlnLeu: 5.574 ± 3.476
GlnMet: 1.115 ± 2.161
GlnAsn: 2.23 ± 1.105
GlnPro: 4.459 ± 3.698
GlnGln: 5.574 ± 5.823
GlnArg: 2.23 ± 1.105
GlnSer: 4.459 ± 1.656
GlnThr: 3.344 ± 1.657
GlnVal: 2.23 ± 1.105
GlnTrp: 2.23 ± 2.477
GlnTyr: 1.115 ± 0.552
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.115 ± 0.552
ArgCys: 1.115 ± 2.161
ArgAsp: 0.0 ± 0.0
ArgGlu: 1.115 ± 0.552
ArgPhe: 4.459 ± 2.209
ArgGly: 1.115 ± 0.552
ArgHis: 1.115 ± 2.161
ArgIle: 1.115 ± 0.552
ArgLys: 6.689 ± 2.123
ArgLeu: 3.344 ± 1.657
ArgMet: 2.23 ± 1.105
ArgAsn: 3.344 ± 1.666
ArgPro: 3.344 ± 1.666
ArgGln: 4.459 ± 1.95
ArgArg: 17.837 ± 6.625
ArgSer: 0.0 ± 0.0
ArgThr: 2.23 ± 1.105
ArgVal: 2.23 ± 1.849
ArgTrp: 3.344 ± 1.657
ArgTyr: 5.574 ± 1.885
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.344 ± 1.657
SerCys: 0.0 ± 0.0
SerAsp: 4.459 ± 3.698
SerGlu: 4.459 ± 1.656
SerPhe: 0.0 ± 0.0
SerGly: 1.115 ± 2.161
SerHis: 0.0 ± 0.0
SerIle: 2.23 ± 1.105
SerLys: 3.344 ± 3.984
SerLeu: 5.574 ± 1.822
SerMet: 0.0 ± 0.0
SerAsn: 2.23 ± 1.105
SerPro: 6.689 ± 2.123
SerGln: 2.23 ± 1.849
SerArg: 2.23 ± 3.579
SerSer: 10.033 ± 9.513
SerThr: 8.919 ± 3.655
SerVal: 2.23 ± 1.105
SerTrp: 0.0 ± 0.0
SerTyr: 4.459 ± 3.698
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.23 ± 1.105
ThrCys: 3.344 ± 1.657
ThrAsp: 1.115 ± 0.552
ThrGlu: 8.919 ± 3.655
ThrPhe: 2.23 ± 1.105
ThrGly: 5.574 ± 1.885
ThrHis: 2.23 ± 1.849
ThrIle: 4.459 ± 4.954
ThrLys: 8.919 ± 3.655
ThrLeu: 7.804 ± 0.846
ThrMet: 1.115 ± 0.552
ThrAsn: 7.804 ± 3.276
ThrPro: 2.23 ± 1.105
ThrGln: 5.574 ± 2.761
ThrArg: 1.115 ± 0.552
ThrSer: 7.804 ± 5.313
ThrThr: 10.033 ± 4.97
ThrVal: 2.23 ± 1.105
ThrTrp: 0.0 ± 0.0
ThrTyr: 3.344 ± 1.657
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.0 ± 0.0
ValCys: 1.115 ± 0.552
ValAsp: 1.115 ± 0.552
ValGlu: 1.115 ± 0.552
ValPhe: 0.0 ± 0.0
ValGly: 0.0 ± 0.0
ValHis: 2.23 ± 2.477
ValIle: 2.23 ± 1.105
ValLys: 1.115 ± 0.552
ValLeu: 2.23 ± 1.105
ValMet: 2.23 ± 1.105
ValAsn: 1.115 ± 0.552
ValPro: 2.23 ± 1.105
ValGln: 3.344 ± 3.984
ValArg: 2.23 ± 1.105
ValSer: 3.344 ± 1.657
ValThr: 1.115 ± 0.552
ValVal: 0.0 ± 0.0
ValTrp: 0.0 ± 0.0
ValTyr: 2.23 ± 1.105
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.23 ± 1.105
TrpCys: 1.115 ± 0.552
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.115 ± 0.552
TrpGly: 3.344 ± 1.657
TrpHis: 1.115 ± 0.552
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 1.115 ± 2.867
TrpAsn: 1.115 ± 0.552
TrpPro: 0.0 ± 0.0
TrpGln: 1.115 ± 0.552
TrpArg: 5.574 ± 2.761
TrpSer: 0.0 ± 0.0
TrpThr: 1.115 ± 0.552
TrpVal: 1.115 ± 0.552
TrpTrp: 1.115 ± 0.552
TrpTyr: 2.23 ± 1.105
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 0.0 ± 0.0
TyrAsp: 0.0 ± 0.0
TyrGlu: 0.0 ± 0.0
TyrPhe: 2.23 ± 1.105
TyrGly: 1.115 ± 0.552
TyrHis: 0.0 ± 0.0
TyrIle: 2.23 ± 2.477
TyrLys: 5.574 ± 1.822
TyrLeu: 10.033 ± 4.97
TyrMet: 2.23 ± 1.105
TyrAsn: 4.459 ± 2.209
TyrPro: 3.344 ± 1.657
TyrGln: 3.344 ± 1.657
TyrArg: 3.344 ± 1.657
TyrSer: 3.344 ± 3.984
TyrThr: 2.23 ± 1.105
TyrVal: 1.115 ± 0.552
TyrTrp: 2.23 ± 1.105
TyrTyr: 1.115 ± 0.552
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (898 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
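A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are all assumptions; the page does not specify which statistic Proteome-pI reports.

```python
import random
import statistics
from collections import Counter

def pair_frequencies(proteins):
    """Per-mille frequency of each overlapping residue pair (one-letter codes)."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()} if total else {}

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of one pair's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequencies(resample).get(pair, 0.0))
    return statistics.stdev(estimates)

toy_proteome = ["MKLLAKL", "MARRRS", "MTTKLE"]  # made-up sequences
print(bootstrap_error(toy_proteome, "KL"))
```

With only three proteins, as in this proteome, each replicate can differ a lot from the original sample, which is one plausible reason why several errors in the table are as large as the values themselves.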

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski