Amino acid dipepetide frequency for Hubei toti-like virus 15

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.121AlaAla: 6.121 ± 1.771
0.556AlaCys: 0.556 ± 0.388
2.226AlaAsp: 2.226 ± 0.461
3.895AlaGlu: 3.895 ± 1.31
2.226AlaPhe: 2.226 ± 0.211
2.782AlaGly: 2.782 ± 0.744
1.669AlaHis: 1.669 ± 0.849
3.895AlaIle: 3.895 ± 0.033
2.226AlaLys: 2.226 ± 0.882
6.121AlaLeu: 6.121 ± 0.915
2.226AlaMet: 2.226 ± 0.461
7.234AlaAsn: 7.234 ± 0.994
7.791AlaPro: 7.791 ± 0.605
1.669AlaGln: 1.669 ± 0.494
5.008AlaArg: 5.008 ± 0.138
12.243AlaSer: 12.243 ± 2.198
5.565AlaThr: 5.565 ± 0.816
4.452AlaVal: 4.452 ± 0.921
3.339AlaTrp: 3.339 ± 1.659
2.782AlaTyr: 2.782 ± 0.744
0.0AlaXaa: 0.0 ± 0.0
Cys
1.113CysAla: 1.113 ± 0.777
0.0CysCys: 0.0 ± 0.0
0.556CysAsp: 0.556 ± 0.283
1.113CysGlu: 1.113 ± 0.566
0.0CysPhe: 0.0 ± 0.0
0.556CysGly: 0.556 ± 0.388
0.556CysHis: 0.556 ± 0.388
1.113CysIle: 1.113 ± 0.105
0.556CysLys: 0.556 ± 0.283
1.113CysLeu: 1.113 ± 0.105
0.0CysMet: 0.0 ± 0.0
0.556CysAsn: 0.556 ± 0.283
0.556CysPro: 0.556 ± 0.283
0.0CysGln: 0.0 ± 0.0
0.556CysArg: 0.556 ± 0.388
1.669CysSer: 1.669 ± 0.494
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.113AspAla: 1.113 ± 0.566
0.556AspCys: 0.556 ± 0.388
1.113AspAsp: 1.113 ± 0.105
2.226AspGlu: 2.226 ± 1.554
0.0AspPhe: 0.0 ± 0.0
2.226AspGly: 2.226 ± 0.211
1.113AspHis: 1.113 ± 0.105
1.113AspIle: 1.113 ± 0.566
1.113AspLys: 1.113 ± 0.566
3.339AspLeu: 3.339 ± 0.316
2.226AspMet: 2.226 ± 0.219
0.556AspAsn: 0.556 ± 0.283
4.452AspPro: 4.452 ± 0.921
0.556AspGln: 0.556 ± 0.283
1.113AspArg: 1.113 ± 0.566
3.339AspSer: 3.339 ± 0.355
4.452AspThr: 4.452 ± 1.593
2.226AspVal: 2.226 ± 0.461
2.782AspTrp: 2.782 ± 0.072
1.669AspTyr: 1.669 ± 0.178
0.0AspXaa: 0.0 ± 0.0
Glu
4.452GluAla: 4.452 ± 0.25
1.113GluCys: 1.113 ± 0.777
1.669GluAsp: 1.669 ± 0.494
5.008GluGlu: 5.008 ± 2.824
2.782GluPhe: 2.782 ± 1.271
1.113GluGly: 1.113 ± 0.777
1.113GluHis: 1.113 ± 0.777
5.565GluIle: 5.565 ± 0.527
1.669GluLys: 1.669 ± 0.494
3.339GluLeu: 3.339 ± 0.316
0.0GluMet: 0.0 ± 0.0
1.669GluAsn: 1.669 ± 0.178
2.782GluPro: 2.782 ± 0.599
2.782GluGln: 2.782 ± 0.744
1.669GluArg: 1.669 ± 1.165
3.895GluSer: 3.895 ± 0.638
3.895GluThr: 3.895 ± 0.033
2.226GluVal: 2.226 ± 1.132
1.669GluTrp: 1.669 ± 0.178
1.113GluTyr: 1.113 ± 0.105
0.0GluXaa: 0.0 ± 0.0
Phe
2.782PheAla: 2.782 ± 0.072
0.0PheCys: 0.0 ± 0.0
1.669PheAsp: 1.669 ± 0.178
2.782PheGlu: 2.782 ± 0.744
0.556PhePhe: 0.556 ± 0.283
1.669PheGly: 1.669 ± 0.494
0.0PheHis: 0.0 ± 0.0
0.556PheIle: 0.556 ± 0.388
1.113PheLys: 1.113 ± 0.777
2.226PheLeu: 2.226 ± 0.211
1.113PheMet: 1.113 ± 0.105
0.556PheAsn: 0.556 ± 0.283
1.113PhePro: 1.113 ± 0.777
0.556PheGln: 0.556 ± 0.388
1.113PheArg: 1.113 ± 0.105
2.226PheSer: 2.226 ± 0.461
1.113PheThr: 1.113 ± 0.105
1.669PheVal: 1.669 ± 0.178
0.0PheTrp: 0.0 ± 0.0
1.113PheTyr: 1.113 ± 0.777
0.0PheXaa: 0.0 ± 0.0
Gly
3.895GlyAla: 3.895 ± 0.033
0.556GlyCys: 0.556 ± 0.388
1.113GlyAsp: 1.113 ± 0.105
2.226GlyGlu: 2.226 ± 0.882
1.669GlyPhe: 1.669 ± 0.849
4.452GlyGly: 4.452 ± 1.764
0.0GlyHis: 0.0 ± 0.0
2.782GlyIle: 2.782 ± 0.744
1.113GlyLys: 1.113 ± 0.105
3.339GlyLeu: 3.339 ± 0.355
0.556GlyMet: 0.556 ± 0.388
1.669GlyAsn: 1.669 ± 0.178
3.339GlyPro: 3.339 ± 0.355
3.339GlyGln: 3.339 ± 0.355
3.339GlyArg: 3.339 ± 0.316
5.565GlySer: 5.565 ± 0.816
6.121GlyThr: 6.121 ± 0.915
1.113GlyVal: 1.113 ± 0.777
1.669GlyTrp: 1.669 ± 0.178
1.669GlyTyr: 1.669 ± 0.494
0.0GlyXaa: 0.0 ± 0.0
His
3.895HisAla: 3.895 ± 0.033
0.0HisCys: 0.0 ± 0.0
0.556HisAsp: 0.556 ± 0.388
0.0HisGlu: 0.0 ± 0.0
1.113HisPhe: 1.113 ± 0.566
1.113HisGly: 1.113 ± 0.105
0.556HisHis: 0.556 ± 0.283
2.226HisIle: 2.226 ± 0.211
0.556HisLys: 0.556 ± 0.388
4.452HisLeu: 4.452 ± 1.093
1.669HisMet: 1.669 ± 0.494
1.113HisAsn: 1.113 ± 0.105
1.113HisPro: 1.113 ± 0.566
0.556HisGln: 0.556 ± 0.388
1.113HisArg: 1.113 ± 0.105
0.556HisSer: 0.556 ± 0.283
2.226HisThr: 2.226 ± 0.882
1.113HisVal: 1.113 ± 0.566
0.0HisTrp: 0.0 ± 0.0
0.556HisTyr: 0.556 ± 0.388
0.0HisXaa: 0.0 ± 0.0
Ile
5.008IleAla: 5.008 ± 0.533
0.556IleCys: 0.556 ± 0.388
3.895IleAsp: 3.895 ± 0.638
1.113IleGlu: 1.113 ± 0.105
1.113IlePhe: 1.113 ± 0.105
1.113IleGly: 1.113 ± 0.566
2.782IleHis: 2.782 ± 0.599
5.565IleIle: 5.565 ± 0.145
3.339IleLys: 3.339 ± 0.355
1.669IleLeu: 1.669 ± 0.178
0.0IleMet: 0.0 ± 0.0
3.339IleAsn: 3.339 ± 0.355
6.121IlePro: 6.121 ± 0.244
1.669IleGln: 1.669 ± 0.178
2.782IleArg: 2.782 ± 0.599
9.46IleSer: 9.46 ± 1.231
6.121IleThr: 6.121 ± 0.244
2.782IleVal: 2.782 ± 0.072
0.0IleTrp: 0.0 ± 0.0
1.113IleTyr: 1.113 ± 0.105
0.0IleXaa: 0.0 ± 0.0
Lys
2.226LysAla: 2.226 ± 0.211
0.0LysCys: 0.0 ± 0.0
1.113LysAsp: 1.113 ± 0.105
5.008LysGlu: 5.008 ± 2.824
0.556LysPhe: 0.556 ± 0.283
2.226LysGly: 2.226 ± 0.882
1.669LysHis: 1.669 ± 0.494
3.339LysIle: 3.339 ± 1.659
4.452LysLys: 4.452 ± 0.25
2.226LysLeu: 2.226 ± 0.461
1.113LysMet: 1.113 ± 0.566
1.113LysAsn: 1.113 ± 0.566
3.895LysPro: 3.895 ± 1.31
2.226LysGln: 2.226 ± 0.882
3.895LysArg: 3.895 ± 0.705
1.669LysSer: 1.669 ± 0.178
7.791LysThr: 7.791 ± 0.738
2.782LysVal: 2.782 ± 0.599
1.669LysTrp: 1.669 ± 1.165
2.226LysTyr: 2.226 ± 1.554
0.0LysXaa: 0.0 ± 0.0
Leu
4.452LeuAla: 4.452 ± 0.25
1.669LeuCys: 1.669 ± 0.178
2.226LeuAsp: 2.226 ± 0.211
2.226LeuGlu: 2.226 ± 0.882
1.113LeuPhe: 1.113 ± 0.105
3.895LeuGly: 3.895 ± 0.705
1.669LeuHis: 1.669 ± 0.178
1.113LeuIle: 1.113 ± 0.566
7.234LeuLys: 7.234 ± 1.692
6.121LeuLeu: 6.121 ± 1.587
1.113LeuMet: 1.113 ± 0.777
3.339LeuAsn: 3.339 ± 2.331
6.678LeuPro: 6.678 ± 1.304
3.895LeuGln: 3.895 ± 1.981
4.452LeuArg: 4.452 ± 1.093
12.799LeuSer: 12.799 ± 2.481
6.121LeuThr: 6.121 ± 0.428
3.895LeuVal: 3.895 ± 0.033
0.556LeuTrp: 0.556 ± 0.388
3.339LeuTyr: 3.339 ± 0.316
0.0LeuXaa: 0.0 ± 0.0
Met
4.452MetAla: 4.452 ± 0.25
0.0MetCys: 0.0 ± 0.0
1.113MetAsp: 1.113 ± 0.105
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.669MetGly: 1.669 ± 0.494
0.556MetHis: 0.556 ± 0.388
1.113MetIle: 1.113 ± 0.566
0.0MetLys: 0.0 ± 0.0
2.782MetLeu: 2.782 ± 0.599
0.0MetMet: 0.0 ± 0.0
0.556MetAsn: 0.556 ± 0.388
2.782MetPro: 2.782 ± 0.072
0.0MetGln: 0.0 ± 0.0
0.556MetArg: 0.556 ± 0.388
2.782MetSer: 2.782 ± 0.599
1.669MetThr: 1.669 ± 0.178
1.113MetVal: 1.113 ± 0.105
1.113MetTrp: 1.113 ± 0.105
1.113MetTyr: 1.113 ± 0.777
0.0MetXaa: 0.0 ± 0.0
Asn
3.895AsnAla: 3.895 ± 1.31
0.0AsnCys: 0.0 ± 0.0
1.669AsnAsp: 1.669 ± 0.494
1.113AsnGlu: 1.113 ± 0.105
0.0AsnPhe: 0.0 ± 0.0
2.226AsnGly: 2.226 ± 0.461
1.113AsnHis: 1.113 ± 0.105
3.339AsnIle: 3.339 ± 0.316
2.226AsnLys: 2.226 ± 0.461
2.226AsnLeu: 2.226 ± 0.461
1.669AsnMet: 1.669 ± 0.178
2.782AsnAsn: 2.782 ± 0.744
3.339AsnPro: 3.339 ± 1.698
0.556AsnGln: 0.556 ± 0.283
4.452AsnArg: 4.452 ± 0.25
3.895AsnSer: 3.895 ± 0.638
4.452AsnThr: 4.452 ± 0.921
2.782AsnVal: 2.782 ± 1.271
1.669AsnTrp: 1.669 ± 0.178
1.113AsnTyr: 1.113 ± 0.105
0.0AsnXaa: 0.0 ± 0.0
Pro
6.121ProAla: 6.121 ± 0.428
1.669ProCys: 1.669 ± 0.849
4.452ProAsp: 4.452 ± 0.921
3.339ProGlu: 3.339 ± 0.355
1.113ProPhe: 1.113 ± 0.105
3.339ProGly: 3.339 ± 1.027
2.226ProHis: 2.226 ± 0.882
3.895ProIle: 3.895 ± 0.705
2.226ProLys: 2.226 ± 0.882
6.678ProLeu: 6.678 ± 1.382
1.113ProMet: 1.113 ± 0.105
1.669ProAsn: 1.669 ± 0.178
7.791ProPro: 7.791 ± 3.291
1.669ProGln: 1.669 ± 0.849
2.782ProArg: 2.782 ± 0.599
9.46ProSer: 9.46 ± 0.112
6.121ProThr: 6.121 ± 2.442
3.339ProVal: 3.339 ± 0.316
0.556ProTrp: 0.556 ± 0.388
3.895ProTyr: 3.895 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
3.895GlnAla: 3.895 ± 0.033
0.0GlnCys: 0.0 ± 0.0
0.556GlnAsp: 0.556 ± 0.283
2.226GlnGlu: 2.226 ± 0.211
0.556GlnPhe: 0.556 ± 0.388
1.669GlnGly: 1.669 ± 0.178
0.0GlnHis: 0.0 ± 0.0
0.556GlnIle: 0.556 ± 0.283
3.339GlnLys: 3.339 ± 1.659
4.452GlnLeu: 4.452 ± 1.093
0.556GlnMet: 0.556 ± 0.388
3.339GlnAsn: 3.339 ± 1.027
0.556GlnPro: 0.556 ± 0.283
2.226GlnGln: 2.226 ± 1.132
0.0GlnArg: 0.0 ± 0.0
2.782GlnSer: 2.782 ± 1.415
3.895GlnThr: 3.895 ± 0.638
0.556GlnVal: 0.556 ± 0.283
1.113GlnTrp: 1.113 ± 0.566
1.669GlnTyr: 1.669 ± 0.178
0.0GlnXaa: 0.0 ± 0.0
Arg
3.339ArgAla: 3.339 ± 0.355
0.556ArgCys: 0.556 ± 0.388
1.113ArgAsp: 1.113 ± 0.566
2.226ArgGlu: 2.226 ± 0.211
2.226ArgPhe: 2.226 ± 0.882
1.113ArgGly: 1.113 ± 0.566
1.669ArgHis: 1.669 ± 0.178
2.782ArgIle: 2.782 ± 1.271
2.782ArgLys: 2.782 ± 0.599
3.895ArgLeu: 3.895 ± 0.705
1.113ArgMet: 1.113 ± 0.777
2.226ArgAsn: 2.226 ± 0.461
1.669ArgPro: 1.669 ± 0.178
2.226ArgGln: 2.226 ± 0.211
2.226ArgArg: 2.226 ± 1.554
5.008ArgSer: 5.008 ± 1.481
4.452ArgThr: 4.452 ± 1.093
2.226ArgVal: 2.226 ± 0.211
0.556ArgTrp: 0.556 ± 0.388
2.226ArgTyr: 2.226 ± 0.461
0.0ArgXaa: 0.0 ± 0.0
Ser
9.46SerAla: 9.46 ± 0.783
0.556SerCys: 0.556 ± 0.283
3.339SerAsp: 3.339 ± 1.027
7.234SerGlu: 7.234 ± 0.994
3.895SerPhe: 3.895 ± 1.376
7.791SerGly: 7.791 ± 0.066
1.669SerHis: 1.669 ± 0.849
8.347SerIle: 8.347 ± 1.56
6.678SerLys: 6.678 ± 1.304
7.234SerLeu: 7.234 ± 1.021
0.556SerMet: 0.556 ± 0.283
5.008SerAsn: 5.008 ± 1.876
6.121SerPro: 6.121 ± 0.244
2.226SerGln: 2.226 ± 0.882
2.226SerArg: 2.226 ± 0.882
9.46SerSer: 9.46 ± 2.126
6.678SerThr: 6.678 ± 2.054
8.904SerVal: 8.904 ± 2.514
3.339SerTrp: 3.339 ± 0.355
7.234SerTyr: 7.234 ± 0.349
0.0SerXaa: 0.0 ± 0.0
Thr
10.017ThrAla: 10.017 ± 2.409
0.556ThrCys: 0.556 ± 0.283
3.339ThrAsp: 3.339 ± 0.316
2.782ThrGlu: 2.782 ± 0.744
2.782ThrPhe: 2.782 ± 0.744
3.339ThrGly: 3.339 ± 1.027
2.226ThrHis: 2.226 ± 0.882
5.565ThrIle: 5.565 ± 1.198
5.008ThrLys: 5.008 ± 1.481
7.234ThrLeu: 7.234 ± 0.994
4.452ThrMet: 4.452 ± 0.421
2.782ThrAsn: 2.782 ± 0.072
7.791ThrPro: 7.791 ± 1.277
2.782ThrGln: 2.782 ± 0.744
1.669ThrArg: 1.669 ± 0.178
8.904ThrSer: 8.904 ± 1.171
8.347ThrThr: 8.347 ± 2.231
3.895ThrVal: 3.895 ± 0.033
1.113ThrTrp: 1.113 ± 0.105
1.669ThrTyr: 1.669 ± 0.178
0.0ThrXaa: 0.0 ± 0.0
Val
5.008ValAla: 5.008 ± 0.138
1.669ValCys: 1.669 ± 0.178
2.782ValAsp: 2.782 ± 0.744
2.226ValGlu: 2.226 ± 0.882
1.113ValPhe: 1.113 ± 0.105
2.782ValGly: 2.782 ± 0.744
1.669ValHis: 1.669 ± 0.178
3.339ValIle: 3.339 ± 1.027
2.782ValLys: 2.782 ± 0.744
3.895ValLeu: 3.895 ± 0.033
1.669ValMet: 1.669 ± 0.494
1.669ValAsn: 1.669 ± 0.178
3.339ValPro: 3.339 ± 0.355
1.669ValGln: 1.669 ± 0.494
3.339ValArg: 3.339 ± 0.355
5.565ValSer: 5.565 ± 0.816
3.339ValThr: 3.339 ± 0.316
2.782ValVal: 2.782 ± 0.072
1.113ValTrp: 1.113 ± 0.777
1.669ValTyr: 1.669 ± 0.849
0.0ValXaa: 0.0 ± 0.0
Trp
1.669TrpAla: 1.669 ± 1.165
0.0TrpCys: 0.0 ± 0.0
0.556TrpAsp: 0.556 ± 0.283
0.556TrpGlu: 0.556 ± 0.283
0.0TrpPhe: 0.0 ± 0.0
1.113TrpGly: 1.113 ± 0.777
0.556TrpHis: 0.556 ± 0.388
2.782TrpIle: 2.782 ± 0.744
0.556TrpLys: 0.556 ± 0.388
1.113TrpLeu: 1.113 ± 0.777
0.556TrpMet: 0.556 ± 0.286
1.113TrpAsn: 1.113 ± 0.105
1.669TrpPro: 1.669 ± 0.494
1.669TrpGln: 1.669 ± 0.494
1.113TrpArg: 1.113 ± 0.777
2.226TrpSer: 2.226 ± 0.211
2.226TrpThr: 2.226 ± 0.461
2.226TrpVal: 2.226 ± 0.461
1.113TrpTrp: 1.113 ± 0.105
1.113TrpTyr: 1.113 ± 0.105
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.113TyrAla: 1.113 ± 0.566
0.0TyrCys: 0.0 ± 0.0
2.782TyrAsp: 2.782 ± 0.744
2.226TyrGlu: 2.226 ± 1.554
1.113TyrPhe: 1.113 ± 0.105
3.339TyrGly: 3.339 ± 0.988
1.669TyrHis: 1.669 ± 0.178
0.556TyrIle: 0.556 ± 0.388
2.226TyrLys: 2.226 ± 0.211
4.452TyrLeu: 4.452 ± 0.421
1.113TyrMet: 1.113 ± 0.105
2.226TyrAsn: 2.226 ± 0.461
0.556TyrPro: 0.556 ± 0.283
1.669TyrGln: 1.669 ± 0.178
2.226TyrArg: 2.226 ± 0.461
4.452TyrSer: 4.452 ± 1.093
1.669TyrThr: 1.669 ± 0.178
3.339TyrVal: 3.339 ± 0.316
0.556TyrTrp: 0.556 ± 0.388
2.782TyrTyr: 2.782 ± 0.599
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1798 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski