Amino acid dipeptide frequency for Beihai picorna-like virus 23

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
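The calculation described above can be sketched in a few lines of Python. This is only an illustration and not the Proteome-pI code: the function names `dipeptide_per_mille` and `classify` are hypothetical, the example sequences are made up, and the sketch assumes that "expected" means the uniform value of 1/400 (2.5‰), which the page does not state explicitly.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids
ALPHABET = STANDARD + "X"          # X (Xaa) stands for any non-standard/unknown residue


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in sequences:
        # Map every non-standard character to X; pairs never span two proteins.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2)")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freqs, expected=1000.0 / 400):
    """Label each dipeptide relative to the assumed uniform expectation (2.5 per mille)."""
    labels = {}
    for pair, value in freqs.items():
        if value > expected:
            labels[pair] = "over"      # would be marked in red
        elif value < expected:
            labels[pair] = "under"     # would be marked in blue
        else:
            labels[pair] = "expected"
    return labels


if __name__ == "__main__":
    toy = ["MKTAYIAKQR", "MKKLLAGA"]  # made-up sequences, not the viral proteins
    freqs = dipeptide_per_mille(toy)
    print(freqs["MK"], classify(freqs)["MK"])
```

Counting overlapping pairs within each protein separately avoids creating an artificial dipeptide at the junction between two sequences; whether the database does the same is not stated on this page.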

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
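As a small worked example of this unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical and chosen only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10


if __name__ == "__main__":
    # AlaAla is listed as 4.566 per mille in the table.
    print(per_mille_to_fraction(4.566))
    print(per_mille_to_percent(4.566))
```

On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.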

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell shows the frequency in ‰ ± the estimated error.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 4.566 ± 0.906 | 1.66 ± 0.231 | 3.321 ± 1.148 | 1.66 ± 0.454 | 2.491 ± 0.69 | 3.736 ± 0.692 | 0.83 ± 0.459 | 4.566 ± 0.465 | 6.227 ± 0.011 | 6.227 ± 1.361 | 2.906 ± 0.919 | 2.076 ± 0.911 | 2.491 ± 0.004 | 3.321 ± 0.463 | 4.151 ± 0.236 | 2.491 ± 0.004 | 3.736 ± 0.679 | 4.151 ± 0.45 | 0.415 ± 0.456 | 3.321 ± 0.463 | 0.0 ± 0.0 |
| Cys | 1.66 ± 0.231 | 0.0 ± 0.0 | 0.83 ± 0.459 | 0.415 ± 0.229 | 1.66 ± 0.917 | 2.906 ± 1.605 | 0.0 ± 0.0 | 0.83 ± 0.459 | 1.245 ± 0.002 | 1.66 ± 0.917 | 0.0 ± 0.0 | 0.415 ± 0.456 | 1.245 ± 0.688 | 0.0 ± 0.0 | 0.83 ± 0.227 | 1.66 ± 0.454 | 1.66 ± 0.231 | 0.83 ± 0.459 | 0.415 ± 0.229 | 1.245 ± 0.688 | 0.0 ± 0.0 |
| Asp | 3.736 ± 1.378 | 0.415 ± 0.229 | 2.906 ± 0.452 | 2.076 ± 0.225 | 4.151 ± 0.45 | 1.66 ± 0.231 | 0.83 ± 0.227 | 5.396 ± 0.448 | 3.736 ± 1.378 | 3.736 ± 1.378 | 1.245 ± 0.688 | 2.491 ± 0.681 | 3.736 ± 3.422 | 3.321 ± 1.148 | 2.076 ± 0.461 | 2.491 ± 1.367 | 1.66 ± 0.231 | 2.491 ± 0.681 | 1.245 ± 0.688 | 2.491 ± 0.004 | 0.0 ± 0.0 |
| Glu | 2.906 ± 0.452 | 2.906 ± 1.605 | 4.151 ± 1.136 | 3.736 ± 1.378 | 2.491 ± 0.69 | 2.906 ± 0.234 | 1.66 ± 0.917 | 2.906 ± 1.138 | 4.566 ± 1.151 | 7.057 ± 0.469 | 1.66 ± 1.14 | 2.491 ± 0.004 | 3.736 ± 0.679 | 2.076 ± 1.146 | 1.245 ± 0.688 | 3.736 ± 0.679 | 3.321 ± 1.834 | 4.151 ± 0.236 | 0.83 ± 0.459 | 0.83 ± 0.227 | 0.0 ± 0.0 |
| Phe | 4.981 ± 0.677 | 0.0 ± 0.0 | 2.491 ± 0.004 | 7.057 ± 1.155 | 2.491 ± 1.367 | 5.812 ± 0.467 | 0.83 ± 0.227 | 0.415 ± 0.229 | 3.736 ± 2.063 | 2.906 ± 0.919 | 1.245 ± 0.688 | 3.736 ± 1.365 | 2.076 ± 0.225 | 2.906 ± 1.824 | 3.321 ± 0.223 | 1.66 ± 0.454 | 5.396 ± 1.819 | 3.736 ± 0.006 | 0.0 ± 0.0 | 1.245 ± 0.002 | 0.0 ± 0.0 |
| Gly | 4.151 ± 0.921 | 0.83 ± 0.459 | 4.566 ± 0.465 | 3.736 ± 0.006 | 2.491 ± 1.367 | 3.736 ± 0.006 | 0.83 ± 0.459 | 3.736 ± 0.692 | 4.566 ± 0.465 | 4.981 ± 0.694 | 1.66 ± 0.231 | 4.981 ± 0.677 | 2.906 ± 1.138 | 1.66 ± 0.454 | 3.736 ± 2.736 | 5.812 ± 0.467 | 2.906 ± 0.234 | 7.472 ± 0.013 | 0.83 ± 0.227 | 1.66 ± 1.14 | 0.0 ± 0.0 |
| His | 2.076 ± 0.225 | 0.83 ± 0.459 | 1.245 ± 0.002 | 0.415 ± 0.229 | 2.076 ± 0.225 | 0.83 ± 0.459 | 0.0 ± 0.0 | 1.245 ± 0.688 | 1.245 ± 0.002 | 1.66 ± 0.917 | 0.83 ± 0.459 | 0.0 ± 0.0 | 1.66 ± 0.231 | 0.0 ± 0.0 | 0.415 ± 0.229 | 0.83 ± 0.459 | 0.0 ± 0.0 | 2.076 ± 0.225 | 0.83 ± 0.227 | 1.245 ± 0.002 | 0.0 ± 0.0 |
| Ile | 4.151 ± 1.136 | 1.245 ± 0.688 | 4.151 ± 0.45 | 3.736 ± 0.692 | 4.566 ± 1.151 | 4.566 ± 0.465 | 1.245 ± 0.684 | 2.076 ± 1.146 | 2.906 ± 0.234 | 2.906 ± 0.919 | 1.245 ± 0.002 | 3.321 ± 0.223 | 3.736 ± 0.006 | 0.83 ± 0.227 | 2.491 ± 0.004 | 4.151 ± 0.45 | 1.66 ± 0.231 | 1.66 ± 0.454 | 0.83 ± 0.459 | 0.83 ± 0.459 | 0.0 ± 0.0 |
| Lys | 2.491 ± 1.376 | 0.415 ± 0.229 | 2.491 ± 1.376 | 4.566 ± 1.836 | 4.981 ± 1.38 | 3.736 ± 0.006 | 1.66 ± 0.231 | 1.66 ± 0.454 | 3.321 ± 1.834 | 5.396 ± 0.238 | 2.491 ± 0.004 | 2.076 ± 0.225 | 3.321 ± 0.463 | 1.66 ± 0.231 | 4.151 ± 0.921 | 2.906 ± 0.919 | 3.736 ± 2.063 | 3.321 ± 0.463 | 0.415 ± 0.456 | 2.906 ± 1.605 | 0.0 ± 0.0 |
| Leu | 4.566 ± 1.151 | 3.321 ± 1.834 | 4.151 ± 0.236 | 3.736 ± 1.378 | 3.321 ± 1.834 | 6.642 ± 0.446 | 2.491 ± 0.004 | 4.151 ± 0.921 | 4.981 ± 0.694 | 7.887 ± 0.242 | 1.245 ± 0.688 | 4.981 ± 2.049 | 4.981 ± 2.049 | 2.076 ± 0.225 | 4.566 ± 0.465 | 4.981 ± 2.066 | 8.717 ± 0.701 | 5.396 ± 1.609 | 1.66 ± 0.454 | 2.906 ± 0.234 | 0.0 ± 0.0 |
| Met | 0.83 ± 0.913 | 0.415 ± 0.229 | 1.245 ± 0.688 | 2.906 ± 0.919 | 1.245 ± 0.684 | 0.83 ± 0.913 | 0.415 ± 0.229 | 1.66 ± 0.917 | 1.245 ± 0.002 | 2.076 ± 0.225 | 0.0 ± 0.0 | 1.245 ± 0.002 | 1.245 ± 0.002 | 0.415 ± 0.229 | 1.245 ± 0.002 | 1.245 ± 0.688 | 2.076 ± 0.461 | 0.83 ± 0.227 | 0.415 ± 0.229 | 2.076 ± 0.911 | 0.0 ± 0.0 |
| Asn | 2.491 ± 0.004 | 0.83 ± 0.227 | 2.491 ± 0.681 | 3.736 ± 2.736 | 2.076 ± 1.596 | 5.396 ± 1.134 | 0.83 ± 0.227 | 3.736 ± 0.006 | 0.83 ± 0.459 | 5.396 ± 1.609 | 1.245 ± 0.183 | 1.66 ± 0.454 | 2.076 ± 0.911 | 1.245 ± 0.688 | 2.491 ± 0.681 | 3.321 ± 0.223 | 3.736 ± 2.736 | 4.151 ± 0.236 | 1.66 ± 0.231 | 2.491 ± 1.367 | 0.0 ± 0.0 |
| Pro | 3.736 ± 1.365 | 0.83 ± 0.227 | 2.906 ± 0.919 | 2.491 ± 0.681 | 3.321 ± 1.594 | 2.076 ± 0.225 | 1.245 ± 0.002 | 2.906 ± 0.919 | 2.076 ± 1.146 | 5.396 ± 0.923 | 0.83 ± 0.697 | 2.906 ± 0.919 | 2.076 ± 1.596 | 1.66 ± 1.826 | 1.66 ± 1.826 | 2.076 ± 0.911 | 4.566 ± 1.592 | 0.83 ± 0.913 | 1.245 ± 0.002 | 3.736 ± 2.736 | 0.0 ± 0.0 |
| Gln | 2.076 ± 0.225 | 0.0 ± 0.0 | 1.245 ± 0.684 | 2.076 ± 0.225 | 0.83 ± 0.459 | 2.906 ± 0.919 | 0.0 ± 0.0 | 1.245 ± 0.002 | 1.245 ± 0.688 | 4.151 ± 2.293 | 0.83 ± 0.459 | 3.736 ± 0.006 | 0.83 ± 0.913 | 2.906 ± 0.234 | 2.076 ± 0.461 | 3.736 ± 1.365 | 1.66 ± 0.454 | 2.491 ± 0.004 | 1.245 ± 0.002 | 2.491 ± 0.004 | 0.0 ± 0.0 |
| Arg | 3.321 ± 1.148 | 1.245 ± 0.688 | 3.321 ± 1.148 | 3.736 ± 0.006 | 3.736 ± 0.679 | 4.151 ± 1.136 | 0.83 ± 0.227 | 2.491 ± 0.681 | 3.321 ± 1.834 | 4.566 ± 0.221 | 1.245 ± 0.002 | 1.245 ± 0.688 | 3.736 ± 1.365 | 0.415 ± 0.229 | 2.491 ± 0.69 | 2.491 ± 0.004 | 2.906 ± 1.138 | 4.566 ± 0.465 | 0.415 ± 0.229 | 1.66 ± 1.14 | 0.0 ± 0.0 |
| Ser | 2.906 ± 1.824 | 0.415 ± 0.229 | 1.66 ± 0.231 | 2.906 ± 0.452 | 3.321 ± 0.223 | 4.981 ± 2.049 | 2.076 ± 1.146 | 2.491 ± 0.004 | 2.076 ± 1.146 | 5.812 ± 0.467 | 2.076 ± 1.596 | 3.736 ± 0.679 | 2.491 ± 1.367 | 4.566 ± 0.221 | 2.491 ± 0.004 | 3.321 ± 1.594 | 5.812 ± 2.276 | 5.396 ± 0.238 | 0.83 ± 0.227 | 2.906 ± 0.452 | 0.0 ± 0.0 |
| Thr | 3.321 ± 0.463 | 1.245 ± 0.002 | 2.906 ± 0.234 | 4.151 ± 0.236 | 3.736 ± 0.006 | 5.812 ± 3.647 | 1.245 ± 0.688 | 4.566 ± 0.221 | 2.906 ± 0.234 | 4.151 ± 1.136 | 1.245 ± 0.002 | 4.151 ± 1.136 | 3.321 ± 0.463 | 3.321 ± 1.148 | 2.491 ± 0.69 | 6.227 ± 2.732 | 6.227 ± 1.361 | 4.151 ± 0.921 | 1.245 ± 0.002 | 1.66 ± 0.454 | 0.0 ± 0.0 |
| Val | 5.812 ± 1.153 | 2.076 ± 0.461 | 2.906 ± 1.824 | 2.906 ± 0.919 | 3.321 ± 0.909 | 0.83 ± 0.459 | 1.245 ± 0.688 | 2.906 ± 0.234 | 3.321 ± 0.223 | 5.812 ± 0.467 | 1.245 ± 0.002 | 4.151 ± 0.45 | 2.491 ± 0.004 | 2.906 ± 0.919 | 3.736 ± 0.692 | 4.981 ± 0.677 | 3.321 ± 0.909 | 3.321 ± 1.148 | 1.66 ± 0.231 | 3.736 ± 0.006 | 0.0 ± 0.0 |
| Trp | 1.66 ± 0.454 | 0.0 ± 0.0 | 0.83 ± 0.913 | 1.245 ± 0.688 | 0.83 ± 0.459 | 0.0 ± 0.0 | 0.415 ± 0.229 | 0.83 ± 0.459 | 1.245 ± 0.688 | 0.83 ± 0.459 | 0.0 ± 0.0 | 1.245 ± 0.684 | 0.0 ± 0.0 | 0.83 ± 0.459 | 2.076 ± 0.911 | 0.83 ± 0.913 | 2.076 ± 0.461 | 1.245 ± 0.002 | 0.415 ± 0.229 | 0.415 ± 0.229 | 0.0 ± 0.0 |
| Tyr | 2.906 ± 0.919 | 0.83 ± 0.913 | 2.076 ± 0.911 | 2.076 ± 0.461 | 2.906 ± 1.138 | 3.736 ± 1.378 | 0.83 ± 0.227 | 2.491 ± 1.367 | 2.906 ± 0.234 | 4.151 ± 1.136 | 0.0 ± 0.0 | 1.66 ± 1.14 | 0.415 ± 0.229 | 1.66 ± 0.454 | 4.151 ± 1.607 | 3.321 ± 2.28 | 2.906 ± 0.919 | 0.415 ± 0.456 | 0.415 ± 0.456 | 1.66 ± 0.454 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 2 proteins (2410 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
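A protein-level bootstrap of this kind might look roughly like the sketch below. It is an assumption about the procedure, not the database's actual implementation: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is taken as the error. The function names, the fixed seed, and the use of the population standard deviation are all illustrative choices.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of
    `dipeptide` over resamples that draw whole proteins with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.pstdev(estimates)


if __name__ == "__main__":
    toy = ["MKTAYIAKQR", "MKKLLAGA"]  # made-up sequences, not the viral proteins
    print(bootstrap_error(toy, "MK"))
```

With only two proteins, each resample is one of three combinations (first protein twice, second protein twice, or one of each), so the error mostly reflects how differently the dipeptide is used in the two proteins.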

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski