Amino acid dipepetide frequency for Hubei picorna-like virus 77

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.927AlaAla: 3.927 ± 2.644
0.0AlaCys: 0.0 ± 0.0
3.213AlaAsp: 3.213 ± 0.425
2.142AlaGlu: 2.142 ± 0.59
2.142AlaPhe: 2.142 ± 1.166
7.14AlaGly: 7.14 ± 1.338
1.428AlaHis: 1.428 ± 0.552
4.998AlaIle: 4.998 ± 0.676
3.57AlaLys: 3.57 ± 0.693
4.998AlaLeu: 4.998 ± 2.412
0.714AlaMet: 0.714 ± 0.978
3.57AlaAsn: 3.57 ± 0.831
4.641AlaPro: 4.641 ± 1.688
1.428AlaGln: 1.428 ± 0.512
1.428AlaArg: 1.428 ± 0.736
4.641AlaSer: 4.641 ± 1.066
4.284AlaThr: 4.284 ± 2.23
3.927AlaVal: 3.927 ± 1.322
0.0AlaTrp: 0.0 ± 0.0
2.856AlaTyr: 2.856 ± 1.867
0.0AlaXaa: 0.0 ± 0.0
Cys
0.357CysAla: 0.357 ± 0.184
0.357CysCys: 0.357 ± 0.184
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.714CysPhe: 0.714 ± 0.669
1.785CysGly: 1.785 ± 0.92
0.0CysHis: 0.0 ± 0.0
1.071CysIle: 1.071 ± 0.567
0.0CysLys: 0.0 ± 0.0
2.499CysLeu: 2.499 ± 0.794
0.357CysMet: 0.357 ± 0.184
0.357CysAsn: 0.357 ± 0.184
2.142CysPro: 2.142 ± 0.59
0.0CysGln: 0.0 ± 0.0
0.714CysArg: 0.714 ± 0.368
1.071CysSer: 1.071 ± 0.583
0.714CysThr: 0.714 ± 0.368
1.785CysVal: 1.785 ± 0.521
0.0CysTrp: 0.0 ± 0.0
1.071CysTyr: 1.071 ± 0.552
0.0CysXaa: 0.0 ± 0.0
Asp
3.57AspAla: 3.57 ± 1.185
1.071AspCys: 1.071 ± 0.552
4.998AspAsp: 4.998 ± 2.576
4.284AspGlu: 4.284 ± 2.208
4.284AspPhe: 4.284 ± 1.601
1.785AspGly: 1.785 ± 0.92
1.785AspHis: 1.785 ± 0.591
4.284AspIle: 4.284 ± 2.208
1.071AspLys: 1.071 ± 0.51
7.497AspLeu: 7.497 ± 2.581
0.714AspMet: 0.714 ± 0.661
2.499AspAsn: 2.499 ± 1.126
5.355AspPro: 5.355 ± 2.034
1.428AspGln: 1.428 ± 0.512
1.428AspArg: 1.428 ± 0.557
5.355AspSer: 5.355 ± 1.773
2.142AspThr: 2.142 ± 0.675
5.355AspVal: 5.355 ± 0.73
0.357AspTrp: 0.357 ± 0.184
1.785AspTyr: 1.785 ± 0.591
0.0AspXaa: 0.0 ± 0.0
Glu
2.856GluAla: 2.856 ± 0.673
0.357GluCys: 0.357 ± 0.801
2.499GluAsp: 2.499 ± 0.712
2.142GluGlu: 2.142 ± 0.675
1.785GluPhe: 1.785 ± 0.591
0.714GluGly: 0.714 ± 0.368
0.714GluHis: 0.714 ± 0.368
2.499GluIle: 2.499 ± 0.702
2.499GluLys: 2.499 ± 0.92
2.856GluLeu: 2.856 ± 1.472
1.428GluMet: 1.428 ± 0.512
1.428GluAsn: 1.428 ± 0.557
1.428GluPro: 1.428 ± 0.736
1.071GluGln: 1.071 ± 0.902
1.071GluArg: 1.071 ± 0.552
1.785GluSer: 1.785 ± 0.542
4.284GluThr: 4.284 ± 2.208
2.499GluVal: 2.499 ± 1.288
1.071GluTrp: 1.071 ± 0.567
2.499GluTyr: 2.499 ± 0.702
0.0GluXaa: 0.0 ± 0.0
Phe
3.57PheAla: 3.57 ± 2.71
1.785PheCys: 1.785 ± 0.521
2.142PheAsp: 2.142 ± 0.675
3.213PheGlu: 3.213 ± 1.701
1.428PhePhe: 1.428 ± 0.552
2.142PheGly: 2.142 ± 1.605
1.785PheHis: 1.785 ± 0.904
3.57PheIle: 3.57 ± 0.264
2.499PheLys: 2.499 ± 0.702
4.284PheLeu: 4.284 ± 1.057
1.428PheMet: 1.428 ± 0.736
3.213PheAsn: 3.213 ± 1.087
1.785PhePro: 1.785 ± 0.646
2.142PheGln: 2.142 ± 1.02
1.785PheArg: 1.785 ± 1.232
2.142PheSer: 2.142 ± 2.308
6.069PheThr: 6.069 ± 0.765
3.213PheVal: 3.213 ± 0.57
0.357PheTrp: 0.357 ± 0.184
1.428PheTyr: 1.428 ± 0.736
0.0PheXaa: 0.0 ± 0.0
Gly
3.57GlyAla: 3.57 ± 0.831
0.0GlyCys: 0.0 ± 0.0
3.57GlyAsp: 3.57 ± 1.151
0.0GlyGlu: 0.0 ± 0.0
4.998GlyPhe: 4.998 ± 0.347
3.213GlyGly: 3.213 ± 1.249
1.428GlyHis: 1.428 ± 0.736
2.142GlyIle: 2.142 ± 1.104
2.499GlyLys: 2.499 ± 0.712
4.998GlyLeu: 4.998 ± 0.828
0.357GlyMet: 0.357 ± 0.801
3.213GlyAsn: 3.213 ± 1.481
1.785GlyPro: 1.785 ± 0.646
2.499GlyGln: 2.499 ± 0.92
1.428GlyArg: 1.428 ± 0.552
4.284GlySer: 4.284 ± 2.638
2.856GlyThr: 2.856 ± 0.839
4.284GlyVal: 4.284 ± 1.586
0.357GlyTrp: 0.357 ± 0.184
1.428GlyTyr: 1.428 ± 0.557
0.0GlyXaa: 0.0 ± 0.0
His
2.142HisAla: 2.142 ± 0.59
0.357HisCys: 0.357 ± 0.184
2.142HisAsp: 2.142 ± 1.104
0.714HisGlu: 0.714 ± 0.368
0.714HisPhe: 0.714 ± 0.368
2.856HisGly: 2.856 ± 1.104
0.714HisHis: 0.714 ± 0.368
2.142HisIle: 2.142 ± 0.773
2.499HisLys: 2.499 ± 0.902
3.57HisLeu: 3.57 ± 1.303
0.357HisMet: 0.357 ± 0.184
0.714HisAsn: 0.714 ± 0.532
2.499HisPro: 2.499 ± 2.26
0.714HisGln: 0.714 ± 0.368
1.428HisArg: 1.428 ± 0.552
3.213HisSer: 3.213 ± 1.529
1.428HisThr: 1.428 ± 0.512
1.428HisVal: 1.428 ± 0.557
0.0HisTrp: 0.0 ± 0.0
1.071HisTyr: 1.071 ± 0.567
0.0HisXaa: 0.0 ± 0.0
Ile
3.57IleAla: 3.57 ± 1.154
1.071IleCys: 1.071 ± 0.552
3.213IleAsp: 3.213 ± 0.683
2.142IleGlu: 2.142 ± 0.456
2.142IlePhe: 2.142 ± 2.786
2.142IleGly: 2.142 ± 0.773
2.142IleHis: 2.142 ± 0.514
4.284IleIle: 4.284 ± 1.058
3.57IleLys: 3.57 ± 0.737
4.641IleLeu: 4.641 ± 1.133
0.714IleMet: 0.714 ± 0.532
2.142IleAsn: 2.142 ± 1.218
4.641IlePro: 4.641 ± 1.053
2.499IleGln: 2.499 ± 1.047
1.428IleArg: 1.428 ± 0.736
6.783IleSer: 6.783 ± 1.888
3.927IleThr: 3.927 ± 0.835
4.284IleVal: 4.284 ± 1.657
0.357IleTrp: 0.357 ± 0.184
4.284IleTyr: 4.284 ± 0.906
0.0IleXaa: 0.0 ± 0.0
Lys
3.213LysAla: 3.213 ± 1.768
0.357LysCys: 0.357 ± 0.184
2.856LysAsp: 2.856 ± 1.472
1.071LysGlu: 1.071 ± 0.567
3.213LysPhe: 3.213 ± 0.992
1.428LysGly: 1.428 ± 0.557
0.714LysHis: 0.714 ± 0.368
5.355LysIle: 5.355 ± 0.623
2.142LysLys: 2.142 ± 1.104
6.069LysLeu: 6.069 ± 1.343
1.785LysMet: 1.785 ± 0.92
1.785LysAsn: 1.785 ± 0.591
2.499LysPro: 2.499 ± 0.92
2.856LysGln: 2.856 ± 1.472
1.071LysArg: 1.071 ± 0.583
6.426LysSer: 6.426 ± 1.173
3.927LysThr: 3.927 ± 0.892
2.856LysVal: 2.856 ± 1.104
0.357LysTrp: 0.357 ± 0.184
3.213LysTyr: 3.213 ± 0.598
0.0LysXaa: 0.0 ± 0.0
Leu
4.641LeuAla: 4.641 ± 0.87
2.142LeuCys: 2.142 ± 1.104
6.783LeuAsp: 6.783 ± 1.412
3.57LeuGlu: 3.57 ± 1.196
3.57LeuPhe: 3.57 ± 1.248
3.213LeuGly: 3.213 ± 1.188
2.856LeuHis: 2.856 ± 0.934
3.57LeuIle: 3.57 ± 0.693
4.641LeuLys: 4.641 ± 1.687
6.783LeuLeu: 6.783 ± 1.457
0.714LeuMet: 0.714 ± 0.368
5.355LeuAsn: 5.355 ± 1.136
8.568LeuPro: 8.568 ± 1.003
3.213LeuGln: 3.213 ± 3.624
4.284LeuArg: 4.284 ± 0.892
7.14LeuSer: 7.14 ± 1.338
7.14LeuThr: 7.14 ± 2.236
8.925LeuVal: 8.925 ± 1.295
0.0LeuTrp: 0.0 ± 0.0
6.426LeuTyr: 6.426 ± 2.025
0.0LeuXaa: 0.0 ± 0.0
Met
2.499MetAla: 2.499 ± 0.92
0.0MetCys: 0.0 ± 0.0
1.428MetAsp: 1.428 ± 1.037
1.785MetGlu: 1.785 ± 0.92
0.357MetPhe: 0.357 ± 0.611
0.714MetGly: 0.714 ± 0.368
0.714MetHis: 0.714 ± 0.532
1.428MetIle: 1.428 ± 0.736
1.071MetLys: 1.071 ± 0.552
1.785MetLeu: 1.785 ± 0.904
0.357MetMet: 0.357 ± 0.49
0.0MetAsn: 0.0 ± 0.0
1.071MetPro: 1.071 ± 0.583
0.0MetGln: 0.0 ± 0.0
1.071MetArg: 1.071 ± 0.552
1.785MetSer: 1.785 ± 1.466
1.071MetThr: 1.071 ± 0.552
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.428MetTyr: 1.428 ± 0.75
0.0MetXaa: 0.0 ± 0.0
Asn
3.927AsnAla: 3.927 ± 0.836
0.714AsnCys: 0.714 ± 0.661
2.499AsnAsp: 2.499 ± 1.288
2.142AsnGlu: 2.142 ± 0.773
3.57AsnPhe: 3.57 ± 0.831
2.499AsnGly: 2.499 ± 0.794
2.856AsnHis: 2.856 ± 1.078
3.57AsnIle: 3.57 ± 1.511
3.213AsnLys: 3.213 ± 0.683
3.57AsnLeu: 3.57 ± 1.182
1.071AsnMet: 1.071 ± 0.776
4.284AsnAsn: 4.284 ± 1.586
2.856AsnPro: 2.856 ± 1.512
1.785AsnGln: 1.785 ± 1.025
2.856AsnArg: 2.856 ± 1.53
4.641AsnSer: 4.641 ± 2.915
4.998AsnThr: 4.998 ± 2.167
3.57AsnVal: 3.57 ± 1.196
0.357AsnTrp: 0.357 ± 0.184
2.499AsnTyr: 2.499 ± 0.702
0.0AsnXaa: 0.0 ± 0.0
Pro
4.641ProAla: 4.641 ± 3.994
1.785ProCys: 1.785 ± 0.92
2.499ProAsp: 2.499 ± 1.288
1.785ProGlu: 1.785 ± 0.591
2.499ProPhe: 2.499 ± 0.436
2.142ProGly: 2.142 ± 0.793
2.856ProHis: 2.856 ± 2.282
3.213ProIle: 3.213 ± 1.249
1.785ProLys: 1.785 ± 0.521
8.925ProLeu: 8.925 ± 1.931
0.714ProMet: 0.714 ± 0.532
4.641ProAsn: 4.641 ± 1.735
2.142ProPro: 2.142 ± 1.319
1.785ProGln: 1.785 ± 0.92
1.785ProArg: 1.785 ± 1.657
7.14ProSer: 7.14 ± 4.691
3.57ProThr: 3.57 ± 0.869
3.927ProVal: 3.927 ± 1.008
0.357ProTrp: 0.357 ± 0.184
2.499ProTyr: 2.499 ± 1.481
0.0ProXaa: 0.0 ± 0.0
Gln
2.856GlnAla: 2.856 ± 0.487
1.785GlnCys: 1.785 ± 0.591
1.785GlnAsp: 1.785 ± 0.542
0.714GlnGlu: 0.714 ± 0.368
2.856GlnPhe: 2.856 ± 1.367
2.142GlnGly: 2.142 ± 0.514
1.428GlnHis: 1.428 ± 0.552
3.927GlnIle: 3.927 ± 1.695
1.071GlnLys: 1.071 ± 0.552
2.499GlnLeu: 2.499 ± 0.712
0.357GlnMet: 0.357 ± 0.184
1.785GlnAsn: 1.785 ± 0.92
2.142GlnPro: 2.142 ± 0.773
0.0GlnGln: 0.0 ± 0.0
0.714GlnArg: 0.714 ± 0.669
2.142GlnSer: 2.142 ± 0.773
1.785GlnThr: 1.785 ± 0.591
1.071GlnVal: 1.071 ± 0.552
0.714GlnTrp: 0.714 ± 0.368
1.428GlnTyr: 1.428 ± 1.063
0.0GlnXaa: 0.0 ± 0.0
Arg
2.856ArgAla: 2.856 ± 1.104
0.357ArgCys: 0.357 ± 0.184
3.213ArgAsp: 3.213 ± 0.772
0.714ArgGlu: 0.714 ± 0.368
1.071ArgPhe: 1.071 ± 0.552
2.142ArgGly: 2.142 ± 0.675
0.714ArgHis: 0.714 ± 0.368
1.071ArgIle: 1.071 ± 0.552
3.213ArgLys: 3.213 ± 1.243
3.927ArgLeu: 3.927 ± 0.647
1.428ArgMet: 1.428 ± 0.736
4.998ArgAsn: 4.998 ± 0.347
1.428ArgPro: 1.428 ± 1.063
1.071ArgGln: 1.071 ± 0.583
1.428ArgArg: 1.428 ± 0.736
3.927ArgSer: 3.927 ± 1.335
3.213ArgThr: 3.213 ± 0.598
2.142ArgVal: 2.142 ± 0.773
0.0ArgTrp: 0.0 ± 0.0
1.428ArgTyr: 1.428 ± 1.283
0.0ArgXaa: 0.0 ± 0.0
Ser
2.856SerAla: 2.856 ± 1.025
1.071SerCys: 1.071 ± 0.567
5.712SerAsp: 5.712 ± 2.2
1.428SerGlu: 1.428 ± 0.512
4.998SerPhe: 4.998 ± 1.301
4.641SerGly: 4.641 ± 2.327
1.785SerHis: 1.785 ± 1.245
4.641SerIle: 4.641 ± 1.832
5.355SerLys: 5.355 ± 3.452
7.854SerLeu: 7.854 ± 1.616
0.714SerMet: 0.714 ± 0.368
6.426SerAsn: 6.426 ± 2.934
4.284SerPro: 4.284 ± 1.657
2.142SerGln: 2.142 ± 1.115
5.712SerArg: 5.712 ± 2.958
7.854SerSer: 7.854 ± 6.521
7.854SerThr: 7.854 ± 5.056
6.069SerVal: 6.069 ± 1.756
0.714SerTrp: 0.714 ± 0.368
4.998SerTyr: 4.998 ± 2.502
0.0SerXaa: 0.0 ± 0.0
Thr
5.355ThrAla: 5.355 ± 1.868
1.071ThrCys: 1.071 ± 0.567
4.641ThrAsp: 4.641 ± 1.667
2.856ThrGlu: 2.856 ± 1.472
3.213ThrPhe: 3.213 ± 1.656
2.142ThrGly: 2.142 ± 1.804
2.856ThrHis: 2.856 ± 1.53
5.712ThrIle: 5.712 ± 1.347
3.57ThrLys: 3.57 ± 1.413
6.426ThrLeu: 6.426 ± 0.866
1.785ThrMet: 1.785 ± 0.967
3.927ThrAsn: 3.927 ± 1.343
5.355ThrPro: 5.355 ± 2.817
3.213ThrGln: 3.213 ± 1.087
3.57ThrArg: 3.57 ± 1.413
6.783ThrSer: 6.783 ± 3.227
6.069ThrThr: 6.069 ± 1.731
2.856ThrVal: 2.856 ± 0.487
0.714ThrTrp: 0.714 ± 0.669
1.785ThrTyr: 1.785 ± 0.591
0.0ThrXaa: 0.0 ± 0.0
Val
3.57ValAla: 3.57 ± 1.413
0.0ValCys: 0.0 ± 0.0
3.213ValAsp: 3.213 ± 0.57
2.142ValGlu: 2.142 ± 1.104
3.57ValPhe: 3.57 ± 0.768
3.927ValGly: 3.927 ± 0.164
2.142ValHis: 2.142 ± 1.02
1.428ValIle: 1.428 ± 0.736
5.712ValLys: 5.712 ± 2.2
5.355ValLeu: 5.355 ± 2.021
1.785ValMet: 1.785 ± 0.525
2.499ValAsn: 2.499 ± 1.065
3.57ValPro: 3.57 ± 2.298
3.57ValGln: 3.57 ± 1.154
3.927ValArg: 3.927 ± 0.164
4.998ValSer: 4.998 ± 1.818
6.069ValThr: 6.069 ± 1.14
4.641ValVal: 4.641 ± 1.76
0.714ValTrp: 0.714 ± 0.368
2.856ValTyr: 2.856 ± 1.078
0.0ValXaa: 0.0 ± 0.0
Trp
0.357TrpAla: 0.357 ± 0.801
0.357TrpCys: 0.357 ± 0.184
0.357TrpAsp: 0.357 ± 0.184
0.357TrpGlu: 0.357 ± 0.184
0.0TrpPhe: 0.0 ± 0.0
0.714TrpGly: 0.714 ± 0.368
0.357TrpHis: 0.357 ± 0.184
0.0TrpIle: 0.0 ± 0.0
1.428TrpLys: 1.428 ± 0.736
0.714TrpLeu: 0.714 ± 0.368
0.357TrpMet: 0.357 ± 0.184
0.357TrpAsn: 0.357 ± 0.184
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.714TrpArg: 0.714 ± 0.368
0.357TrpSer: 0.357 ± 0.184
0.357TrpThr: 0.357 ± 0.801
0.0TrpVal: 0.0 ± 0.0
0.357TrpTrp: 0.357 ± 0.184
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.714TyrAla: 0.714 ± 1.221
0.357TyrCys: 0.357 ± 0.184
4.284TyrAsp: 4.284 ± 1.811
4.284TyrGlu: 4.284 ± 0.227
2.856TyrPhe: 2.856 ± 1.115
1.428TyrGly: 1.428 ± 0.736
1.428TyrHis: 1.428 ± 0.512
1.071TyrIle: 1.071 ± 0.583
1.785TyrLys: 1.785 ± 0.92
4.284TyrLeu: 4.284 ± 1.161
1.071TyrMet: 1.071 ± 0.51
4.284TyrAsn: 4.284 ± 1.484
2.499TyrPro: 2.499 ± 0.436
1.785TyrGln: 1.785 ± 0.646
2.499TyrArg: 2.499 ± 1.126
4.641TyrSer: 4.641 ± 1.907
2.499TyrThr: 2.499 ± 1.288
2.856TyrVal: 2.856 ± 0.599
0.357TyrTrp: 0.357 ± 0.184
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2802 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski