Amino acid dipeptide frequency for Wenling hepe-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
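The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure used by the database: it assumes overlapping dipeptide windows, pairs that never span two proteins, one-letter input sequences, and a uniform expectation of 1/400 (2.5‰). The function names `dipeptide_per_mille` and `classify` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown letter to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein (assumption: no pair
        # spans the boundary between two proteins).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_per_mille(["AAAC", "AC"])
print(freqs["AA"], classify(freqs["AA"]))
```

With real data, the same dictionary could be filtered for values above or below 2.5‰ to reproduce the red and blue marking.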

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
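As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage, and compares it with the uniform expectation of 1/400 (0.25%, i.e. 2.5‰). The helper names are hypothetical; the ThrThr value is taken from the table.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation: 2.5 per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


def fold_over_expected(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / EXPECTED_PER_MILLE


thr_thr = 9.546  # ThrThr frequency from the table, in per mille
print(f"{per_mille_to_fraction(thr_thr):.6f}")  # 0.009546
print(f"{per_mille_to_percent(thr_thr):.4f}")   # 0.9546
print(f"{fold_over_expected(thr_thr):.2f}")     # 3.82
```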

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.76 ± 1.622
AlaCys: 1.157 ± 0.536
AlaAsp: 2.603 ± 0.907
AlaGlu: 4.05 ± 1.553
AlaPhe: 1.736 ± 0.745
AlaGly: 2.893 ± 0.365
AlaHis: 1.446 ± 0.541
AlaIle: 5.496 ± 1.509
AlaLys: 6.653 ± 2.231
AlaLeu: 5.496 ± 1.373
AlaMet: 0.289 ± 0.16
AlaAsn: 2.314 ± 2.208
AlaPro: 2.025 ± 1.526
AlaGln: 3.471 ± 0.964
AlaArg: 1.157 ± 0.479
AlaSer: 4.05 ± 1.087
AlaThr: 4.339 ± 1.362
AlaVal: 3.182 ± 1.152
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.76 ± 1.576
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.868 ± 0.479
CysCys: 0.579 ± 0.319
CysAsp: 0.579 ± 0.319
CysGlu: 1.157 ± 1.371
CysPhe: 0.579 ± 0.319
CysGly: 0.868 ± 0.479
CysHis: 0.579 ± 0.319
CysIle: 0.289 ± 0.16
CysLys: 1.157 ± 0.638
CysLeu: 0.868 ± 0.479
CysMet: 0.289 ± 0.16
CysAsn: 1.157 ± 0.638
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.736 ± 0.42
CysSer: 0.868 ± 0.883
CysThr: 0.289 ± 0.16
CysVal: 1.157 ± 0.43
CysTrp: 0.0 ± 0.0
CysTyr: 0.868 ± 0.479
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.918 ± 1.036
AspCys: 0.868 ± 0.479
AspAsp: 4.05 ± 1.427
AspGlu: 4.05 ± 1.344
AspPhe: 5.496 ± 1.631
AspGly: 4.628 ± 1.818
AspHis: 1.446 ± 0.897
AspIle: 3.182 ± 1.311
AspLys: 3.182 ± 1.344
AspLeu: 5.496 ± 1.372
AspMet: 1.157 ± 1.512
AspAsn: 2.314 ± 0.958
AspPro: 2.025 ± 0.611
AspGln: 1.157 ± 0.818
AspArg: 1.736 ± 0.637
AspSer: 3.76 ± 1.138
AspThr: 3.76 ± 0.462
AspVal: 3.471 ± 0.519
AspTrp: 0.579 ± 0.319
AspTyr: 2.025 ± 0.991
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.893 ± 0.775
GluCys: 0.868 ± 0.479
GluAsp: 2.603 ± 1.031
GluGlu: 6.653 ± 2.584
GluPhe: 3.471 ± 1.145
GluGly: 2.603 ± 1.877
GluHis: 1.736 ± 0.637
GluIle: 4.339 ± 1.173
GluLys: 5.207 ± 1.317
GluLeu: 4.918 ± 1.253
GluMet: 2.025 ± 0.892
GluAsn: 4.05 ± 1.769
GluPro: 2.603 ± 0.827
GluGln: 3.182 ± 1.034
GluArg: 3.76 ± 0.673
GluSer: 3.471 ± 0.907
GluThr: 3.76 ± 1.094
GluVal: 4.339 ± 1.284
GluTrp: 0.289 ± 0.485
GluTyr: 1.736 ± 0.626
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.471 ± 1.423
PheCys: 0.289 ± 0.16
PheAsp: 3.76 ± 0.457
PheGlu: 3.76 ± 0.593
PhePhe: 0.868 ± 1.082
PheGly: 3.471 ± 1.205
PheHis: 1.157 ± 1.317
PheIle: 2.025 ± 0.756
PheLys: 3.471 ± 0.784
PheLeu: 3.76 ± 1.172
PheMet: 0.868 ± 0.717
PheAsn: 4.339 ± 2.08
PhePro: 3.182 ± 1.152
PheGln: 2.025 ± 1.039
PheArg: 0.579 ± 0.756
PheSer: 3.471 ± 1.514
PheThr: 3.182 ± 1.27
PheVal: 2.025 ± 0.728
PheTrp: 0.579 ± 0.319
PheTyr: 1.446 ± 1.023
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.471 ± 0.477
GlyCys: 0.868 ± 0.479
GlyAsp: 4.05 ± 1.328
GlyGlu: 2.025 ± 0.77
GlyPhe: 2.025 ± 0.861
GlyGly: 2.893 ± 0.682
GlyHis: 1.157 ± 0.638
GlyIle: 3.471 ± 0.519
GlyLys: 5.496 ± 1.159
GlyLeu: 4.05 ± 2.03
GlyMet: 1.446 ± 0.798
GlyAsn: 2.314 ± 0.825
GlyPro: 2.025 ± 0.846
GlyGln: 1.446 ± 0.52
GlyArg: 1.736 ± 0.958
GlySer: 2.893 ± 1.309
GlyThr: 2.893 ± 1.303
GlyVal: 4.05 ± 1.21
GlyTrp: 0.579 ± 0.409
GlyTyr: 0.579 ± 0.506
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.736 ± 0.806
HisCys: 0.0 ± 0.0
HisAsp: 1.157 ± 0.638
HisGlu: 1.446 ± 0.798
HisPhe: 1.446 ± 1.023
HisGly: 1.446 ± 0.541
HisHis: 0.0 ± 0.0
HisIle: 1.446 ± 0.744
HisLys: 1.736 ± 0.776
HisLeu: 1.157 ± 0.638
HisMet: 0.0 ± 0.0
HisAsn: 2.603 ± 1.164
HisPro: 0.868 ± 0.479
HisGln: 0.289 ± 0.16
HisArg: 0.868 ± 0.628
HisSer: 1.736 ± 0.933
HisThr: 1.446 ± 1.315
HisVal: 3.182 ± 0.844
HisTrp: 0.289 ± 0.16
HisTyr: 0.868 ± 0.479
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.339 ± 1.434
IleCys: 0.579 ± 0.319
IleAsp: 5.785 ± 1.473
IleGlu: 2.893 ± 0.941
IlePhe: 2.314 ± 0.86
IleGly: 2.314 ± 1.213
IleHis: 2.314 ± 0.518
IleIle: 4.05 ± 1.828
IleLys: 6.075 ± 1.819
IleLeu: 4.05 ± 3.233
IleMet: 0.579 ± 0.319
IleAsn: 3.76 ± 2.49
IlePro: 2.314 ± 0.825
IleGln: 2.603 ± 1.569
IleArg: 3.182 ± 0.872
IleSer: 4.05 ± 2.323
IleThr: 5.207 ± 1.562
IleVal: 4.339 ± 1.093
IleTrp: 0.0 ± 0.0
IleTyr: 2.603 ± 1.229
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.339 ± 1.093
LysCys: 1.446 ± 0.798
LysAsp: 3.182 ± 0.935
LysGlu: 4.339 ± 1.861
LysPhe: 3.76 ± 1.017
LysGly: 3.76 ± 1.367
LysHis: 1.736 ± 0.958
LysIle: 3.76 ± 0.599
LysLys: 4.628 ± 2.137
LysLeu: 8.1 ± 3.052
LysMet: 2.893 ± 1.187
LysAsn: 3.471 ± 1.915
LysPro: 3.76 ± 1.247
LysGln: 5.785 ± 1.647
LysArg: 1.736 ± 0.637
LysSer: 4.918 ± 2.251
LysThr: 3.76 ± 1.622
LysVal: 5.496 ± 2.086
LysTrp: 0.579 ± 0.506
LysTyr: 4.05 ± 0.725
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.918 ± 1.354
LeuCys: 2.025 ± 1.117
LeuAsp: 5.785 ± 1.389
LeuGlu: 6.942 ± 1.287
LeuPhe: 3.471 ± 0.864
LeuGly: 3.182 ± 1.819
LeuHis: 2.025 ± 0.444
LeuIle: 4.05 ± 0.494
LeuLys: 6.075 ± 0.941
LeuLeu: 5.785 ± 1.029
LeuMet: 1.446 ± 0.639
LeuAsn: 5.785 ± 1.848
LeuPro: 4.339 ± 0.987
LeuGln: 2.893 ± 1.677
LeuArg: 4.05 ± 1.365
LeuSer: 4.628 ± 1.436
LeuThr: 6.942 ± 0.879
LeuVal: 4.918 ± 4.5
LeuTrp: 0.868 ± 0.466
LeuTyr: 3.471 ± 1.3
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.157 ± 1.807
MetCys: 0.868 ± 0.388
MetAsp: 1.446 ± 0.798
MetGlu: 2.314 ± 1.549
MetPhe: 0.579 ± 0.319
MetGly: 0.289 ± 0.16
MetHis: 0.289 ± 0.16
MetIle: 1.446 ± 0.693
MetLys: 1.446 ± 0.798
MetLeu: 3.182 ± 1.152
MetMet: 0.0 ± 0.0
MetAsn: 1.157 ± 0.713
MetPro: 0.868 ± 0.466
MetGln: 1.446 ± 0.798
MetArg: 2.025 ± 0.77
MetSer: 1.736 ± 0.974
MetThr: 0.289 ± 0.485
MetVal: 1.736 ± 0.598
MetTrp: 0.0 ± 0.0
MetTyr: 0.579 ± 0.756
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.314 ± 0.958
AsnCys: 0.868 ± 0.843
AsnAsp: 2.893 ± 1.26
AsnGlu: 3.182 ± 2.025
AsnPhe: 3.471 ± 1.692
AsnGly: 2.314 ± 0.887
AsnHis: 0.579 ± 0.319
AsnIle: 4.628 ± 3.11
AsnLys: 4.339 ± 1.21
AsnLeu: 6.364 ± 1.973
AsnMet: 1.157 ± 0.609
AsnAsn: 1.736 ± 0.933
AsnPro: 1.736 ± 0.679
AsnGln: 3.76 ± 4.548
AsnArg: 1.446 ± 0.96
AsnSer: 4.628 ± 1.804
AsnThr: 4.05 ± 0.494
AsnVal: 3.76 ± 2.436
AsnTrp: 1.157 ± 1.013
AsnTyr: 2.025 ± 1.117
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.157 ± 0.609
ProCys: 0.0 ± 0.0
ProAsp: 2.314 ± 1.16
ProGlu: 4.05 ± 0.72
ProPhe: 3.182 ± 1.457
ProGly: 1.446 ± 0.541
ProHis: 0.868 ± 0.479
ProIle: 3.182 ± 0.704
ProLys: 2.603 ± 1.436
ProLeu: 2.025 ± 0.88
ProMet: 1.446 ± 0.454
ProAsn: 1.446 ± 0.729
ProPro: 3.182 ± 1.171
ProGln: 2.603 ± 0.625
ProArg: 2.314 ± 0.746
ProSer: 2.314 ± 0.887
ProThr: 5.207 ± 2.337
ProVal: 2.893 ± 1.303
ProTrp: 0.289 ± 0.588
ProTyr: 2.314 ± 0.825
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.157 ± 0.638
GlnCys: 0.0 ± 0.0
GlnAsp: 2.603 ± 1.904
GlnGlu: 2.025 ± 0.777
GlnPhe: 2.025 ± 1.448
GlnGly: 2.603 ± 0.625
GlnHis: 1.736 ± 1.363
GlnIle: 2.603 ± 0.625
GlnLys: 3.182 ± 1.123
GlnLeu: 4.918 ± 1.369
GlnMet: 1.157 ± 0.644
GlnAsn: 3.182 ± 1.588
GlnPro: 1.736 ± 0.637
GlnGln: 1.736 ± 0.637
GlnArg: 3.182 ± 0.617
GlnSer: 2.893 ± 2.081
GlnThr: 3.471 ± 1.196
GlnVal: 2.314 ± 0.614
GlnTrp: 0.289 ± 0.772
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.471 ± 0.608
ArgCys: 0.289 ± 0.16
ArgAsp: 2.314 ± 0.746
ArgGlu: 2.893 ± 1.596
ArgPhe: 2.025 ± 1.031
ArgGly: 2.025 ± 1.117
ArgHis: 0.579 ± 0.756
ArgIle: 1.736 ± 0.626
ArgLys: 3.471 ± 0.823
ArgLeu: 2.314 ± 0.518
ArgMet: 1.736 ± 0.974
ArgAsn: 2.603 ± 2.339
ArgPro: 2.314 ± 0.676
ArgGln: 1.736 ± 0.974
ArgArg: 1.446 ± 0.744
ArgSer: 2.603 ± 1.058
ArgThr: 2.314 ± 0.746
ArgVal: 4.339 ± 1.598
ArgTrp: 0.579 ± 0.756
ArgTyr: 1.446 ± 0.96
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.496 ± 2.248
SerCys: 1.157 ± 0.638
SerAsp: 3.76 ± 1.005
SerGlu: 3.182 ± 1.292
SerPhe: 3.76 ± 0.885
SerGly: 4.918 ± 1.56
SerHis: 1.736 ± 0.958
SerIle: 4.628 ± 2.317
SerLys: 5.496 ± 1.373
SerLeu: 3.182 ± 1.588
SerMet: 1.157 ± 0.638
SerAsn: 2.025 ± 0.932
SerPro: 3.471 ± 1.032
SerGln: 1.736 ± 0.637
SerArg: 5.207 ± 1.327
SerSer: 5.785 ± 2.246
SerThr: 3.471 ± 0.484
SerVal: 3.76 ± 0.599
SerTrp: 0.868 ± 0.479
SerTyr: 2.314 ± 0.606
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.339 ± 1.73
ThrCys: 0.868 ± 0.479
ThrAsp: 2.025 ± 1.117
ThrGlu: 2.025 ± 0.756
ThrPhe: 4.05 ± 1.045
ThrGly: 2.603 ± 0.775
ThrHis: 1.157 ± 0.43
ThrIle: 8.967 ± 2.431
ThrLys: 5.207 ± 1.257
ThrLeu: 7.232 ± 1.415
ThrMet: 2.603 ± 0.66
ThrAsn: 4.339 ± 1.038
ThrPro: 3.76 ± 1.235
ThrGln: 2.314 ± 1.217
ThrArg: 1.446 ± 0.541
ThrSer: 4.918 ± 0.801
ThrThr: 9.546 ± 2.231
ThrVal: 4.339 ± 1.396
ThrTrp: 0.289 ± 0.16
ThrTyr: 1.446 ± 0.781
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.05 ± 1.088
ValCys: 0.579 ± 0.686
ValAsp: 4.05 ± 1.308
ValGlu: 4.339 ± 0.653
ValPhe: 2.603 ± 1.009
ValGly: 3.76 ± 1.676
ValHis: 1.736 ± 0.42
ValIle: 2.025 ± 0.777
ValLys: 3.76 ± 0.457
ValLeu: 6.942 ± 1.287
ValMet: 1.446 ± 0.798
ValAsn: 3.182 ± 1.355
ValPro: 3.182 ± 1.326
ValGln: 2.025 ± 0.444
ValArg: 2.893 ± 2.838
ValSer: 4.918 ± 1.699
ValThr: 6.942 ± 1.759
ValVal: 2.603 ± 0.625
ValTrp: 0.0 ± 0.0
ValTyr: 3.471 ± 0.998
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.157 ± 0.479
TrpCys: 0.0 ± 0.0
TrpAsp: 0.289 ± 0.588
TrpGlu: 0.579 ± 0.319
TrpPhe: 0.289 ± 0.16
TrpGly: 0.0 ± 0.0
TrpHis: 0.289 ± 0.824
TrpIle: 0.289 ± 0.485
TrpLys: 0.289 ± 0.16
TrpLeu: 0.868 ± 0.466
TrpMet: 0.0 ± 0.0
TrpAsn: 0.289 ± 0.16
TrpPro: 0.0 ± 0.0
TrpGln: 0.868 ± 0.466
TrpArg: 0.289 ± 0.16
TrpSer: 0.868 ± 0.901
TrpThr: 0.579 ± 0.409
TrpVal: 0.868 ± 0.466
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.868 ± 0.479
TyrCys: 0.579 ± 0.409
TyrAsp: 4.05 ± 0.446
TyrGlu: 2.603 ± 1.399
TyrPhe: 0.579 ± 0.686
TyrGly: 1.446 ± 0.63
TyrHis: 1.157 ± 0.638
TyrIle: 1.736 ± 0.637
TyrLys: 2.025 ± 0.932
TyrLeu: 3.182 ± 1.24
TyrMet: 0.868 ± 0.587
TyrAsn: 4.339 ± 2.243
TyrPro: 1.446 ± 0.96
TyrGln: 1.736 ± 0.679
TyrArg: 1.446 ± 0.744
TyrSer: 2.603 ± 0.949
TyrThr: 2.025 ± 0.444
TyrVal: 1.736 ± 0.689
TyrTrp: 0.579 ± 0.506
TyrTyr: 2.025 ± 1.031
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3458 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
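A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration and not the database's own code: it assumes that whole proteins are resampled with replacement, that the dipeptide frequency is recomputed for each of the 100 resamples, and that the reported error is the standard deviation of those estimates (the note does not say which spread measure is used). The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of a dipeptide frequency (per mille) over protein resamples."""
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation of the estimates.
    return statistics.pstdev(estimates)


# Toy proteome (hypothetical sequences).
print(bootstrap_error(["AAAA", "CCCC", "ACAC"], "AA"))
```

Resampling whole proteins rather than individual residues keeps each protein's composition intact, which is one plausible reading of "at the protein level".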

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski