Amino acid dipeptide frequency for Beihai hepe-like virus 10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
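The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the Proteome-pI implementation: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to use one-letter codes, pairs are counted only within a protein (not across protein boundaries), and the comparison is against the uniform expectation of 2.5‰ mentioned in the text.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any symbol outside the standard alphabet is treated as Xaa.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation (red/blue in the table)."""
    if freq_per_mille > UNIFORM_EXPECTATION:
        return "over"
    if freq_per_mille < UNIFORM_EXPECTATION:
        return "under"
    return "as expected"


if __name__ == "__main__":
    toy = ["AAAC", "CA"]  # toy sequences, not real proteins
    freqs = dipeptide_per_mille(toy)
    print(freqs["AA"], classify(freqs["AA"]))
```

With real data, the input would be the list of protein sequences of the proteome; the toy sequences here only demonstrate the mechanics.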

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
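As a small worked example of this unit conversion, the following sketch turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the sample value 8.737 is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 percent)."""
    return value * 0.1


if __name__ == "__main__":
    ala_ala = 8.737  # AlaAla frequency in per mille, taken from the table
    print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
    print(per_mille_to_percent(ala_ala))   # same value in percent
    print(per_mille_to_percent(2.5))       # uniform expectation in percent
```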

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.737AlaAla: 8.737 ± 2.266
1.127AlaCys: 1.127 ± 0.618
6.201AlaAsp: 6.201 ± 1.017
3.664AlaGlu: 3.664 ± 1.196
3.1AlaPhe: 3.1 ± 1.285
5.637AlaGly: 5.637 ± 1.923
1.691AlaHis: 1.691 ± 0.618
6.201AlaIle: 6.201 ± 2.12
4.791AlaLys: 4.791 ± 2.211
7.61AlaLeu: 7.61 ± 1.852
1.973AlaMet: 1.973 ± 0.649
1.691AlaAsn: 1.691 ± 0.593
3.946AlaPro: 3.946 ± 0.717
3.382AlaGln: 3.382 ± 1.523
3.382AlaArg: 3.382 ± 1.214
5.919AlaSer: 5.919 ± 1.749
6.483AlaThr: 6.483 ± 2.205
5.355AlaVal: 5.355 ± 1.484
1.127AlaTrp: 1.127 ± 0.405
2.537AlaTyr: 2.537 ± 0.785
0.0AlaXaa: 0.0 ± 0.0
Cys
1.409CysAla: 1.409 ± 0.713
0.564CysCys: 0.564 ± 0.309
1.409CysAsp: 1.409 ± 0.772
1.409CysGlu: 1.409 ± 0.499
0.0CysPhe: 0.0 ± 0.0
1.127CysGly: 1.127 ± 0.738
1.409CysHis: 1.409 ± 0.499
0.846CysIle: 0.846 ± 0.463
0.846CysLys: 0.846 ± 0.525
0.846CysLeu: 0.846 ± 0.547
0.0CysMet: 0.0 ± 0.0
1.409CysAsn: 1.409 ± 0.713
0.564CysPro: 0.564 ± 0.582
0.282CysGln: 0.282 ± 0.154
1.127CysArg: 1.127 ± 0.618
0.846CysSer: 0.846 ± 0.355
0.564CysThr: 0.564 ± 0.369
1.127CysVal: 1.127 ± 0.618
0.564CysTrp: 0.564 ± 0.741
0.282CysTyr: 0.282 ± 0.154
0.0CysXaa: 0.0 ± 0.0
Asp
5.073AspAla: 5.073 ± 1.613
0.564AspCys: 0.564 ± 0.309
3.382AspAsp: 3.382 ± 1.236
4.51AspGlu: 4.51 ± 2.082
4.228AspPhe: 4.228 ± 1.193
1.691AspGly: 1.691 ± 0.633
1.973AspHis: 1.973 ± 0.754
3.382AspIle: 3.382 ± 1.801
4.228AspLys: 4.228 ± 1.744
4.791AspLeu: 4.791 ± 1.458
0.564AspMet: 0.564 ± 0.369
2.255AspAsn: 2.255 ± 0.809
2.255AspPro: 2.255 ± 0.488
2.255AspGln: 2.255 ± 0.488
3.1AspArg: 3.1 ± 0.888
5.355AspSer: 5.355 ± 3.44
3.382AspThr: 3.382 ± 1.264
2.818AspVal: 2.818 ± 1.179
1.127AspTrp: 1.127 ± 0.974
1.691AspTyr: 1.691 ± 0.768
0.0AspXaa: 0.0 ± 0.0
Glu
5.073GluAla: 5.073 ± 1.753
0.564GluCys: 0.564 ± 0.309
2.537GluAsp: 2.537 ± 1.033
14.374GluGlu: 14.374 ± 6.568
2.255GluPhe: 2.255 ± 0.889
5.355GluGly: 5.355 ± 1.983
2.537GluHis: 2.537 ± 1.39
4.228GluIle: 4.228 ± 1.039
2.818GluLys: 2.818 ± 1.166
6.201GluLeu: 6.201 ± 1.678
2.255GluMet: 2.255 ± 0.889
2.818GluAsn: 2.818 ± 0.593
3.664GluPro: 3.664 ± 0.952
5.355GluGln: 5.355 ± 0.795
1.691GluArg: 1.691 ± 0.71
4.51GluSer: 4.51 ± 2.598
1.973GluThr: 1.973 ± 1.081
4.51GluVal: 4.51 ± 1.43
0.564GluTrp: 0.564 ± 0.309
0.846GluTyr: 0.846 ± 0.699
0.0GluXaa: 0.0 ± 0.0
Phe
2.818PheAla: 2.818 ± 1.056
0.846PheCys: 0.846 ± 0.547
2.537PheAsp: 2.537 ± 1.065
3.1PheGlu: 3.1 ± 0.567
2.818PhePhe: 2.818 ± 1.095
3.1PheGly: 3.1 ± 1.086
1.127PheHis: 1.127 ± 0.635
2.537PheIle: 2.537 ± 0.519
3.1PheLys: 3.1 ± 1.347
2.255PheLeu: 2.255 ± 1.36
0.846PheMet: 0.846 ± 1.094
3.1PheAsn: 3.1 ± 0.487
1.691PhePro: 1.691 ± 0.552
3.382PheGln: 3.382 ± 2.681
1.691PheArg: 1.691 ± 0.632
1.973PheSer: 1.973 ± 1.053
1.691PheThr: 1.691 ± 0.86
1.127PheVal: 1.127 ± 0.405
0.564PheTrp: 0.564 ± 0.369
1.973PheTyr: 1.973 ± 1.249
0.0PheXaa: 0.0 ± 0.0
Gly
5.355GlyAla: 5.355 ± 1.378
1.409GlyCys: 1.409 ± 0.713
2.818GlyAsp: 2.818 ± 0.521
5.355GlyGlu: 5.355 ± 0.837
1.127GlyPhe: 1.127 ± 0.451
3.1GlyGly: 3.1 ± 2.282
1.409GlyHis: 1.409 ± 0.772
2.255GlyIle: 2.255 ± 0.809
3.382GlyLys: 3.382 ± 1.236
4.228GlyLeu: 4.228 ± 0.681
1.973GlyMet: 1.973 ± 0.75
2.255GlyAsn: 2.255 ± 1.785
2.537GlyPro: 2.537 ± 1.11
1.691GlyGln: 1.691 ± 0.684
2.537GlyArg: 2.537 ± 0.544
4.51GlySer: 4.51 ± 1.833
5.919GlyThr: 5.919 ± 1.876
5.073GlyVal: 5.073 ± 0.907
0.282GlyTrp: 0.282 ± 0.154
1.691GlyTyr: 1.691 ± 0.552
0.0GlyXaa: 0.0 ± 0.0
His
1.691HisAla: 1.691 ± 1.094
1.127HisCys: 1.127 ± 0.618
1.691HisAsp: 1.691 ± 0.926
1.973HisGlu: 1.973 ± 0.388
1.691HisPhe: 1.691 ± 0.71
1.409HisGly: 1.409 ± 0.603
3.1HisHis: 3.1 ± 1.275
2.818HisIle: 2.818 ± 0.591
2.255HisLys: 2.255 ± 1.235
2.537HisLeu: 2.537 ± 1.058
0.846HisMet: 0.846 ± 0.463
1.127HisAsn: 1.127 ± 0.618
1.691HisPro: 1.691 ± 0.741
0.564HisGln: 0.564 ± 0.309
1.127HisArg: 1.127 ± 0.618
1.691HisSer: 1.691 ± 0.648
1.973HisThr: 1.973 ± 0.754
1.409HisVal: 1.409 ± 0.994
0.0HisTrp: 0.0 ± 0.0
1.409HisTyr: 1.409 ± 0.343
0.0HisXaa: 0.0 ± 0.0
Ile
5.919IleAla: 5.919 ± 2.035
0.282IleCys: 0.282 ± 0.154
2.537IleAsp: 2.537 ± 0.975
4.791IleGlu: 4.791 ± 1.052
2.537IlePhe: 2.537 ± 1.098
4.51IleGly: 4.51 ± 1.0
0.846IleHis: 0.846 ± 0.421
2.818IleIle: 2.818 ± 0.591
4.228IleLys: 4.228 ± 0.498
4.791IleLeu: 4.791 ± 1.958
0.282IleMet: 0.282 ± 0.812
2.818IleAsn: 2.818 ± 0.878
4.791IlePro: 4.791 ± 0.909
1.691IleGln: 1.691 ± 0.842
1.409IleArg: 1.409 ± 0.527
2.537IleSer: 2.537 ± 1.136
4.228IleThr: 4.228 ± 2.311
3.946IleVal: 3.946 ± 1.56
0.282IleTrp: 0.282 ± 0.154
3.946IleTyr: 3.946 ± 0.839
0.0IleXaa: 0.0 ± 0.0
Lys
3.664LysAla: 3.664 ± 1.297
1.409LysCys: 1.409 ± 0.708
4.51LysAsp: 4.51 ± 0.869
5.073LysGlu: 5.073 ± 1.928
1.409LysPhe: 1.409 ± 1.237
3.664LysGly: 3.664 ± 0.704
1.973LysHis: 1.973 ± 0.388
4.51LysIle: 4.51 ± 0.879
3.382LysLys: 3.382 ± 1.46
5.073LysLeu: 5.073 ± 1.4
1.127LysMet: 1.127 ± 0.618
1.127LysAsn: 1.127 ± 0.451
3.664LysPro: 3.664 ± 0.704
3.382LysGln: 3.382 ± 0.617
1.409LysArg: 1.409 ± 0.499
2.255LysSer: 2.255 ± 0.649
5.355LysThr: 5.355 ± 1.599
2.537LysVal: 2.537 ± 0.737
1.127LysTrp: 1.127 ± 0.416
2.537LysTyr: 2.537 ± 0.785
0.0LysXaa: 0.0 ± 0.0
Leu
6.201LeuAla: 6.201 ± 2.507
0.564LeuCys: 0.564 ± 0.741
5.637LeuAsp: 5.637 ± 2.201
5.073LeuGlu: 5.073 ± 0.985
4.228LeuPhe: 4.228 ± 2.778
5.637LeuGly: 5.637 ± 0.643
1.691LeuHis: 1.691 ± 0.74
3.946LeuIle: 3.946 ± 1.871
6.764LeuLys: 6.764 ± 1.056
2.537LeuLeu: 2.537 ± 0.872
1.973LeuMet: 1.973 ± 0.803
4.228LeuAsn: 4.228 ± 1.448
3.664LeuPro: 3.664 ± 2.173
2.255LeuGln: 2.255 ± 1.057
5.073LeuArg: 5.073 ± 2.29
4.791LeuSer: 4.791 ± 1.92
5.355LeuThr: 5.355 ± 1.167
3.946LeuVal: 3.946 ± 0.734
0.846LeuTrp: 0.846 ± 1.547
1.691LeuTyr: 1.691 ± 0.593
0.0LeuXaa: 0.0 ± 0.0
Met
1.973MetAla: 1.973 ± 0.638
0.282MetCys: 0.282 ± 0.44
1.973MetAsp: 1.973 ± 1.081
0.846MetGlu: 0.846 ± 0.463
1.691MetPhe: 1.691 ± 0.618
0.564MetGly: 0.564 ± 0.369
1.409MetHis: 1.409 ± 0.772
1.409MetIle: 1.409 ± 0.853
1.127MetLys: 1.127 ± 0.618
0.846MetLeu: 0.846 ± 0.355
1.127MetMet: 1.127 ± 0.618
0.282MetAsn: 0.282 ± 0.154
0.846MetPro: 0.846 ± 0.463
0.846MetGln: 0.846 ± 0.463
1.691MetArg: 1.691 ± 0.974
1.409MetSer: 1.409 ± 1.086
0.846MetThr: 0.846 ± 0.421
0.564MetVal: 0.564 ± 0.309
0.282MetTrp: 0.282 ± 0.652
1.127MetTyr: 1.127 ± 0.738
0.0MetXaa: 0.0 ± 0.0
Asn
2.818AsnAla: 2.818 ± 1.166
0.846AsnCys: 0.846 ± 0.699
1.973AsnAsp: 1.973 ± 0.575
1.409AsnGlu: 1.409 ± 0.499
3.1AsnPhe: 3.1 ± 1.345
2.255AsnGly: 2.255 ± 1.109
0.846AsnHis: 0.846 ± 0.463
2.818AsnIle: 2.818 ± 0.998
2.255AsnLys: 2.255 ± 1.375
3.664AsnLeu: 3.664 ± 1.76
0.846AsnMet: 0.846 ± 0.355
2.255AsnAsn: 2.255 ± 0.902
2.818AsnPro: 2.818 ± 0.998
1.127AsnGln: 1.127 ± 0.738
1.409AsnArg: 1.409 ± 0.772
3.664AsnSer: 3.664 ± 1.388
2.255AsnThr: 2.255 ± 0.673
3.1AsnVal: 3.1 ± 0.63
0.564AsnTrp: 0.564 ± 0.369
3.664AsnTyr: 3.664 ± 1.184
0.0AsnXaa: 0.0 ± 0.0
Pro
3.664ProAla: 3.664 ± 1.089
0.282ProCys: 0.282 ± 0.154
3.664ProAsp: 3.664 ± 2.202
2.537ProGlu: 2.537 ± 0.691
1.127ProPhe: 1.127 ± 0.635
3.946ProGly: 3.946 ± 0.814
1.409ProHis: 1.409 ± 0.603
2.255ProIle: 2.255 ± 0.432
4.791ProLys: 4.791 ± 1.768
4.51ProLeu: 4.51 ± 1.255
0.564ProMet: 0.564 ± 0.309
1.973ProAsn: 1.973 ± 2.032
1.127ProPro: 1.127 ± 0.738
1.691ProGln: 1.691 ± 0.74
3.1ProArg: 3.1 ± 0.985
3.382ProSer: 3.382 ± 3.475
3.946ProThr: 3.946 ± 0.874
3.1ProVal: 3.1 ± 0.801
0.564ProTrp: 0.564 ± 0.309
3.946ProTyr: 3.946 ± 0.695
0.0ProXaa: 0.0 ± 0.0
Gln
4.228GlnAla: 4.228 ± 1.039
0.564GlnCys: 0.564 ± 0.369
1.973GlnAsp: 1.973 ± 0.754
3.382GlnGlu: 3.382 ± 1.477
1.127GlnPhe: 1.127 ± 0.635
2.255GlnGly: 2.255 ± 0.649
0.846GlnHis: 0.846 ± 0.829
1.973GlnIle: 1.973 ± 1.501
2.255GlnLys: 2.255 ± 0.887
4.791GlnLeu: 4.791 ± 2.639
0.846GlnMet: 0.846 ± 1.285
1.973GlnAsn: 1.973 ± 0.711
0.846GlnPro: 0.846 ± 1.551
3.664GlnGln: 3.664 ± 1.57
1.127GlnArg: 1.127 ± 0.618
1.973GlnSer: 1.973 ± 0.754
3.664GlnThr: 3.664 ± 1.522
2.537GlnVal: 2.537 ± 1.099
0.846GlnTrp: 0.846 ± 0.952
0.564GlnTyr: 0.564 ± 0.369
0.0GlnXaa: 0.0 ± 0.0
Arg
3.1ArgAla: 3.1 ± 0.888
1.127ArgCys: 1.127 ± 0.618
3.1ArgAsp: 3.1 ± 1.313
2.818ArgGlu: 2.818 ± 1.179
1.973ArgPhe: 1.973 ± 1.368
1.691ArgGly: 1.691 ± 0.926
1.409ArgHis: 1.409 ± 0.772
3.382ArgIle: 3.382 ± 0.848
1.127ArgLys: 1.127 ± 0.451
2.255ArgLeu: 2.255 ± 1.783
1.127ArgMet: 1.127 ± 0.618
3.1ArgAsn: 3.1 ± 1.409
2.537ArgPro: 2.537 ± 1.033
2.255ArgGln: 2.255 ± 0.887
1.409ArgArg: 1.409 ± 1.215
3.664ArgSer: 3.664 ± 1.808
3.946ArgThr: 3.946 ± 1.329
1.691ArgVal: 1.691 ± 0.974
0.282ArgTrp: 0.282 ± 0.154
1.409ArgTyr: 1.409 ± 0.772
0.0ArgXaa: 0.0 ± 0.0
Ser
7.328SerAla: 7.328 ± 2.781
1.409SerCys: 1.409 ± 0.527
2.818SerAsp: 2.818 ± 1.325
2.818SerGlu: 2.818 ± 1.34
3.946SerPhe: 3.946 ± 1.58
2.255SerGly: 2.255 ± 0.63
1.691SerHis: 1.691 ± 0.901
2.818SerIle: 2.818 ± 0.701
1.691SerLys: 1.691 ± 1.094
5.637SerLeu: 5.637 ± 0.785
0.846SerMet: 0.846 ± 0.952
1.973SerAsn: 1.973 ± 0.795
4.791SerPro: 4.791 ± 3.261
1.409SerGln: 1.409 ± 0.708
2.537SerArg: 2.537 ± 0.982
4.51SerSer: 4.51 ± 2.297
5.355SerThr: 5.355 ± 2.989
3.946SerVal: 3.946 ± 0.793
0.846SerTrp: 0.846 ± 0.724
2.537SerTyr: 2.537 ± 1.614
0.0SerXaa: 0.0 ± 0.0
Thr
7.328ThrAla: 7.328 ± 1.121
1.691ThrCys: 1.691 ± 0.974
4.228ThrAsp: 4.228 ± 1.755
4.51ThrGlu: 4.51 ± 0.868
3.1ThrPhe: 3.1 ± 0.672
3.946ThrGly: 3.946 ± 2.345
2.818ThrHis: 2.818 ± 1.054
4.228ThrIle: 4.228 ± 0.806
4.228ThrLys: 4.228 ± 1.93
5.355ThrLeu: 5.355 ± 2.038
1.973ThrMet: 1.973 ± 0.575
3.1ThrAsn: 3.1 ± 1.14
5.355ThrPro: 5.355 ± 0.709
1.973ThrGln: 1.973 ± 0.711
3.1ThrArg: 3.1 ± 1.305
2.537ThrSer: 2.537 ± 0.611
6.201ThrThr: 6.201 ± 1.873
3.664ThrVal: 3.664 ± 0.982
0.282ThrTrp: 0.282 ± 0.154
1.973ThrTyr: 1.973 ± 0.795
0.0ThrXaa: 0.0 ± 0.0
Val
5.637ValAla: 5.637 ± 1.23
1.409ValCys: 1.409 ± 0.772
3.1ValAsp: 3.1 ± 0.901
3.1ValGlu: 3.1 ± 1.243
0.846ValPhe: 0.846 ± 0.777
3.664ValGly: 3.664 ± 1.434
2.537ValHis: 2.537 ± 0.895
3.382ValIle: 3.382 ± 0.648
3.1ValLys: 3.1 ± 0.511
3.664ValLeu: 3.664 ± 1.037
0.564ValMet: 0.564 ± 0.345
1.973ValAsn: 1.973 ± 0.75
4.51ValPro: 4.51 ± 1.057
2.255ValGln: 2.255 ± 1.532
3.664ValArg: 3.664 ± 1.383
3.664ValSer: 3.664 ± 2.472
3.664ValThr: 3.664 ± 1.314
1.691ValVal: 1.691 ± 1.398
0.0ValTrp: 0.0 ± 0.0
2.255ValTyr: 2.255 ± 0.889
0.0ValXaa: 0.0 ± 0.0
Trp
1.127TrpAla: 1.127 ± 0.555
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.282TrpGlu: 0.282 ± 0.44
0.846TrpPhe: 0.846 ± 0.355
0.564TrpGly: 0.564 ± 0.741
0.564TrpHis: 0.564 ± 0.309
0.846TrpIle: 0.846 ± 1.295
0.564TrpLys: 0.564 ± 0.369
1.409TrpLeu: 1.409 ± 0.708
0.0TrpMet: 0.0 ± 0.0
0.846TrpAsn: 0.846 ± 0.699
0.0TrpPro: 0.0 ± 0.0
0.564TrpGln: 0.564 ± 0.653
0.564TrpArg: 0.564 ± 0.309
0.846TrpSer: 0.846 ± 0.932
0.846TrpThr: 0.846 ± 0.421
0.564TrpVal: 0.564 ± 0.309
0.0TrpTrp: 0.0 ± 0.0
0.282TrpTyr: 0.282 ± 0.154
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.691TyrAla: 1.691 ± 0.74
0.564TyrCys: 0.564 ± 0.369
2.537TyrAsp: 2.537 ± 1.285
3.1TyrGlu: 3.1 ± 0.801
1.127TyrPhe: 1.127 ± 0.618
2.537TyrGly: 2.537 ± 0.762
0.846TyrHis: 0.846 ± 0.547
2.537TyrIle: 2.537 ± 1.098
2.255TyrLys: 2.255 ± 0.775
3.1TyrLeu: 3.1 ± 2.045
1.127TyrMet: 1.127 ± 0.59
3.382TyrAsn: 3.382 ± 0.82
0.282TyrPro: 0.282 ± 0.154
1.409TyrGln: 1.409 ± 0.858
2.255TyrArg: 2.255 ± 1.235
1.127TyrSer: 1.127 ± 0.405
3.946TyrThr: 3.946 ± 1.568
1.973TyrVal: 1.973 ± 0.75
0.564TyrTrp: 0.564 ± 0.309
2.537TyrTyr: 2.537 ± 1.201
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3549 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
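The page does not spell out the bootstrap procedure, so the following is only one plausible reading of "bootstrapping (×100) at the protein level": whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The function names, the use of the population standard deviation, and the fixed seed are all assumptions made for this sketch.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over adjacent pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy = ["AAAA", "CCCC", "AACC"]  # toy sequences, not real proteins
    print(dipeptide_freq(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

With only 5 proteins in this proteome, resampling at the protein level gives few distinct combinations, which is consistent with the relatively large errors shown for many dipeptides.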

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski