Amino acid dipepetide frequency for Jonchet virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.567AlaAla: 3.567 ± 0.718
0.892AlaCys: 0.892 ± 0.207
3.122AlaAsp: 3.122 ± 0.934
2.899AlaGlu: 2.899 ± 0.924
1.784AlaPhe: 1.784 ± 1.124
4.682AlaGly: 4.682 ± 0.795
1.561AlaHis: 1.561 ± 1.213
4.682AlaIle: 4.682 ± 0.855
5.128AlaLys: 5.128 ± 0.911
5.574AlaLeu: 5.574 ± 0.909
2.899AlaMet: 2.899 ± 1.425
3.122AlaAsn: 3.122 ± 1.224
2.453AlaPro: 2.453 ± 0.519
2.453AlaGln: 2.453 ± 1.06
3.344AlaArg: 3.344 ± 2.266
4.459AlaSer: 4.459 ± 0.753
2.23AlaThr: 2.23 ± 1.14
4.236AlaVal: 4.236 ± 1.444
0.223AlaTrp: 0.223 ± 0.169
2.453AlaTyr: 2.453 ± 0.915
0.0AlaXaa: 0.0 ± 0.0
Cys
1.338CysAla: 1.338 ± 0.482
0.223CysCys: 0.223 ± 0.169
1.338CysAsp: 1.338 ± 0.311
1.115CysGlu: 1.115 ± 0.264
0.669CysPhe: 0.669 ± 0.186
0.669CysGly: 0.669 ± 0.508
0.446CysHis: 0.446 ± 0.339
1.115CysIle: 1.115 ± 0.455
1.784CysLys: 1.784 ± 1.075
2.23CysLeu: 2.23 ± 0.978
0.446CysMet: 0.446 ± 0.289
1.561CysAsn: 1.561 ± 0.426
0.892CysPro: 0.892 ± 1.22
0.223CysGln: 0.223 ± 0.169
0.669CysArg: 0.669 ± 0.241
1.784CysSer: 1.784 ± 0.95
2.007CysThr: 2.007 ± 0.723
1.561CysVal: 1.561 ± 0.79
0.0CysTrp: 0.0 ± 0.0
0.892CysTyr: 0.892 ± 0.207
0.0CysXaa: 0.0 ± 0.0
Asp
4.236AspAla: 4.236 ± 1.5
1.561AspCys: 1.561 ± 0.643
3.122AspAsp: 3.122 ± 1.028
5.351AspGlu: 5.351 ± 0.86
3.567AspPhe: 3.567 ± 0.811
3.344AspGly: 3.344 ± 0.828
0.892AspHis: 0.892 ± 1.141
5.351AspIle: 5.351 ± 1.868
4.236AspLys: 4.236 ± 1.033
3.79AspLeu: 3.79 ± 0.72
1.784AspMet: 1.784 ± 0.418
2.007AspAsn: 2.007 ± 0.525
2.23AspPro: 2.23 ± 0.418
2.676AspGln: 2.676 ± 1.036
1.338AspArg: 1.338 ± 0.392
3.344AspSer: 3.344 ± 0.543
4.236AspThr: 4.236 ± 0.807
4.682AspVal: 4.682 ± 1.178
0.446AspTrp: 0.446 ± 0.289
1.784AspTyr: 1.784 ± 0.444
0.0AspXaa: 0.0 ± 0.0
Glu
4.682GluAla: 4.682 ± 0.778
2.007GluCys: 2.007 ± 1.112
4.236GluAsp: 4.236 ± 0.958
4.682GluGlu: 4.682 ± 1.043
1.561GluPhe: 1.561 ± 1.026
4.013GluGly: 4.013 ± 1.017
1.338GluHis: 1.338 ± 0.371
3.79GluIle: 3.79 ± 1.411
4.682GluLys: 4.682 ± 1.058
5.574GluLeu: 5.574 ± 1.573
2.899GluMet: 2.899 ± 0.571
2.23GluAsn: 2.23 ± 1.08
1.561GluPro: 1.561 ± 0.498
1.115GluGln: 1.115 ± 0.627
2.676GluArg: 2.676 ± 0.664
2.899GluSer: 2.899 ± 0.707
3.344GluThr: 3.344 ± 0.826
3.122GluVal: 3.122 ± 3.437
0.669GluTrp: 0.669 ± 0.241
2.007GluTyr: 2.007 ± 0.525
0.0GluXaa: 0.0 ± 0.0
Phe
1.784PheAla: 1.784 ± 0.632
0.669PheCys: 0.669 ± 0.241
1.784PheAsp: 1.784 ± 1.054
2.676PheGlu: 2.676 ± 0.922
2.23PhePhe: 2.23 ± 0.935
2.453PheGly: 2.453 ± 0.519
0.446PheHis: 0.446 ± 0.104
2.453PheIle: 2.453 ± 0.968
4.013PheLys: 4.013 ± 0.783
2.453PheLeu: 2.453 ± 1.082
0.669PheMet: 0.669 ± 0.244
1.784PheAsn: 1.784 ± 1.124
1.115PhePro: 1.115 ± 0.298
0.446PheGln: 0.446 ± 0.289
2.899PheArg: 2.899 ± 0.963
3.344PheSer: 3.344 ± 1.269
3.567PheThr: 3.567 ± 0.607
1.561PheVal: 1.561 ± 0.426
0.446PheTrp: 0.446 ± 0.104
2.23PheTyr: 2.23 ± 0.336
0.0PheXaa: 0.0 ± 0.0
Gly
2.899GlyAla: 2.899 ± 0.836
0.892GlyCys: 0.892 ± 0.365
4.013GlyAsp: 4.013 ± 1.496
2.676GlyGlu: 2.676 ± 2.235
3.122GlyPhe: 3.122 ± 0.79
2.23GlyGly: 2.23 ± 0.883
1.338GlyHis: 1.338 ± 0.311
4.013GlyIle: 4.013 ± 1.462
4.905GlyLys: 4.905 ± 0.761
5.128GlyLeu: 5.128 ± 1.324
2.23GlyMet: 2.23 ± 0.736
2.23GlyAsn: 2.23 ± 0.519
1.784GlyPro: 1.784 ± 1.124
3.122GlyGln: 3.122 ± 0.502
2.453GlyArg: 2.453 ± 0.628
4.682GlySer: 4.682 ± 1.928
3.79GlyThr: 3.79 ± 0.583
3.567GlyVal: 3.567 ± 0.834
1.338GlyTrp: 1.338 ± 0.371
3.122GlyTyr: 3.122 ± 0.852
0.0GlyXaa: 0.0 ± 0.0
His
1.338HisAla: 1.338 ± 0.596
0.669HisCys: 0.669 ± 0.433
1.784HisAsp: 1.784 ± 0.418
0.669HisGlu: 0.669 ± 0.241
0.446HisPhe: 0.446 ± 0.289
0.446HisGly: 0.446 ± 0.104
0.0HisHis: 0.0 ± 0.0
1.115HisIle: 1.115 ± 0.57
2.453HisLys: 2.453 ± 0.913
1.115HisLeu: 1.115 ± 1.126
0.892HisMet: 0.892 ± 0.404
1.115HisAsn: 1.115 ± 0.455
0.446HisPro: 0.446 ± 0.339
0.446HisGln: 0.446 ± 0.306
0.892HisArg: 0.892 ± 0.207
0.669HisSer: 0.669 ± 0.433
0.223HisThr: 0.223 ± 0.339
1.561HisVal: 1.561 ± 0.355
0.0HisTrp: 0.0 ± 0.0
0.446HisTyr: 0.446 ± 0.381
0.0HisXaa: 0.0 ± 0.0
Ile
2.676IleAla: 2.676 ± 0.414
0.669IleCys: 0.669 ± 0.241
4.013IleAsp: 4.013 ± 0.662
5.574IleGlu: 5.574 ± 0.995
1.561IlePhe: 1.561 ± 0.298
2.899IleGly: 2.899 ± 0.829
0.669IleHis: 0.669 ± 0.241
5.351IleIle: 5.351 ± 1.51
5.797IleLys: 5.797 ± 1.015
3.122IleLeu: 3.122 ± 2.155
1.338IleMet: 1.338 ± 0.651
5.797IleAsn: 5.797 ± 0.908
2.899IlePro: 2.899 ± 0.521
1.338IleGln: 1.338 ± 1.231
4.459IleArg: 4.459 ± 1.003
5.574IleSer: 5.574 ± 1.897
3.79IleThr: 3.79 ± 0.931
5.797IleVal: 5.797 ± 1.862
0.446IleTrp: 0.446 ± 0.104
2.899IleTyr: 2.899 ± 0.665
0.0IleXaa: 0.0 ± 0.0
Lys
5.128LysAla: 5.128 ± 0.714
2.453LysCys: 2.453 ± 1.582
4.905LysAsp: 4.905 ± 0.593
3.122LysGlu: 3.122 ± 0.777
2.23LysPhe: 2.23 ± 0.968
5.797LysGly: 5.797 ± 0.973
0.669LysHis: 0.669 ± 0.433
4.905LysIle: 4.905 ± 0.845
4.682LysLys: 4.682 ± 1.259
7.135LysLeu: 7.135 ± 2.121
3.344LysMet: 3.344 ± 0.928
1.115LysAsn: 1.115 ± 0.264
2.453LysPro: 2.453 ± 0.862
2.453LysGln: 2.453 ± 0.519
3.344LysArg: 3.344 ± 0.791
7.358LysSer: 7.358 ± 1.607
4.682LysThr: 4.682 ± 1.407
6.689LysVal: 6.689 ± 2.098
1.115LysTrp: 1.115 ± 0.264
4.236LysTyr: 4.236 ± 0.807
0.0LysXaa: 0.0 ± 0.0
Leu
6.243LeuAla: 6.243 ± 0.761
0.669LeuCys: 0.669 ± 0.186
5.574LeuAsp: 5.574 ± 0.65
3.567LeuGlu: 3.567 ± 0.934
4.013LeuPhe: 4.013 ± 1.05
3.344LeuGly: 3.344 ± 0.773
1.115LeuHis: 1.115 ± 0.455
4.236LeuIle: 4.236 ± 2.241
5.797LeuLys: 5.797 ± 0.727
6.912LeuLeu: 6.912 ± 1.848
2.23LeuMet: 2.23 ± 0.418
4.459LeuAsn: 4.459 ± 1.044
3.567LeuPro: 3.567 ± 0.833
2.007LeuGln: 2.007 ± 0.351
2.676LeuArg: 2.676 ± 0.967
8.696LeuSer: 8.696 ± 1.309
5.797LeuThr: 5.797 ± 1.793
3.79LeuVal: 3.79 ± 1.277
0.669LeuTrp: 0.669 ± 0.241
2.23LeuTyr: 2.23 ± 0.935
0.0LeuXaa: 0.0 ± 0.0
Met
1.784MetAla: 1.784 ± 0.444
0.892MetCys: 0.892 ± 0.404
2.453MetAsp: 2.453 ± 0.625
2.899MetGlu: 2.899 ± 0.577
0.446MetPhe: 0.446 ± 0.306
2.453MetGly: 2.453 ± 0.501
0.669MetHis: 0.669 ± 0.291
2.899MetIle: 2.899 ± 0.789
3.344MetLys: 3.344 ± 1.339
3.79MetLeu: 3.79 ± 1.072
0.669MetMet: 0.669 ± 0.291
1.338MetAsn: 1.338 ± 0.245
0.892MetPro: 0.892 ± 0.207
1.561MetGln: 1.561 ± 0.355
1.561MetArg: 1.561 ± 1.084
2.007MetSer: 2.007 ± 0.452
2.23MetThr: 2.23 ± 0.477
1.561MetVal: 1.561 ± 0.298
0.0MetTrp: 0.0 ± 0.0
0.892MetTyr: 0.892 ± 0.207
0.0MetXaa: 0.0 ± 0.0
Asn
3.122AsnAla: 3.122 ± 0.531
1.561AsnCys: 1.561 ± 0.643
3.122AsnAsp: 3.122 ± 1.197
2.453AsnGlu: 2.453 ± 1.05
2.23AsnPhe: 2.23 ± 0.492
2.899AsnGly: 2.899 ± 0.872
1.561AsnHis: 1.561 ± 0.355
3.344AsnIle: 3.344 ± 0.922
2.899AsnLys: 2.899 ± 0.454
2.676AsnLeu: 2.676 ± 0.414
2.23AsnMet: 2.23 ± 0.519
3.344AsnAsn: 3.344 ± 0.928
2.23AsnPro: 2.23 ± 0.418
1.338AsnGln: 1.338 ± 0.596
2.007AsnArg: 2.007 ± 0.452
5.351AsnSer: 5.351 ± 0.578
4.459AsnThr: 4.459 ± 0.788
2.676AsnVal: 2.676 ± 0.617
0.223AsnTrp: 0.223 ± 0.169
1.561AsnTyr: 1.561 ± 0.426
0.0AsnXaa: 0.0 ± 0.0
Pro
1.784ProAla: 1.784 ± 0.418
0.669ProCys: 0.669 ± 0.186
1.561ProAsp: 1.561 ± 0.643
0.669ProGlu: 0.669 ± 1.265
1.784ProPhe: 1.784 ± 0.444
3.79ProGly: 3.79 ± 0.901
0.446ProHis: 0.446 ± 0.306
2.007ProIle: 2.007 ± 0.998
2.899ProLys: 2.899 ± 0.665
3.79ProLeu: 3.79 ± 1.231
0.223ProMet: 0.223 ± 0.144
1.561ProAsn: 1.561 ± 0.575
1.115ProPro: 1.115 ± 1.196
0.669ProGln: 0.669 ± 0.291
1.338ProArg: 1.338 ± 0.482
2.676ProSer: 2.676 ± 1.277
2.899ProThr: 2.899 ± 0.726
0.669ProVal: 0.669 ± 0.241
0.446ProTrp: 0.446 ± 0.104
1.561ProTyr: 1.561 ± 0.355
0.0ProXaa: 0.0 ± 0.0
Gln
3.344GlnAla: 3.344 ± 1.581
0.669GlnCys: 0.669 ± 0.337
2.23GlnAsp: 2.23 ± 0.551
2.676GlnGlu: 2.676 ± 0.784
1.561GlnPhe: 1.561 ± 0.293
2.676GlnGly: 2.676 ± 0.742
0.223GlnHis: 0.223 ± 0.144
2.007GlnIle: 2.007 ± 0.538
1.561GlnLys: 1.561 ± 0.293
2.23GlnLeu: 2.23 ± 0.966
0.892GlnMet: 0.892 ± 0.577
1.338GlnAsn: 1.338 ± 1.165
0.892GlnPro: 0.892 ± 0.207
0.669GlnGln: 0.669 ± 0.337
1.784GlnArg: 1.784 ± 1.054
2.007GlnSer: 2.007 ± 1.679
2.676GlnThr: 2.676 ± 0.617
1.784GlnVal: 1.784 ± 0.415
0.0GlnTrp: 0.0 ± 0.0
1.561GlnTyr: 1.561 ± 0.498
0.0GlnXaa: 0.0 ± 0.0
Arg
3.344ArgAla: 3.344 ± 0.793
1.561ArgCys: 1.561 ± 0.426
2.453ArgAsp: 2.453 ± 1.08
2.007ArgGlu: 2.007 ± 0.505
1.338ArgPhe: 1.338 ± 2.56
1.338ArgGly: 1.338 ± 0.311
1.115ArgHis: 1.115 ± 0.275
2.453ArgIle: 2.453 ± 0.778
3.79ArgLys: 3.79 ± 0.851
2.899ArgLeu: 2.899 ± 0.577
2.23ArgMet: 2.23 ± 0.519
3.344ArgAsn: 3.344 ± 0.538
1.115ArgPro: 1.115 ± 0.264
2.453ArgGln: 2.453 ± 1.06
2.23ArgArg: 2.23 ± 0.66
4.905ArgSer: 4.905 ± 0.872
2.453ArgThr: 2.453 ± 0.913
3.567ArgVal: 3.567 ± 0.83
0.446ArgTrp: 0.446 ± 0.104
1.115ArgTyr: 1.115 ± 0.264
0.0ArgXaa: 0.0 ± 0.0
Ser
5.128SerAla: 5.128 ± 1.627
1.115SerCys: 1.115 ± 0.489
3.79SerAsp: 3.79 ± 1.099
4.236SerGlu: 4.236 ± 0.866
3.344SerPhe: 3.344 ± 0.705
5.351SerGly: 5.351 ± 1.82
1.338SerHis: 1.338 ± 0.311
4.905SerIle: 4.905 ± 1.244
6.02SerLys: 6.02 ± 1.113
4.682SerLeu: 4.682 ± 0.88
2.899SerMet: 2.899 ± 0.521
4.013SerAsn: 4.013 ± 0.953
2.007SerPro: 2.007 ± 0.351
4.905SerGln: 4.905 ± 0.92
5.574SerArg: 5.574 ± 0.676
6.243SerSer: 6.243 ± 1.677
6.912SerThr: 6.912 ± 2.418
4.905SerVal: 4.905 ± 3.114
1.338SerTrp: 1.338 ± 0.371
4.013SerTyr: 4.013 ± 1.38
0.0SerXaa: 0.0 ± 0.0
Thr
3.344ThrAla: 3.344 ± 0.793
1.561ThrCys: 1.561 ± 0.643
3.79ThrAsp: 3.79 ± 0.931
3.79ThrGlu: 3.79 ± 1.085
2.676ThrPhe: 2.676 ± 0.549
4.459ThrGly: 4.459 ± 1.766
1.338ThrHis: 1.338 ± 0.371
5.574ThrIle: 5.574 ± 1.013
5.574ThrLys: 5.574 ± 1.04
7.135ThrLeu: 7.135 ± 1.461
2.899ThrMet: 2.899 ± 0.506
2.899ThrAsn: 2.899 ± 0.577
2.453ThrPro: 2.453 ± 0.862
2.23ThrGln: 2.23 ± 1.324
2.676ThrArg: 2.676 ± 0.664
4.459ThrSer: 4.459 ± 1.001
4.905ThrThr: 4.905 ± 1.176
3.79ThrVal: 3.79 ± 0.98
0.446ThrTrp: 0.446 ± 0.104
1.561ThrTyr: 1.561 ± 0.298
0.0ThrXaa: 0.0 ± 0.0
Val
3.567ValAla: 3.567 ± 2.097
0.669ValCys: 0.669 ± 1.204
4.013ValAsp: 4.013 ± 0.804
5.574ValGlu: 5.574 ± 0.443
2.007ValPhe: 2.007 ± 0.986
2.453ValGly: 2.453 ± 0.881
0.669ValHis: 0.669 ± 0.337
4.682ValIle: 4.682 ± 1.278
4.459ValLys: 4.459 ± 1.98
2.899ValLeu: 2.899 ± 0.521
1.561ValMet: 1.561 ± 0.498
4.905ValAsn: 4.905 ± 1.255
1.338ValPro: 1.338 ± 0.378
1.561ValGln: 1.561 ± 0.426
3.122ValArg: 3.122 ± 0.922
7.804ValSer: 7.804 ± 3.932
4.459ValThr: 4.459 ± 1.037
5.797ValVal: 5.797 ± 2.911
0.446ValTrp: 0.446 ± 0.104
2.007ValTyr: 2.007 ± 0.723
0.0ValXaa: 0.0 ± 0.0
Trp
0.669TrpAla: 0.669 ± 0.433
0.669TrpCys: 0.669 ± 0.241
0.892TrpAsp: 0.892 ± 0.207
0.223TrpGlu: 0.223 ± 0.144
0.669TrpPhe: 0.669 ± 0.186
0.446TrpGly: 0.446 ± 0.104
0.223TrpHis: 0.223 ± 0.144
0.0TrpIle: 0.0 ± 0.0
0.223TrpLys: 0.223 ± 0.169
1.338TrpLeu: 1.338 ± 0.311
0.446TrpMet: 0.446 ± 0.289
0.446TrpAsn: 0.446 ± 0.104
0.0TrpPro: 0.0 ± 0.0
0.223TrpGln: 0.223 ± 0.169
0.0TrpArg: 0.0 ± 0.0
0.669TrpSer: 0.669 ± 0.186
0.223TrpThr: 0.223 ± 0.144
0.669TrpVal: 0.669 ± 0.508
0.223TrpTrp: 0.223 ± 0.144
0.892TrpTyr: 0.892 ± 0.207
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.784TyrAla: 1.784 ± 0.444
0.892TyrCys: 0.892 ± 0.207
1.784TyrAsp: 1.784 ± 0.996
2.23TyrGlu: 2.23 ± 0.681
2.007TyrPhe: 2.007 ± 0.452
3.79TyrGly: 3.79 ± 0.98
0.669TyrHis: 0.669 ± 0.186
1.784TyrIle: 1.784 ± 0.444
3.567TyrLys: 3.567 ± 0.762
3.122TyrLeu: 3.122 ± 1.028
1.338TyrMet: 1.338 ± 0.264
2.676TyrAsn: 2.676 ± 0.755
1.115TyrPro: 1.115 ± 0.455
0.892TyrGln: 0.892 ± 0.207
1.115TyrArg: 1.115 ± 0.264
3.79TyrSer: 3.79 ± 0.856
2.676TyrThr: 2.676 ± 0.757
2.007TyrVal: 2.007 ± 0.723
0.223TyrTrp: 0.223 ± 0.144
2.899TyrTyr: 2.899 ± 0.707
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4486 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski