Amino acid dipepetide frequency for Herbert herbevirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.862AlaCys: 0.862 ± 1.548
1.15AlaAsp: 1.15 ± 0.227
1.725AlaGlu: 1.725 ± 0.385
1.437AlaPhe: 1.437 ± 0.435
2.3AlaGly: 2.3 ± 0.813
0.862AlaHis: 0.862 ± 0.643
2.874AlaIle: 2.874 ± 2.226
4.312AlaLys: 4.312 ± 1.391
3.162AlaLeu: 3.162 ± 1.508
0.862AlaMet: 0.862 ± 0.46
2.587AlaAsn: 2.587 ± 1.076
0.862AlaPro: 0.862 ± 0.192
0.862AlaGln: 0.862 ± 0.192
1.437AlaArg: 1.437 ± 0.435
1.725AlaSer: 1.725 ± 0.526
1.437AlaThr: 1.437 ± 0.435
0.575AlaVal: 0.575 ± 0.264
0.575AlaTrp: 0.575 ± 0.306
1.725AlaTyr: 1.725 ± 1.117
0.287AlaXaa: 0.287 ± 0.81
Cys
1.15CysAla: 1.15 ± 1.568
0.575CysCys: 0.575 ± 0.306
1.437CysAsp: 1.437 ± 0.904
0.575CysGlu: 0.575 ± 0.772
1.437CysPhe: 1.437 ± 0.435
2.012CysGly: 2.012 ± 2.184
0.862CysHis: 0.862 ± 0.643
1.725CysIle: 1.725 ± 0.526
1.725CysLys: 1.725 ± 1.286
0.862CysLeu: 0.862 ± 0.643
0.287CysMet: 0.287 ± 0.153
1.437CysAsn: 1.437 ± 1.413
0.862CysPro: 0.862 ± 0.46
0.287CysGln: 0.287 ± 0.153
1.725CysArg: 1.725 ± 0.385
1.437CysSer: 1.437 ± 0.336
0.575CysThr: 0.575 ± 0.264
1.15CysVal: 1.15 ± 0.527
0.287CysTrp: 0.287 ± 0.153
1.15CysTyr: 1.15 ± 1.028
0.0CysXaa: 0.0 ± 0.0
Asp
2.3AspAla: 2.3 ± 1.055
0.862AspCys: 0.862 ± 1.158
4.024AspAsp: 4.024 ± 0.174
2.012AspGlu: 2.012 ± 1.448
4.024AspPhe: 4.024 ± 0.783
0.575AspGly: 0.575 ± 0.772
0.287AspHis: 0.287 ± 0.153
6.611AspIle: 6.611 ± 1.501
4.024AspLys: 4.024 ± 0.783
8.911AspLeu: 8.911 ± 0.848
4.024AspMet: 4.024 ± 1.667
3.449AspAsn: 3.449 ± 1.572
1.725AspPro: 1.725 ± 0.919
2.3AspGln: 2.3 ± 0.761
2.012AspArg: 2.012 ± 0.613
4.599AspSer: 4.599 ± 1.971
3.162AspThr: 3.162 ± 0.807
5.461AspVal: 5.461 ± 2.327
0.862AspTrp: 0.862 ± 0.192
2.874AspTyr: 2.874 ± 0.672
0.0AspXaa: 0.0 ± 0.0
Glu
2.3GluAla: 2.3 ± 0.453
0.575GluCys: 0.575 ± 0.264
2.587GluAsp: 2.587 ± 0.552
1.437GluGlu: 1.437 ± 0.766
6.611GluPhe: 6.611 ± 1.501
1.437GluGly: 1.437 ± 1.679
0.0GluHis: 0.0 ± 0.0
6.899GluIle: 6.899 ± 1.161
4.886GluLys: 4.886 ± 1.67
3.449GluLeu: 3.449 ± 0.23
2.587GluMet: 2.587 ± 0.552
3.449GluAsn: 3.449 ± 2.015
2.012GluPro: 2.012 ± 1.072
3.162GluGln: 3.162 ± 0.803
1.15GluArg: 1.15 ± 0.613
4.312GluSer: 4.312 ± 0.299
4.312GluThr: 4.312 ± 1.009
2.012GluVal: 2.012 ± 0.613
0.287GluTrp: 0.287 ± 0.153
3.162GluTyr: 3.162 ± 0.367
0.287GluXaa: 0.287 ± 0.153
Phe
1.15PheAla: 1.15 ± 0.227
2.3PheCys: 2.3 ± 0.453
4.599PheAsp: 4.599 ± 1.521
4.024PheGlu: 4.024 ± 0.941
2.874PhePhe: 2.874 ± 0.672
1.725PheGly: 1.725 ± 0.385
0.575PheHis: 0.575 ± 0.306
3.449PheIle: 3.449 ± 0.94
7.186PheLys: 7.186 ± 0.553
6.611PheLeu: 6.611 ± 0.574
2.012PheMet: 2.012 ± 0.519
2.587PheAsn: 2.587 ± 0.358
1.15PhePro: 1.15 ± 0.613
2.874PheGln: 2.874 ± 1.06
2.587PheArg: 2.587 ± 1.423
4.886PheSer: 4.886 ± 1.623
2.587PheThr: 2.587 ± 0.358
2.012PheVal: 2.012 ± 1.072
0.287PheTrp: 0.287 ± 0.153
3.162PheTyr: 3.162 ± 1.346
0.0PheXaa: 0.0 ± 0.0
Gly
0.862GlyAla: 0.862 ± 0.46
2.012GlyCys: 2.012 ± 0.694
4.599GlyAsp: 4.599 ± 1.239
3.737GlyGlu: 3.737 ± 2.713
3.162GlyPhe: 3.162 ± 0.351
0.862GlyGly: 0.862 ± 1.548
0.862GlyHis: 0.862 ± 0.643
4.599GlyIle: 4.599 ± 1.078
2.587GlyLys: 2.587 ± 2.163
3.162GlyLeu: 3.162 ± 0.807
1.437GlyMet: 1.437 ± 0.583
1.15GlyAsn: 1.15 ± 0.527
1.725GlyPro: 1.725 ± 1.56
0.862GlyGln: 0.862 ± 1.158
1.15GlyArg: 1.15 ± 0.227
4.024GlySer: 4.024 ± 2.471
2.3GlyThr: 2.3 ± 0.619
1.725GlyVal: 1.725 ± 1.414
0.287GlyTrp: 0.287 ± 0.153
2.3GlyTyr: 2.3 ± 0.539
0.0GlyXaa: 0.0 ± 0.0
His
0.287HisAla: 0.287 ± 0.386
0.287HisCys: 0.287 ± 0.386
0.862HisAsp: 0.862 ± 0.46
1.437HisGlu: 1.437 ± 0.837
0.575HisPhe: 0.575 ± 0.306
0.287HisGly: 0.287 ± 0.153
0.575HisHis: 0.575 ± 0.306
1.15HisIle: 1.15 ± 0.527
0.575HisLys: 0.575 ± 0.264
1.437HisLeu: 1.437 ± 0.837
0.287HisMet: 0.287 ± 0.153
0.862HisAsn: 0.862 ± 0.192
0.287HisPro: 0.287 ± 0.386
0.287HisGln: 0.287 ± 0.153
0.287HisArg: 0.287 ± 0.153
1.725HisSer: 1.725 ± 0.919
0.287HisThr: 0.287 ± 0.153
0.287HisVal: 0.287 ± 0.81
0.0HisTrp: 0.0 ± 0.0
0.575HisTyr: 0.575 ± 0.772
0.0HisXaa: 0.0 ± 0.0
Ile
2.587IleAla: 2.587 ± 1.423
2.587IleCys: 2.587 ± 0.955
6.899IleAsp: 6.899 ± 0.476
7.186IleGlu: 7.186 ± 1.806
3.737IlePhe: 3.737 ± 1.082
4.024IleGly: 4.024 ± 0.378
0.575IleHis: 0.575 ± 0.306
5.749IleIle: 5.749 ± 0.549
9.198IleLys: 9.198 ± 1.385
7.761IleLeu: 7.761 ± 0.649
3.162IleMet: 3.162 ± 1.217
6.611IleAsn: 6.611 ± 0.574
2.012IlePro: 2.012 ± 0.613
4.024IleGln: 4.024 ± 1.422
3.737IleArg: 3.737 ± 0.554
7.761IleSer: 7.761 ± 2.045
3.737IleThr: 3.737 ± 0.135
3.162IleVal: 3.162 ± 2.386
1.437IleTrp: 1.437 ± 0.583
4.886IleTyr: 4.886 ± 0.106
0.0IleXaa: 0.0 ± 0.0
Lys
3.162LysAla: 3.162 ± 1.533
1.725LysCys: 1.725 ± 0.385
4.886LysAsp: 4.886 ± 1.636
5.749LysGlu: 5.749 ± 1.695
4.312LysPhe: 4.312 ± 0.833
5.749LysGly: 5.749 ± 1.457
0.575LysHis: 0.575 ± 0.306
8.623LysIle: 8.623 ± 2.554
7.761LysLys: 7.761 ± 0.777
10.348LysLeu: 10.348 ± 2.0
2.874LysMet: 2.874 ± 1.532
4.024LysAsn: 4.024 ± 0.174
1.725LysPro: 1.725 ± 0.47
1.725LysGln: 1.725 ± 0.698
2.587LysArg: 2.587 ± 0.358
7.473LysSer: 7.473 ± 0.568
8.048LysThr: 8.048 ± 1.587
4.599LysVal: 4.599 ± 0.896
1.15LysTrp: 1.15 ± 0.98
4.886LysTyr: 4.886 ± 0.59
0.287LysXaa: 0.287 ± 0.153
Leu
2.012LeuAla: 2.012 ± 0.566
1.15LeuCys: 1.15 ± 0.527
7.186LeuAsp: 7.186 ± 0.553
7.761LeuGlu: 7.761 ± 1.656
5.174LeuPhe: 5.174 ± 0.74
4.599LeuGly: 4.599 ± 0.09
0.862LeuHis: 0.862 ± 0.192
9.773LeuIle: 9.773 ± 2.859
8.048LeuLys: 8.048 ± 0.756
9.485LeuLeu: 9.485 ± 2.408
2.587LeuMet: 2.587 ± 0.577
6.899LeuAsn: 6.899 ± 1.653
3.162LeuPro: 3.162 ± 1.685
3.162LeuGln: 3.162 ± 0.351
2.587LeuArg: 2.587 ± 1.076
6.611LeuSer: 6.611 ± 1.008
5.461LeuThr: 5.461 ± 0.733
4.886LeuVal: 4.886 ± 0.106
0.287LeuTrp: 0.287 ± 0.386
3.737LeuTyr: 3.737 ± 1.593
0.0LeuXaa: 0.0 ± 0.0
Met
0.862MetAla: 0.862 ± 0.707
0.575MetCys: 0.575 ± 0.902
1.437MetAsp: 1.437 ± 0.766
1.725MetGlu: 1.725 ± 0.919
1.15MetPhe: 1.15 ± 0.527
1.15MetGly: 1.15 ± 0.227
0.575MetHis: 0.575 ± 0.264
3.162MetIle: 3.162 ± 0.61
3.162MetLys: 3.162 ± 0.367
3.449MetLeu: 3.449 ± 0.939
1.437MetMet: 1.437 ± 0.435
2.587MetAsn: 2.587 ± 0.606
0.575MetPro: 0.575 ± 0.264
1.15MetGln: 1.15 ± 0.227
1.725MetArg: 1.725 ± 0.385
3.162MetSer: 3.162 ± 1.211
2.3MetThr: 2.3 ± 0.453
2.012MetVal: 2.012 ± 0.392
0.0MetTrp: 0.0 ± 0.0
0.287MetTyr: 0.287 ± 0.386
0.0MetXaa: 0.0 ± 0.0
Asn
2.3AsnAla: 2.3 ± 0.453
1.725AsnCys: 1.725 ± 0.791
6.036AsnAsp: 6.036 ± 0.654
3.449AsnGlu: 3.449 ± 0.94
3.737AsnPhe: 3.737 ± 0.554
2.587AsnGly: 2.587 ± 1.648
0.0AsnHis: 0.0 ± 0.0
5.749AsnIle: 5.749 ± 1.043
7.473AsnLys: 7.473 ± 1.362
6.899AsnLeu: 6.899 ± 1.653
1.437AsnMet: 1.437 ± 0.766
4.886AsnAsn: 4.886 ± 0.999
1.437AsnPro: 1.437 ± 0.336
1.725AsnGln: 1.725 ± 0.919
1.725AsnArg: 1.725 ± 0.385
4.024AsnSer: 4.024 ± 1.207
3.449AsnThr: 3.449 ± 1.787
3.449AsnVal: 3.449 ± 1.195
0.575AsnTrp: 0.575 ± 0.306
4.024AsnTyr: 4.024 ± 1.227
0.0AsnXaa: 0.0 ± 0.0
Pro
0.575ProAla: 0.575 ± 0.306
0.287ProCys: 0.287 ± 0.153
2.012ProAsp: 2.012 ± 1.4
1.437ProGlu: 1.437 ± 0.336
2.3ProPhe: 2.3 ± 0.453
1.725ProGly: 1.725 ± 0.526
0.575ProHis: 0.575 ± 0.306
3.737ProIle: 3.737 ± 0.135
2.012ProLys: 2.012 ± 0.694
1.15ProLeu: 1.15 ± 0.613
0.862ProMet: 0.862 ± 0.78
1.725ProAsn: 1.725 ± 0.47
0.575ProPro: 0.575 ± 0.306
0.287ProGln: 0.287 ± 0.81
0.575ProArg: 0.575 ± 0.264
2.874ProSer: 2.874 ± 1.532
1.15ProThr: 1.15 ± 0.227
2.012ProVal: 2.012 ± 0.694
0.0ProTrp: 0.0 ± 0.0
0.575ProTyr: 0.575 ± 0.306
0.0ProXaa: 0.0 ± 0.0
Gln
1.437GlnAla: 1.437 ± 0.583
0.287GlnCys: 0.287 ± 0.386
1.437GlnAsp: 1.437 ± 0.904
2.012GlnGlu: 2.012 ± 0.613
1.725GlnPhe: 1.725 ± 0.919
2.587GlnGly: 2.587 ± 0.606
0.862GlnHis: 0.862 ± 0.643
2.3GlnIle: 2.3 ± 1.226
2.3GlnLys: 2.3 ± 0.619
3.162GlnLeu: 3.162 ± 0.351
0.575GlnMet: 0.575 ± 0.502
2.874GlnAsn: 2.874 ± 0.672
0.575GlnPro: 0.575 ± 0.772
0.0GlnGln: 0.0 ± 0.0
1.725GlnArg: 1.725 ± 0.786
2.587GlnSer: 2.587 ± 1.231
2.012GlnThr: 2.012 ± 0.613
3.162GlnVal: 3.162 ± 0.807
0.0GlnTrp: 0.0 ± 0.0
1.15GlnTyr: 1.15 ± 0.227
0.0GlnXaa: 0.0 ± 0.0
Arg
1.15ArgAla: 1.15 ± 0.701
0.575ArgCys: 0.575 ± 0.772
2.587ArgAsp: 2.587 ± 1.231
2.012ArgGlu: 2.012 ± 0.613
1.437ArgPhe: 1.437 ± 0.729
1.725ArgGly: 1.725 ± 0.526
0.575ArgHis: 0.575 ± 0.306
1.437ArgIle: 1.437 ± 0.766
2.3ArgLys: 2.3 ± 0.619
4.599ArgLeu: 4.599 ± 0.966
0.575ArgMet: 0.575 ± 0.772
2.874ArgAsn: 2.874 ± 1.06
0.862ArgPro: 0.862 ± 0.192
1.15ArgGln: 1.15 ± 0.613
0.287ArgArg: 0.287 ± 0.153
3.737ArgSer: 3.737 ± 2.121
1.15ArgThr: 1.15 ± 0.227
1.15ArgVal: 1.15 ± 1.028
0.0ArgTrp: 0.0 ± 0.0
2.587ArgTyr: 2.587 ± 0.91
0.0ArgXaa: 0.0 ± 0.0
Ser
3.162SerAla: 3.162 ± 0.367
2.012SerCys: 2.012 ± 1.166
3.737SerAsp: 3.737 ± 0.554
0.862SerGlu: 0.862 ± 0.192
6.036SerPhe: 6.036 ± 0.654
4.024SerGly: 4.024 ± 0.174
2.3SerHis: 2.3 ± 2.131
8.048SerIle: 8.048 ± 1.307
6.036SerLys: 6.036 ± 1.84
8.336SerLeu: 8.336 ± 2.921
2.587SerMet: 2.587 ± 0.91
7.186SerAsn: 7.186 ± 1.449
1.437SerPro: 1.437 ± 0.336
2.3SerGln: 2.3 ± 0.761
3.162SerArg: 3.162 ± 1.97
6.324SerSer: 6.324 ± 2.423
3.162SerThr: 3.162 ± 0.351
3.737SerVal: 3.737 ± 1.046
0.287SerTrp: 0.287 ± 0.153
4.599SerTyr: 4.599 ± 0.61
0.0SerXaa: 0.0 ± 0.0
Thr
2.587ThrAla: 2.587 ± 0.91
0.862ThrCys: 0.862 ± 0.643
3.162ThrAsp: 3.162 ± 1.217
3.162ThrGlu: 3.162 ± 0.803
3.162ThrPhe: 3.162 ± 1.977
2.874ThrGly: 2.874 ± 1.344
0.575ThrHis: 0.575 ± 0.772
4.312ThrIle: 4.312 ± 0.228
5.461ThrLys: 5.461 ± 0.892
4.599ThrLeu: 4.599 ± 1.134
1.725ThrMet: 1.725 ± 0.791
4.024ThrAsn: 4.024 ± 1.846
2.587ThrPro: 2.587 ± 0.358
1.725ThrGln: 1.725 ± 0.47
0.575ThrArg: 0.575 ± 0.306
2.874ThrSer: 2.874 ± 0.32
3.737ThrThr: 3.737 ± 0.53
3.162ThrVal: 3.162 ± 0.351
0.287ThrTrp: 0.287 ± 0.153
3.449ThrTyr: 3.449 ± 0.769
0.0ThrXaa: 0.0 ± 0.0
Val
1.437ValAla: 1.437 ± 1.444
1.725ValCys: 1.725 ± 1.286
2.3ValAsp: 2.3 ± 0.813
2.587ValGlu: 2.587 ± 0.91
2.874ValPhe: 2.874 ± 2.104
2.012ValGly: 2.012 ± 0.694
0.575ValHis: 0.575 ± 0.306
5.174ValIle: 5.174 ± 2.374
5.749ValLys: 5.749 ± 0.639
4.024ValLeu: 4.024 ± 1.388
0.575ValMet: 0.575 ± 0.765
2.874ValAsn: 2.874 ± 0.988
1.15ValPro: 1.15 ± 0.227
2.3ValGln: 2.3 ± 1.343
1.725ValArg: 1.725 ± 0.385
3.162ValSer: 3.162 ± 0.803
3.162ValThr: 3.162 ± 1.217
2.012ValVal: 2.012 ± 1.166
0.287ValTrp: 0.287 ± 0.386
3.162ValTyr: 3.162 ± 1.936
0.0ValXaa: 0.0 ± 0.0
Trp
0.287TrpAla: 0.287 ± 0.153
0.0TrpCys: 0.0 ± 0.0
0.575TrpAsp: 0.575 ± 0.306
0.287TrpGlu: 0.287 ± 0.81
0.575TrpPhe: 0.575 ± 0.306
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.575TrpIle: 0.575 ± 0.306
0.575TrpLys: 0.575 ± 0.306
0.287TrpLeu: 0.287 ± 0.153
0.287TrpMet: 0.287 ± 0.386
0.575TrpAsn: 0.575 ± 0.264
0.575TrpPro: 0.575 ± 1.62
0.0TrpGln: 0.0 ± 0.0
0.287TrpArg: 0.287 ± 0.386
1.15TrpSer: 1.15 ± 0.227
0.575TrpThr: 0.575 ± 0.264
0.575TrpVal: 0.575 ± 0.264
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.012TyrAla: 2.012 ± 0.965
0.575TyrCys: 0.575 ± 0.264
2.3TyrAsp: 2.3 ± 0.453
3.737TyrGlu: 3.737 ± 0.135
2.3TyrPhe: 2.3 ± 0.619
1.15TyrGly: 1.15 ± 1.028
0.287TyrHis: 0.287 ± 0.153
4.886TyrIle: 4.886 ± 0.963
6.899TyrLys: 6.899 ± 0.715
3.449TyrLeu: 3.449 ± 1.716
2.012TyrMet: 2.012 ± 0.613
3.737TyrAsn: 3.737 ± 1.514
1.15TyrPro: 1.15 ± 0.98
2.587TyrGln: 2.587 ± 0.358
1.725TyrArg: 1.725 ± 0.919
4.886TyrSer: 4.886 ± 0.999
2.3TyrThr: 2.3 ± 2.131
2.012TyrVal: 2.012 ± 0.694
0.287TyrTrp: 0.287 ± 0.81
2.3TyrTyr: 2.3 ± 0.453
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.287XaaAsp: 0.287 ± 0.153
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.287XaaLeu: 0.287 ± 0.153
0.0XaaMet: 0.0 ± 0.0
0.287XaaAsn: 0.287 ± 0.81
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3480 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski