Amino acid dipepetide frequency for Influenza A virus (A/chicken/Gansu/2/99(H9N2))

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.788AlaAla: 3.788 ± 1.249
0.947AlaCys: 0.947 ± 0.5
2.841AlaAsp: 2.841 ± 0.478
3.551AlaGlu: 3.551 ± 0.837
1.894AlaPhe: 1.894 ± 0.771
4.025AlaGly: 4.025 ± 0.925
0.947AlaHis: 0.947 ± 0.414
4.025AlaIle: 4.025 ± 0.968
2.367AlaLys: 2.367 ± 0.576
5.682AlaLeu: 5.682 ± 1.305
3.078AlaMet: 3.078 ± 0.759
2.604AlaAsn: 2.604 ± 0.472
2.841AlaPro: 2.841 ± 0.616
1.657AlaGln: 1.657 ± 0.416
2.841AlaArg: 2.841 ± 0.674
4.498AlaSer: 4.498 ± 1.317
5.682AlaThr: 5.682 ± 0.733
2.604AlaVal: 2.604 ± 0.634
1.184AlaTrp: 1.184 ± 0.563
1.42AlaTyr: 1.42 ± 0.437
0.0AlaXaa: 0.0 ± 0.0
Cys
0.947CysAla: 0.947 ± 0.441
0.237CysCys: 0.237 ± 0.227
0.473CysAsp: 0.473 ± 0.352
0.71CysGlu: 0.71 ± 0.364
1.184CysPhe: 1.184 ± 0.457
0.473CysGly: 0.473 ± 0.549
0.947CysHis: 0.947 ± 0.287
1.657CysIle: 1.657 ± 0.863
1.184CysLys: 1.184 ± 0.373
1.184CysLeu: 1.184 ± 0.467
0.947CysMet: 0.947 ± 0.307
1.42CysAsn: 1.42 ± 0.448
0.473CysPro: 0.473 ± 0.342
0.473CysGln: 0.473 ± 0.342
1.184CysArg: 1.184 ± 0.548
1.657CysSer: 1.657 ± 0.758
0.71CysThr: 0.71 ± 0.386
1.657CysVal: 1.657 ± 0.758
0.237CysTrp: 0.237 ± 0.206
0.71CysTyr: 0.71 ± 0.59
0.0CysXaa: 0.0 ± 0.0
Asp
3.078AspAla: 3.078 ± 0.514
0.947AspCys: 0.947 ± 0.336
1.894AspAsp: 1.894 ± 0.51
4.025AspGlu: 4.025 ± 1.328
1.894AspPhe: 1.894 ± 0.833
2.604AspGly: 2.604 ± 1.095
0.473AspHis: 0.473 ± 0.244
2.131AspIle: 2.131 ± 0.653
1.894AspLys: 1.894 ± 0.617
2.604AspLeu: 2.604 ± 0.582
1.657AspMet: 1.657 ± 0.527
2.841AspAsn: 2.841 ± 0.757
3.551AspPro: 3.551 ± 0.983
2.367AspGln: 2.367 ± 1.064
2.131AspArg: 2.131 ± 0.269
3.314AspSer: 3.314 ± 0.818
2.841AspThr: 2.841 ± 1.013
2.841AspVal: 2.841 ± 0.811
0.473AspTrp: 0.473 ± 0.321
1.657AspTyr: 1.657 ± 0.399
0.0AspXaa: 0.0 ± 0.0
Glu
2.367GluAla: 2.367 ± 0.78
1.184GluCys: 1.184 ± 0.744
4.498GluAsp: 4.498 ± 0.941
6.155GluGlu: 6.155 ± 1.242
2.131GluPhe: 2.131 ± 0.728
5.919GluGly: 5.919 ± 1.406
0.71GluHis: 0.71 ± 0.384
4.261GluIle: 4.261 ± 0.593
4.735GluLys: 4.735 ± 1.364
6.392GluLeu: 6.392 ± 0.656
2.367GluMet: 2.367 ± 0.743
4.261GluAsn: 4.261 ± 1.049
2.841GluPro: 2.841 ± 1.234
2.841GluGln: 2.841 ± 1.1
5.445GluArg: 5.445 ± 1.455
5.445GluSer: 5.445 ± 1.047
3.551GluThr: 3.551 ± 0.356
4.972GluVal: 4.972 ± 1.216
0.947GluTrp: 0.947 ± 0.422
1.42GluTyr: 1.42 ± 0.3
0.0GluXaa: 0.0 ± 0.0
Phe
2.367PheAla: 2.367 ± 0.613
0.237PheCys: 0.237 ± 0.274
1.184PheAsp: 1.184 ± 0.453
4.498PheGlu: 4.498 ± 1.075
1.184PhePhe: 1.184 ± 0.467
1.894PheGly: 1.894 ± 0.427
0.947PheHis: 0.947 ± 0.353
1.657PheIle: 1.657 ± 0.472
0.947PheLys: 0.947 ± 0.342
3.788PheLeu: 3.788 ± 0.686
0.71PheMet: 0.71 ± 0.314
1.657PheAsn: 1.657 ± 0.622
0.947PhePro: 0.947 ± 0.405
2.367PheGln: 2.367 ± 0.771
1.657PheArg: 1.657 ± 0.346
4.025PheSer: 4.025 ± 0.605
2.367PheThr: 2.367 ± 0.616
2.367PheVal: 2.367 ± 0.931
0.473PheTrp: 0.473 ± 0.303
1.184PheTyr: 1.184 ± 0.484
0.0PheXaa: 0.0 ± 0.0
Gly
2.367GlyAla: 2.367 ± 0.733
0.473GlyCys: 0.473 ± 0.238
3.314GlyAsp: 3.314 ± 0.438
3.314GlyGlu: 3.314 ± 1.605
3.314GlyPhe: 3.314 ± 0.478
4.025GlyGly: 4.025 ± 1.016
0.947GlyHis: 0.947 ± 0.491
4.972GlyIle: 4.972 ± 0.746
4.735GlyLys: 4.735 ± 1.022
5.445GlyLeu: 5.445 ± 1.357
2.367GlyMet: 2.367 ± 0.427
3.314GlyAsn: 3.314 ± 1.011
2.841GlyPro: 2.841 ± 0.768
1.894GlyGln: 1.894 ± 0.361
5.682GlyArg: 5.682 ± 0.912
4.261GlySer: 4.261 ± 1.482
6.392GlyThr: 6.392 ± 1.348
4.498GlyVal: 4.498 ± 0.373
1.184GlyTrp: 1.184 ± 0.626
2.131GlyTyr: 2.131 ± 0.684
0.0GlyXaa: 0.0 ± 0.0
His
0.71HisAla: 0.71 ± 0.274
0.237HisCys: 0.237 ± 0.2
0.71HisAsp: 0.71 ± 0.59
1.657HisGlu: 1.657 ± 0.43
1.184HisPhe: 1.184 ± 0.373
1.184HisGly: 1.184 ± 0.443
0.237HisHis: 0.237 ± 0.253
1.657HisIle: 1.657 ± 0.527
1.42HisLys: 1.42 ± 0.457
1.42HisLeu: 1.42 ± 0.511
0.237HisMet: 0.237 ± 0.206
0.473HisAsn: 0.473 ± 0.384
0.71HisPro: 0.71 ± 0.404
0.473HisGln: 0.473 ± 0.247
1.42HisArg: 1.42 ± 0.564
1.42HisSer: 1.42 ± 0.416
0.71HisThr: 0.71 ± 0.373
0.71HisVal: 0.71 ± 0.564
0.0HisTrp: 0.0 ± 0.0
0.237HisTyr: 0.237 ± 0.206
0.0HisXaa: 0.0 ± 0.0
Ile
4.025IleAla: 4.025 ± 0.654
2.131IleCys: 2.131 ± 0.387
3.314IleAsp: 3.314 ± 0.896
7.576IleGlu: 7.576 ± 1.932
1.42IlePhe: 1.42 ± 0.348
4.261IleGly: 4.261 ± 0.497
0.473IleHis: 0.473 ± 0.28
4.025IleIle: 4.025 ± 1.26
4.025IleLys: 4.025 ± 1.332
7.102IleLeu: 7.102 ± 1.679
1.657IleMet: 1.657 ± 0.269
4.025IleAsn: 4.025 ± 1.243
2.604IlePro: 2.604 ± 0.386
2.131IleGln: 2.131 ± 0.429
5.445IleArg: 5.445 ± 1.199
1.894IleSer: 1.894 ± 0.324
3.314IleThr: 3.314 ± 0.748
4.025IleVal: 4.025 ± 0.996
1.42IleTrp: 1.42 ± 0.771
1.42IleTyr: 1.42 ± 0.693
0.0IleXaa: 0.0 ± 0.0
Lys
4.735LysAla: 4.735 ± 0.553
1.184LysCys: 1.184 ± 0.376
2.604LysAsp: 2.604 ± 0.544
5.208LysGlu: 5.208 ± 1.11
0.947LysPhe: 0.947 ± 0.392
3.314LysGly: 3.314 ± 0.58
0.947LysHis: 0.947 ± 0.35
4.498LysIle: 4.498 ± 0.875
3.314LysLys: 3.314 ± 1.562
4.261LysLeu: 4.261 ± 1.286
2.841LysMet: 2.841 ± 0.791
2.841LysAsn: 2.841 ± 1.578
1.42LysPro: 1.42 ± 0.503
1.657LysGln: 1.657 ± 0.538
4.261LysArg: 4.261 ± 1.229
2.841LysSer: 2.841 ± 0.674
4.025LysThr: 4.025 ± 1.34
2.604LysVal: 2.604 ± 0.67
1.894LysTrp: 1.894 ± 0.649
1.894LysTyr: 1.894 ± 0.536
0.0LysXaa: 0.0 ± 0.0
Leu
4.972LeuAla: 4.972 ± 1.134
0.947LeuCys: 0.947 ± 0.486
1.657LeuAsp: 1.657 ± 0.85
5.682LeuGlu: 5.682 ± 1.191
2.604LeuPhe: 2.604 ± 0.723
4.972LeuGly: 4.972 ± 1.035
1.184LeuHis: 1.184 ± 0.556
6.866LeuIle: 6.866 ± 1.318
6.392LeuLys: 6.392 ± 1.544
6.392LeuLeu: 6.392 ± 1.533
2.367LeuMet: 2.367 ± 0.467
4.025LeuAsn: 4.025 ± 1.175
3.551LeuPro: 3.551 ± 0.77
3.078LeuGln: 3.078 ± 0.67
5.682LeuArg: 5.682 ± 1.79
5.208LeuSer: 5.208 ± 0.795
5.682LeuThr: 5.682 ± 1.308
4.498LeuVal: 4.498 ± 0.914
0.947LeuTrp: 0.947 ± 0.336
3.551LeuTyr: 3.551 ± 0.923
0.0LeuXaa: 0.0 ± 0.0
Met
3.788MetAla: 3.788 ± 0.586
0.947MetCys: 0.947 ± 0.685
2.367MetAsp: 2.367 ± 0.796
4.261MetGlu: 4.261 ± 1.043
0.947MetPhe: 0.947 ± 0.653
1.894MetGly: 1.894 ± 0.711
0.237MetHis: 0.237 ± 0.206
2.604MetIle: 2.604 ± 0.62
2.131MetLys: 2.131 ± 1.071
1.894MetLeu: 1.894 ± 0.372
1.657MetMet: 1.657 ± 0.598
1.184MetAsn: 1.184 ± 0.588
0.473MetPro: 0.473 ± 0.353
1.184MetGln: 1.184 ± 0.343
1.894MetArg: 1.894 ± 0.667
1.894MetSer: 1.894 ± 0.436
2.367MetThr: 2.367 ± 0.744
3.078MetVal: 3.078 ± 1.033
0.71MetTrp: 0.71 ± 0.274
0.71MetTyr: 0.71 ± 0.268
0.0MetXaa: 0.0 ± 0.0
Asn
3.788AsnAla: 3.788 ± 0.976
0.947AsnCys: 0.947 ± 0.685
2.841AsnAsp: 2.841 ± 0.478
3.314AsnGlu: 3.314 ± 1.249
1.894AsnPhe: 1.894 ± 0.471
4.261AsnGly: 4.261 ± 1.078
0.473AsnHis: 0.473 ± 0.265
2.131AsnIle: 2.131 ± 0.923
2.841AsnLys: 2.841 ± 0.622
3.551AsnLeu: 3.551 ± 0.807
2.604AsnMet: 2.604 ± 0.653
3.788AsnAsn: 3.788 ± 1.48
4.261AsnPro: 4.261 ± 0.803
2.131AsnGln: 2.131 ± 0.656
4.735AsnArg: 4.735 ± 1.062
3.078AsnSer: 3.078 ± 0.637
3.551AsnThr: 3.551 ± 0.813
3.078AsnVal: 3.078 ± 1.056
1.184AsnTrp: 1.184 ± 0.82
0.71AsnTyr: 0.71 ± 0.44
0.0AsnXaa: 0.0 ± 0.0
Pro
2.367ProAla: 2.367 ± 0.92
0.237ProCys: 0.237 ± 0.2
1.657ProAsp: 1.657 ± 0.363
2.604ProGlu: 2.604 ± 0.711
2.367ProPhe: 2.367 ± 0.46
2.604ProGly: 2.604 ± 0.405
0.71ProHis: 0.71 ± 0.398
2.604ProIle: 2.604 ± 0.402
3.314ProLys: 3.314 ± 1.055
4.261ProLeu: 4.261 ± 0.812
0.947ProMet: 0.947 ± 0.624
2.604ProAsn: 2.604 ± 0.694
1.42ProPro: 1.42 ± 0.41
0.947ProGln: 0.947 ± 0.536
2.367ProArg: 2.367 ± 0.678
3.551ProSer: 3.551 ± 0.697
1.42ProThr: 1.42 ± 0.529
2.131ProVal: 2.131 ± 0.596
0.473ProTrp: 0.473 ± 0.238
0.947ProTyr: 0.947 ± 0.707
0.0ProXaa: 0.0 ± 0.0
Gln
2.131GlnAla: 2.131 ± 1.027
0.947GlnCys: 0.947 ± 0.441
1.42GlnAsp: 1.42 ± 0.527
1.42GlnGlu: 1.42 ± 0.556
0.473GlnPhe: 0.473 ± 0.298
2.604GlnGly: 2.604 ± 0.831
0.71GlnHis: 0.71 ± 0.386
3.551GlnIle: 3.551 ± 0.623
2.841GlnLys: 2.841 ± 0.943
2.604GlnLeu: 2.604 ± 0.834
2.604GlnMet: 2.604 ± 1.059
2.131GlnAsn: 2.131 ± 0.608
0.947GlnPro: 0.947 ± 0.505
0.71GlnGln: 0.71 ± 0.268
3.314GlnArg: 3.314 ± 1.024
2.841GlnSer: 2.841 ± 1.058
2.604GlnThr: 2.604 ± 0.956
2.131GlnVal: 2.131 ± 0.739
0.473GlnTrp: 0.473 ± 0.413
0.947GlnTyr: 0.947 ± 0.371
0.0GlnXaa: 0.0 ± 0.0
Arg
4.025ArgAla: 4.025 ± 0.962
1.184ArgCys: 1.184 ± 0.544
2.604ArgAsp: 2.604 ± 0.782
2.841ArgGlu: 2.841 ± 0.935
2.604ArgPhe: 2.604 ± 0.863
6.392ArgGly: 6.392 ± 1.084
0.71ArgHis: 0.71 ± 0.328
4.735ArgIle: 4.735 ± 0.922
2.131ArgLys: 2.131 ± 0.581
4.498ArgLeu: 4.498 ± 0.492
3.314ArgMet: 3.314 ± 1.65
4.735ArgAsn: 4.735 ± 1.014
2.367ArgPro: 2.367 ± 0.629
3.078ArgGln: 3.078 ± 0.511
5.919ArgArg: 5.919 ± 1.18
5.445ArgSer: 5.445 ± 1.401
6.155ArgThr: 6.155 ± 0.779
3.314ArgVal: 3.314 ± 0.795
0.237ArgTrp: 0.237 ± 0.192
1.894ArgTyr: 1.894 ± 0.564
0.0ArgXaa: 0.0 ± 0.0
Ser
3.788SerAla: 3.788 ± 1.056
2.367SerCys: 2.367 ± 0.795
2.131SerAsp: 2.131 ± 0.667
3.551SerGlu: 3.551 ± 0.654
3.314SerPhe: 3.314 ± 0.675
6.155SerGly: 6.155 ± 1.77
2.131SerHis: 2.131 ± 0.865
5.445SerIle: 5.445 ± 1.337
3.788SerLys: 3.788 ± 0.973
6.392SerLeu: 6.392 ± 1.577
2.131SerMet: 2.131 ± 0.815
4.261SerAsn: 4.261 ± 1.753
2.841SerPro: 2.841 ± 0.864
3.314SerGln: 3.314 ± 0.831
3.078SerArg: 3.078 ± 0.641
8.049SerSer: 8.049 ± 1.646
4.261SerThr: 4.261 ± 0.926
3.551SerVal: 3.551 ± 0.932
1.42SerTrp: 1.42 ± 0.635
2.367SerTyr: 2.367 ± 0.82
0.0SerXaa: 0.0 ± 0.0
Thr
4.025ThrAla: 4.025 ± 0.594
0.947ThrCys: 0.947 ± 0.428
3.314ThrAsp: 3.314 ± 0.865
4.498ThrGlu: 4.498 ± 1.133
2.604ThrPhe: 2.604 ± 0.337
4.025ThrGly: 4.025 ± 0.956
2.131ThrHis: 2.131 ± 0.535
5.682ThrIle: 5.682 ± 1.301
4.261ThrLys: 4.261 ± 1.145
5.208ThrLeu: 5.208 ± 1.077
1.894ThrMet: 1.894 ± 0.367
3.078ThrAsn: 3.078 ± 0.609
1.184ThrPro: 1.184 ± 0.373
2.604ThrGln: 2.604 ± 0.806
4.498ThrArg: 4.498 ± 0.858
4.025ThrSer: 4.025 ± 0.757
4.972ThrThr: 4.972 ± 1.779
4.972ThrVal: 4.972 ± 1.262
0.237ThrTrp: 0.237 ± 0.206
3.078ThrTyr: 3.078 ± 0.579
0.0ThrXaa: 0.0 ± 0.0
Val
3.314ValAla: 3.314 ± 0.703
2.131ValCys: 2.131 ± 1.28
3.788ValAsp: 3.788 ± 0.978
4.261ValGlu: 4.261 ± 0.803
2.604ValPhe: 2.604 ± 0.596
4.025ValGly: 4.025 ± 0.454
1.184ValHis: 1.184 ± 0.556
1.42ValIle: 1.42 ± 0.349
2.367ValLys: 2.367 ± 0.731
5.208ValLeu: 5.208 ± 1.395
1.42ValMet: 1.42 ± 0.528
3.551ValAsn: 3.551 ± 0.856
2.367ValPro: 2.367 ± 0.604
2.841ValGln: 2.841 ± 0.897
4.498ValArg: 4.498 ± 1.382
5.919ValSer: 5.919 ± 0.845
3.314ValThr: 3.314 ± 0.474
3.078ValVal: 3.078 ± 0.813
0.473ValTrp: 0.473 ± 0.32
1.184ValTyr: 1.184 ± 0.381
0.0ValXaa: 0.0 ± 0.0
Trp
1.184TrpAla: 1.184 ± 0.47
0.0TrpCys: 0.0 ± 0.0
0.473TrpAsp: 0.473 ± 0.257
1.42TrpGlu: 1.42 ± 0.608
0.473TrpPhe: 0.473 ± 0.257
0.71TrpGly: 0.71 ± 0.274
0.473TrpHis: 0.473 ± 0.363
0.473TrpIle: 0.473 ± 0.28
0.71TrpLys: 0.71 ± 0.398
0.947TrpLeu: 0.947 ± 0.447
0.71TrpMet: 0.71 ± 0.325
0.947TrpAsn: 0.947 ± 0.34
0.473TrpPro: 0.473 ± 0.278
0.0TrpGln: 0.0 ± 0.0
0.71TrpArg: 0.71 ± 0.537
2.131TrpSer: 2.131 ± 1.189
1.184TrpThr: 1.184 ± 0.643
1.184TrpVal: 1.184 ± 0.357
0.71TrpTrp: 0.71 ± 0.325
0.473TrpTyr: 0.473 ± 0.384
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.71TyrAla: 0.71 ± 0.255
0.237TyrCys: 0.237 ± 0.2
2.131TyrAsp: 2.131 ± 0.663
1.894TyrGlu: 1.894 ± 0.724
1.42TyrPhe: 1.42 ± 0.318
2.131TyrGly: 2.131 ± 0.464
0.237TyrHis: 0.237 ± 0.192
1.894TyrIle: 1.894 ± 0.263
1.894TyrLys: 1.894 ± 0.3
1.42TyrLeu: 1.42 ± 0.603
0.473TyrMet: 0.473 ± 0.247
1.657TyrAsn: 1.657 ± 0.531
1.42TyrPro: 1.42 ± 0.665
1.657TyrGln: 1.657 ± 0.412
1.42TyrArg: 1.42 ± 1.11
2.841TyrSer: 2.841 ± 0.422
2.131TyrThr: 2.131 ± 0.739
1.657TyrVal: 1.657 ± 0.782
0.71TyrTrp: 0.71 ± 0.292
0.473TyrTyr: 0.473 ± 0.238
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (4225 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski