Amino acid dipepetide frequency for Influenza A virus (A/Netherlands/219/2003(H7N7))

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.762AlaAla: 3.762 ± 1.096
0.941AlaCys: 0.941 ± 0.525
2.822AlaAsp: 2.822 ± 0.903
3.997AlaGlu: 3.997 ± 0.756
2.116AlaPhe: 2.116 ± 0.795
3.997AlaGly: 3.997 ± 1.115
0.705AlaHis: 0.705 ± 0.496
4.467AlaIle: 4.467 ± 0.834
2.351AlaLys: 2.351 ± 0.713
6.348AlaLeu: 6.348 ± 0.855
3.292AlaMet: 3.292 ± 0.81
2.822AlaAsn: 2.822 ± 0.769
2.116AlaPro: 2.116 ± 0.538
1.881AlaGln: 1.881 ± 0.49
3.997AlaArg: 3.997 ± 0.508
5.643AlaSer: 5.643 ± 1.422
5.173AlaThr: 5.173 ± 0.756
3.057AlaVal: 3.057 ± 0.623
0.705AlaTrp: 0.705 ± 0.406
1.176AlaTyr: 1.176 ± 0.35
0.0AlaXaa: 0.0 ± 0.0
Cys
0.705CysAla: 0.705 ± 0.337
0.235CysCys: 0.235 ± 0.206
0.705CysAsp: 0.705 ± 0.488
0.941CysGlu: 0.941 ± 0.345
1.411CysPhe: 1.411 ± 0.545
0.47CysGly: 0.47 ± 0.303
1.176CysHis: 1.176 ± 0.466
0.941CysIle: 0.941 ± 0.348
1.176CysLys: 1.176 ± 0.401
1.411CysLeu: 1.411 ± 0.429
0.941CysMet: 0.941 ± 0.36
1.176CysAsn: 1.176 ± 0.427
0.235CysPro: 0.235 ± 0.236
0.0CysGln: 0.0 ± 0.0
1.176CysArg: 1.176 ± 0.598
1.411CysSer: 1.411 ± 0.451
1.176CysThr: 1.176 ± 0.518
1.646CysVal: 1.646 ± 0.66
0.235CysTrp: 0.235 ± 0.194
0.941CysTyr: 0.941 ± 0.606
0.0CysXaa: 0.0 ± 0.0
Asp
2.822AspAla: 2.822 ± 0.498
1.881AspCys: 1.881 ± 0.428
1.411AspAsp: 1.411 ± 0.461
3.057AspGlu: 3.057 ± 0.762
2.351AspPhe: 2.351 ± 0.734
3.057AspGly: 3.057 ± 0.775
0.705AspHis: 0.705 ± 0.307
1.176AspIle: 1.176 ± 0.565
2.116AspLys: 2.116 ± 0.448
3.527AspLeu: 3.527 ± 0.539
1.881AspMet: 1.881 ± 0.497
3.762AspAsn: 3.762 ± 1.193
3.997AspPro: 3.997 ± 0.797
2.116AspGln: 2.116 ± 0.804
2.351AspArg: 2.351 ± 0.424
2.351AspSer: 2.351 ± 0.604
1.646AspThr: 1.646 ± 0.481
3.292AspVal: 3.292 ± 0.55
0.47AspTrp: 0.47 ± 0.32
1.411AspTyr: 1.411 ± 0.439
0.0AspXaa: 0.0 ± 0.0
Glu
2.822GluAla: 2.822 ± 0.549
0.941GluCys: 0.941 ± 0.748
4.467GluAsp: 4.467 ± 0.939
7.524GluGlu: 7.524 ± 1.003
2.116GluPhe: 2.116 ± 0.517
4.703GluGly: 4.703 ± 1.287
0.705GluHis: 0.705 ± 0.37
4.938GluIle: 4.938 ± 0.758
5.173GluLys: 5.173 ± 1.442
5.408GluLeu: 5.408 ± 0.804
2.822GluMet: 2.822 ± 0.497
3.997GluAsn: 3.997 ± 0.981
2.351GluPro: 2.351 ± 1.083
3.057GluGln: 3.057 ± 0.938
5.408GluArg: 5.408 ± 1.331
6.348GluSer: 6.348 ± 1.322
3.762GluThr: 3.762 ± 0.768
5.173GluVal: 5.173 ± 1.384
0.705GluTrp: 0.705 ± 0.414
1.176GluTyr: 1.176 ± 0.365
0.0GluXaa: 0.0 ± 0.0
Phe
2.116PheAla: 2.116 ± 0.513
0.0PheCys: 0.0 ± 0.0
1.176PheAsp: 1.176 ± 0.516
4.703PheGlu: 4.703 ± 1.008
1.176PhePhe: 1.176 ± 0.493
2.351PheGly: 2.351 ± 0.409
1.176PheHis: 1.176 ± 0.515
2.351PheIle: 2.351 ± 0.851
0.47PheLys: 0.47 ± 0.319
4.232PheLeu: 4.232 ± 0.663
0.705PheMet: 0.705 ± 0.322
2.116PheAsn: 2.116 ± 0.641
1.411PhePro: 1.411 ± 0.504
2.351PheGln: 2.351 ± 0.622
1.881PheArg: 1.881 ± 0.283
3.762PheSer: 3.762 ± 0.489
2.822PheThr: 2.822 ± 0.432
2.822PheVal: 2.822 ± 0.79
0.235PheTrp: 0.235 ± 0.249
1.176PheTyr: 1.176 ± 0.393
0.0PheXaa: 0.0 ± 0.0
Gly
3.292GlyAla: 3.292 ± 0.934
0.705GlyCys: 0.705 ± 0.327
3.292GlyAsp: 3.292 ± 0.476
3.762GlyGlu: 3.762 ± 1.459
3.527GlyPhe: 3.527 ± 0.718
3.292GlyGly: 3.292 ± 0.841
0.941GlyHis: 0.941 ± 0.358
5.173GlyIle: 5.173 ± 0.691
3.997GlyLys: 3.997 ± 0.986
5.173GlyLeu: 5.173 ± 1.101
1.881GlyMet: 1.881 ± 0.444
2.822GlyAsn: 2.822 ± 0.777
3.057GlyPro: 3.057 ± 0.674
2.116GlyGln: 2.116 ± 0.52
5.173GlyArg: 5.173 ± 1.195
5.173GlySer: 5.173 ± 1.62
7.054GlyThr: 7.054 ± 1.096
4.703GlyVal: 4.703 ± 0.583
1.176GlyTrp: 1.176 ± 0.6
1.881GlyTyr: 1.881 ± 0.533
0.0GlyXaa: 0.0 ± 0.0
His
0.705HisAla: 0.705 ± 0.273
0.235HisCys: 0.235 ± 0.192
0.47HisAsp: 0.47 ± 0.481
1.411HisGlu: 1.411 ± 0.381
0.941HisPhe: 0.941 ± 0.369
0.941HisGly: 0.941 ± 0.471
0.47HisHis: 0.47 ± 0.473
1.411HisIle: 1.411 ± 0.77
1.176HisLys: 1.176 ± 0.477
1.176HisLeu: 1.176 ± 0.456
0.235HisMet: 0.235 ± 0.194
0.47HisAsn: 0.47 ± 0.481
0.705HisPro: 0.705 ± 0.367
0.941HisGln: 0.941 ± 0.292
0.941HisArg: 0.941 ± 0.534
1.881HisSer: 1.881 ± 0.647
0.705HisThr: 0.705 ± 0.354
0.235HisVal: 0.235 ± 0.235
0.235HisTrp: 0.235 ± 0.236
0.235HisTyr: 0.235 ± 0.194
0.0HisXaa: 0.0 ± 0.0
Ile
4.467IleAla: 4.467 ± 0.891
2.351IleCys: 2.351 ± 0.752
4.467IleAsp: 4.467 ± 1.402
6.819IleGlu: 6.819 ± 2.149
1.411IlePhe: 1.411 ± 0.281
3.997IleGly: 3.997 ± 0.802
0.705IleHis: 0.705 ± 0.327
3.762IleIle: 3.762 ± 0.982
3.762IleLys: 3.762 ± 1.102
6.113IleLeu: 6.113 ± 1.457
1.646IleMet: 1.646 ± 0.363
3.527IleAsn: 3.527 ± 0.555
2.351IlePro: 2.351 ± 0.555
2.586IleGln: 2.586 ± 0.562
5.408IleArg: 5.408 ± 1.264
2.351IleSer: 2.351 ± 0.559
3.527IleThr: 3.527 ± 0.584
3.292IleVal: 3.292 ± 0.697
0.705IleTrp: 0.705 ± 0.521
0.705IleTyr: 0.705 ± 0.367
0.0IleXaa: 0.0 ± 0.0
Lys
3.762LysAla: 3.762 ± 1.04
1.176LysCys: 1.176 ± 0.477
2.822LysAsp: 2.822 ± 0.297
4.703LysGlu: 4.703 ± 1.07
1.411LysPhe: 1.411 ± 0.532
3.527LysGly: 3.527 ± 0.911
0.705LysHis: 0.705 ± 0.328
3.057LysIle: 3.057 ± 0.772
3.057LysLys: 3.057 ± 1.689
4.938LysLeu: 4.938 ± 1.157
2.586LysMet: 2.586 ± 0.766
2.116LysAsn: 2.116 ± 0.81
0.47LysPro: 0.47 ± 0.384
1.881LysGln: 1.881 ± 0.644
5.173LysArg: 5.173 ± 1.491
3.762LysSer: 3.762 ± 0.478
3.997LysThr: 3.997 ± 1.153
2.586LysVal: 2.586 ± 0.686
1.881LysTrp: 1.881 ± 0.449
1.646LysTyr: 1.646 ± 0.24
0.0LysXaa: 0.0 ± 0.0
Leu
5.173LeuAla: 5.173 ± 0.846
0.941LeuCys: 0.941 ± 0.49
0.941LeuAsp: 0.941 ± 0.54
5.643LeuGlu: 5.643 ± 1.221
1.881LeuPhe: 1.881 ± 0.498
3.997LeuGly: 3.997 ± 0.706
0.941LeuHis: 0.941 ± 0.489
7.759LeuIle: 7.759 ± 1.157
5.643LeuLys: 5.643 ± 1.215
7.054LeuLeu: 7.054 ± 1.424
1.881LeuMet: 1.881 ± 0.507
4.467LeuAsn: 4.467 ± 1.104
3.762LeuPro: 3.762 ± 0.843
2.586LeuGln: 2.586 ± 0.63
6.584LeuArg: 6.584 ± 1.193
4.938LeuSer: 4.938 ± 0.736
5.878LeuThr: 5.878 ± 1.562
3.762LeuVal: 3.762 ± 0.968
1.176LeuTrp: 1.176 ± 0.295
2.822LeuTyr: 2.822 ± 0.964
0.0LeuXaa: 0.0 ± 0.0
Met
3.762MetAla: 3.762 ± 0.657
1.176MetCys: 1.176 ± 0.714
3.057MetAsp: 3.057 ± 1.181
4.703MetGlu: 4.703 ± 1.03
1.176MetPhe: 1.176 ± 0.795
2.351MetGly: 2.351 ± 1.121
0.235MetHis: 0.235 ± 0.194
2.351MetIle: 2.351 ± 0.63
2.822MetLys: 2.822 ± 0.802
1.646MetLeu: 1.646 ± 0.453
1.646MetMet: 1.646 ± 0.643
0.705MetAsn: 0.705 ± 0.488
0.47MetPro: 0.47 ± 0.32
0.941MetGln: 0.941 ± 0.346
2.116MetArg: 2.116 ± 0.628
2.116MetSer: 2.116 ± 0.373
2.586MetThr: 2.586 ± 0.656
3.292MetVal: 3.292 ± 1.167
0.235MetTrp: 0.235 ± 0.194
1.176MetTyr: 1.176 ± 0.489
0.0MetXaa: 0.0 ± 0.0
Asn
4.938AsnAla: 4.938 ± 1.187
0.235AsnCys: 0.235 ± 0.236
2.586AsnAsp: 2.586 ± 0.436
3.762AsnGlu: 3.762 ± 0.991
1.646AsnPhe: 1.646 ± 0.519
4.232AsnGly: 4.232 ± 1.198
0.235AsnHis: 0.235 ± 0.206
2.116AsnIle: 2.116 ± 0.252
3.057AsnLys: 3.057 ± 0.567
3.292AsnLeu: 3.292 ± 0.482
2.822AsnMet: 2.822 ± 0.608
2.586AsnAsn: 2.586 ± 1.092
4.467AsnPro: 4.467 ± 0.707
2.586AsnGln: 2.586 ± 0.671
3.057AsnArg: 3.057 ± 0.739
3.057AsnSer: 3.057 ± 0.903
5.173AsnThr: 5.173 ± 1.005
2.822AsnVal: 2.822 ± 0.95
1.176AsnTrp: 1.176 ± 0.577
0.941AsnTyr: 0.941 ± 0.379
0.0AsnXaa: 0.0 ± 0.0
Pro
2.586ProAla: 2.586 ± 1.068
0.47ProCys: 0.47 ± 0.309
1.411ProAsp: 1.411 ± 0.477
2.586ProGlu: 2.586 ± 0.42
2.116ProPhe: 2.116 ± 0.356
3.057ProGly: 3.057 ± 0.55
0.235ProHis: 0.235 ± 0.192
2.586ProIle: 2.586 ± 0.502
3.057ProLys: 3.057 ± 0.613
3.292ProLeu: 3.292 ± 0.924
0.941ProMet: 0.941 ± 0.561
3.292ProAsn: 3.292 ± 0.875
1.881ProPro: 1.881 ± 0.447
1.176ProGln: 1.176 ± 0.643
2.116ProArg: 2.116 ± 0.736
3.057ProSer: 3.057 ± 0.787
1.881ProThr: 1.881 ± 0.56
1.646ProVal: 1.646 ± 0.573
0.235ProTrp: 0.235 ± 0.192
0.941ProTyr: 0.941 ± 0.432
0.0ProXaa: 0.0 ± 0.0
Gln
2.351GlnAla: 2.351 ± 0.962
0.705GlnCys: 0.705 ± 0.463
1.176GlnAsp: 1.176 ± 0.536
1.881GlnGlu: 1.881 ± 0.706
0.705GlnPhe: 0.705 ± 0.481
3.057GlnGly: 3.057 ± 0.828
0.47GlnHis: 0.47 ± 0.365
3.997GlnIle: 3.997 ± 0.849
2.351GlnLys: 2.351 ± 0.896
2.586GlnLeu: 2.586 ± 0.502
2.822GlnMet: 2.822 ± 1.026
2.822GlnAsn: 2.822 ± 0.706
0.705GlnPro: 0.705 ± 0.431
1.411GlnGln: 1.411 ± 0.356
3.527GlnArg: 3.527 ± 1.177
3.527GlnSer: 3.527 ± 1.003
2.351GlnThr: 2.351 ± 0.915
1.881GlnVal: 1.881 ± 0.616
0.235GlnTrp: 0.235 ± 0.194
0.941GlnTyr: 0.941 ± 0.303
0.0GlnXaa: 0.0 ± 0.0
Arg
4.467ArgAla: 4.467 ± 0.999
0.705ArgCys: 0.705 ± 0.256
2.822ArgAsp: 2.822 ± 0.585
3.057ArgGlu: 3.057 ± 0.966
2.822ArgPhe: 2.822 ± 0.625
7.054ArgGly: 7.054 ± 1.102
0.941ArgHis: 0.941 ± 0.384
4.232ArgIle: 4.232 ± 0.72
2.822ArgLys: 2.822 ± 0.703
4.467ArgLeu: 4.467 ± 0.571
3.762ArgMet: 3.762 ± 1.47
4.703ArgAsn: 4.703 ± 0.936
2.822ArgPro: 2.822 ± 0.651
3.762ArgGln: 3.762 ± 0.599
7.054ArgArg: 7.054 ± 0.947
4.703ArgSer: 4.703 ± 1.183
6.348ArgThr: 6.348 ± 0.943
2.586ArgVal: 2.586 ± 0.833
0.47ArgTrp: 0.47 ± 0.248
2.116ArgTyr: 2.116 ± 0.51
0.0ArgXaa: 0.0 ± 0.0
Ser
3.997SerAla: 3.997 ± 1.228
1.646SerCys: 1.646 ± 0.426
2.822SerAsp: 2.822 ± 0.647
3.762SerGlu: 3.762 ± 0.774
4.703SerPhe: 4.703 ± 0.757
6.819SerGly: 6.819 ± 1.508
1.646SerHis: 1.646 ± 0.726
3.997SerIle: 3.997 ± 0.759
2.586SerLys: 2.586 ± 0.666
5.408SerLeu: 5.408 ± 1.026
2.822SerMet: 2.822 ± 0.847
4.232SerAsn: 4.232 ± 1.158
2.822SerPro: 2.822 ± 0.66
3.762SerGln: 3.762 ± 0.843
4.232SerArg: 4.232 ± 1.009
6.819SerSer: 6.819 ± 1.153
4.467SerThr: 4.467 ± 0.763
3.057SerVal: 3.057 ± 0.916
0.941SerTrp: 0.941 ± 0.56
2.116SerTyr: 2.116 ± 0.658
0.0SerXaa: 0.0 ± 0.0
Thr
4.232ThrAla: 4.232 ± 0.438
1.646ThrCys: 1.646 ± 0.899
3.057ThrAsp: 3.057 ± 0.861
4.703ThrGlu: 4.703 ± 1.25
2.351ThrPhe: 2.351 ± 0.419
5.173ThrGly: 5.173 ± 0.938
1.881ThrHis: 1.881 ± 0.705
5.643ThrIle: 5.643 ± 1.241
4.467ThrLys: 4.467 ± 0.494
4.232ThrLeu: 4.232 ± 0.926
2.351ThrMet: 2.351 ± 0.466
3.057ThrAsn: 3.057 ± 0.791
1.646ThrPro: 1.646 ± 0.745
2.351ThrGln: 2.351 ± 0.792
4.703ThrArg: 4.703 ± 0.906
3.057ThrSer: 3.057 ± 0.735
5.173ThrThr: 5.173 ± 1.559
5.173ThrVal: 5.173 ± 1.282
0.941ThrTrp: 0.941 ± 0.523
2.822ThrTyr: 2.822 ± 0.685
0.0ThrXaa: 0.0 ± 0.0
Val
3.527ValAla: 3.527 ± 0.743
1.881ValCys: 1.881 ± 0.688
3.292ValAsp: 3.292 ± 0.93
3.762ValGlu: 3.762 ± 0.704
2.586ValPhe: 2.586 ± 0.684
3.527ValGly: 3.527 ± 0.864
0.941ValHis: 0.941 ± 0.566
1.646ValIle: 1.646 ± 0.397
3.292ValLys: 3.292 ± 0.853
4.938ValLeu: 4.938 ± 1.656
2.116ValMet: 2.116 ± 0.622
3.527ValAsn: 3.527 ± 0.78
2.351ValPro: 2.351 ± 0.788
2.351ValGln: 2.351 ± 0.927
4.232ValArg: 4.232 ± 1.429
5.408ValSer: 5.408 ± 0.871
2.116ValThr: 2.116 ± 0.552
3.527ValVal: 3.527 ± 0.89
0.705ValTrp: 0.705 ± 0.349
1.411ValTyr: 1.411 ± 0.346
0.0ValXaa: 0.0 ± 0.0
Trp
0.941TrpAla: 0.941 ± 0.543
0.0TrpCys: 0.0 ± 0.0
0.47TrpAsp: 0.47 ± 0.243
1.646TrpGlu: 1.646 ± 0.523
0.941TrpPhe: 0.941 ± 0.337
0.705TrpGly: 0.705 ± 0.273
0.47TrpHis: 0.47 ± 0.347
1.176TrpIle: 1.176 ± 0.437
0.47TrpLys: 0.47 ± 0.384
0.941TrpLeu: 0.941 ± 0.54
0.705TrpMet: 0.705 ± 0.476
0.941TrpAsn: 0.941 ± 0.335
0.235TrpPro: 0.235 ± 0.192
0.235TrpGln: 0.235 ± 0.24
0.705TrpArg: 0.705 ± 0.557
0.941TrpSer: 0.941 ± 0.501
0.941TrpThr: 0.941 ± 0.381
0.47TrpVal: 0.47 ± 0.309
0.705TrpTrp: 0.705 ± 0.304
0.235TrpTyr: 0.235 ± 0.236
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.941TyrAla: 0.941 ± 0.289
0.235TyrCys: 0.235 ± 0.192
2.116TyrAsp: 2.116 ± 0.804
1.176TyrGlu: 1.176 ± 0.47
1.646TyrPhe: 1.646 ± 0.49
1.881TyrGly: 1.881 ± 0.349
0.235TyrHis: 0.235 ± 0.236
1.411TyrIle: 1.411 ± 0.422
1.411TyrLys: 1.411 ± 0.685
1.646TyrLeu: 1.646 ± 0.368
0.47TyrMet: 0.47 ± 0.248
1.646TyrAsn: 1.646 ± 0.521
0.705TyrPro: 0.705 ± 0.419
1.411TyrGln: 1.411 ± 0.423
1.881TyrArg: 1.881 ± 0.974
2.351TyrSer: 2.351 ± 0.363
1.881TyrThr: 1.881 ± 0.752
2.116TyrVal: 2.116 ± 1.175
0.705TyrTrp: 0.705 ± 0.324
0.47TyrTyr: 0.47 ± 0.309
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (4254 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski