Amino acid dipepetide frequency for Influenza A virus (A/chicken/Netherlands/1/03(H7N7))

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.762AlaAla: 3.762 ± 1.005
0.941AlaCys: 0.941 ± 0.45
2.822AlaAsp: 2.822 ± 0.763
3.997AlaGlu: 3.997 ± 0.778
2.116AlaPhe: 2.116 ± 0.77
3.997AlaGly: 3.997 ± 1.231
0.705AlaHis: 0.705 ± 0.45
4.703AlaIle: 4.703 ± 0.862
2.351AlaLys: 2.351 ± 0.653
6.348AlaLeu: 6.348 ± 0.704
3.292AlaMet: 3.292 ± 0.647
2.822AlaAsn: 2.822 ± 0.64
2.351AlaPro: 2.351 ± 0.435
1.881AlaGln: 1.881 ± 0.454
3.997AlaArg: 3.997 ± 0.448
5.173AlaSer: 5.173 ± 1.3
5.408AlaThr: 5.408 ± 0.679
3.057AlaVal: 3.057 ± 0.67
0.705AlaTrp: 0.705 ± 0.426
1.176AlaTyr: 1.176 ± 0.34
0.0AlaXaa: 0.0 ± 0.0
Cys
0.705CysAla: 0.705 ± 0.276
0.235CysCys: 0.235 ± 0.185
0.705CysAsp: 0.705 ± 0.479
0.941CysGlu: 0.941 ± 0.321
1.411CysPhe: 1.411 ± 0.504
0.47CysGly: 0.47 ± 0.29
1.176CysHis: 1.176 ± 0.419
0.941CysIle: 0.941 ± 0.317
1.176CysLys: 1.176 ± 0.371
1.411CysLeu: 1.411 ± 0.487
0.941CysMet: 0.941 ± 0.29
1.176CysAsn: 1.176 ± 0.379
0.235CysPro: 0.235 ± 0.227
0.0CysGln: 0.0 ± 0.0
1.176CysArg: 1.176 ± 0.555
1.411CysSer: 1.411 ± 0.393
1.176CysThr: 1.176 ± 0.429
1.646CysVal: 1.646 ± 0.559
0.235CysTrp: 0.235 ± 0.201
0.941CysTyr: 0.941 ± 0.581
0.0CysXaa: 0.0 ± 0.0
Asp
2.822AspAla: 2.822 ± 0.429
1.881AspCys: 1.881 ± 0.386
1.411AspAsp: 1.411 ± 0.409
3.057AspGlu: 3.057 ± 0.797
2.351AspPhe: 2.351 ± 0.787
3.057AspGly: 3.057 ± 0.749
0.705AspHis: 0.705 ± 0.273
0.941AspIle: 0.941 ± 0.446
2.116AspLys: 2.116 ± 0.387
3.527AspLeu: 3.527 ± 0.471
1.881AspMet: 1.881 ± 0.467
3.762AspAsn: 3.762 ± 1.057
3.997AspPro: 3.997 ± 0.638
2.116AspGln: 2.116 ± 0.732
2.351AspArg: 2.351 ± 0.481
2.351AspSer: 2.351 ± 0.523
1.646AspThr: 1.646 ± 0.473
3.527AspVal: 3.527 ± 0.654
0.47AspTrp: 0.47 ± 0.279
1.411AspTyr: 1.411 ± 0.487
0.0AspXaa: 0.0 ± 0.0
Glu
2.822GluAla: 2.822 ± 0.428
0.941GluCys: 0.941 ± 0.598
4.467GluAsp: 4.467 ± 0.918
7.524GluGlu: 7.524 ± 0.967
2.116GluPhe: 2.116 ± 0.606
4.703GluGly: 4.703 ± 1.144
0.705GluHis: 0.705 ± 0.294
4.938GluIle: 4.938 ± 0.698
5.408GluLys: 5.408 ± 1.434
5.408GluLeu: 5.408 ± 0.668
2.822GluMet: 2.822 ± 0.508
3.997GluAsn: 3.997 ± 0.912
2.351GluPro: 2.351 ± 1.04
3.292GluGln: 3.292 ± 0.979
5.173GluArg: 5.173 ± 1.085
6.348GluSer: 6.348 ± 1.366
3.762GluThr: 3.762 ± 0.706
5.173GluVal: 5.173 ± 1.117
0.705GluTrp: 0.705 ± 0.373
1.176GluTyr: 1.176 ± 0.345
0.0GluXaa: 0.0 ± 0.0
Phe
2.116PheAla: 2.116 ± 0.455
0.0PheCys: 0.0 ± 0.0
1.176PheAsp: 1.176 ± 0.512
4.703PheGlu: 4.703 ± 1.01
1.176PhePhe: 1.176 ± 0.497
2.351PheGly: 2.351 ± 0.396
1.176PheHis: 1.176 ± 0.46
2.351PheIle: 2.351 ± 0.895
0.47PheLys: 0.47 ± 0.301
4.467PheLeu: 4.467 ± 0.857
0.705PheMet: 0.705 ± 0.317
2.116PheAsn: 2.116 ± 0.711
1.411PhePro: 1.411 ± 0.464
2.351PheGln: 2.351 ± 0.622
1.881PheArg: 1.881 ± 0.292
3.762PheSer: 3.762 ± 0.467
2.822PheThr: 2.822 ± 0.401
2.822PheVal: 2.822 ± 0.7
0.235PheTrp: 0.235 ± 0.227
1.176PheTyr: 1.176 ± 0.4
0.0PheXaa: 0.0 ± 0.0
Gly
3.762GlyAla: 3.762 ± 1.078
0.705GlyCys: 0.705 ± 0.295
3.292GlyAsp: 3.292 ± 0.406
3.762GlyGlu: 3.762 ± 1.32
3.527GlyPhe: 3.527 ± 0.665
3.292GlyGly: 3.292 ± 0.739
0.941GlyHis: 0.941 ± 0.346
5.173GlyIle: 5.173 ± 0.758
3.997GlyLys: 3.997 ± 0.872
5.173GlyLeu: 5.173 ± 1.014
1.881GlyMet: 1.881 ± 0.406
2.822GlyAsn: 2.822 ± 0.745
3.292GlyPro: 3.292 ± 0.6
2.116GlyGln: 2.116 ± 0.465
5.173GlyArg: 5.173 ± 1.068
4.938GlySer: 4.938 ± 1.412
6.819GlyThr: 6.819 ± 0.99
4.467GlyVal: 4.467 ± 0.475
1.176GlyTrp: 1.176 ± 0.554
1.881GlyTyr: 1.881 ± 0.493
0.0GlyXaa: 0.0 ± 0.0
His
0.705HisAla: 0.705 ± 0.264
0.235HisCys: 0.235 ± 0.189
0.47HisAsp: 0.47 ± 0.411
1.411HisGlu: 1.411 ± 0.355
0.941HisPhe: 0.941 ± 0.362
0.941HisGly: 0.941 ± 0.385
0.47HisHis: 0.47 ± 0.454
1.411HisIle: 1.411 ± 0.735
1.176HisLys: 1.176 ± 0.454
1.176HisLeu: 1.176 ± 0.37
0.235HisMet: 0.235 ± 0.201
0.47HisAsn: 0.47 ± 0.411
0.705HisPro: 0.705 ± 0.351
0.941HisGln: 0.941 ± 0.287
0.941HisArg: 0.941 ± 0.451
1.881HisSer: 1.881 ± 0.627
0.705HisThr: 0.705 ± 0.328
0.235HisVal: 0.235 ± 0.242
0.235HisTrp: 0.235 ± 0.227
0.235HisTyr: 0.235 ± 0.201
0.0HisXaa: 0.0 ± 0.0
Ile
4.467IleAla: 4.467 ± 0.909
2.351IleCys: 2.351 ± 0.842
4.467IleAsp: 4.467 ± 1.536
6.819IleGlu: 6.819 ± 2.21
1.176IlePhe: 1.176 ± 0.193
3.997IleGly: 3.997 ± 0.845
0.705IleHis: 0.705 ± 0.295
3.997IleIle: 3.997 ± 0.992
3.292IleLys: 3.292 ± 0.874
5.878IleLeu: 5.878 ± 1.272
1.646IleMet: 1.646 ± 0.304
3.527IleAsn: 3.527 ± 0.58
2.351IlePro: 2.351 ± 0.666
2.822IleGln: 2.822 ± 0.516
5.408IleArg: 5.408 ± 1.227
2.351IleSer: 2.351 ± 0.477
3.527IleThr: 3.527 ± 0.587
3.292IleVal: 3.292 ± 0.764
0.705IleTrp: 0.705 ± 0.462
0.705IleTyr: 0.705 ± 0.336
0.0IleXaa: 0.0 ± 0.0
Lys
3.762LysAla: 3.762 ± 1.067
1.176LysCys: 1.176 ± 0.556
2.822LysAsp: 2.822 ± 0.324
4.703LysGlu: 4.703 ± 1.014
1.411LysPhe: 1.411 ± 0.558
3.527LysGly: 3.527 ± 0.704
0.705LysHis: 0.705 ± 0.284
3.057LysIle: 3.057 ± 0.78
3.057LysLys: 3.057 ± 1.484
4.938LysLeu: 4.938 ± 1.23
2.586LysMet: 2.586 ± 0.679
2.116LysAsn: 2.116 ± 0.643
0.47LysPro: 0.47 ± 0.379
1.881LysGln: 1.881 ± 0.693
5.173LysArg: 5.173 ± 1.291
3.762LysSer: 3.762 ± 0.526
3.997LysThr: 3.997 ± 1.043
2.351LysVal: 2.351 ± 0.534
1.881LysTrp: 1.881 ± 0.505
1.646LysTyr: 1.646 ± 0.23
0.0LysXaa: 0.0 ± 0.0
Leu
5.173LeuAla: 5.173 ± 0.875
0.941LeuCys: 0.941 ± 0.445
0.941LeuAsp: 0.941 ± 0.58
5.643LeuGlu: 5.643 ± 1.248
2.116LeuPhe: 2.116 ± 0.488
3.997LeuGly: 3.997 ± 0.584
0.941LeuHis: 0.941 ± 0.425
7.759LeuIle: 7.759 ± 1.081
5.643LeuLys: 5.643 ± 1.139
6.584LeuLeu: 6.584 ± 1.121
1.881LeuMet: 1.881 ± 0.547
4.467LeuAsn: 4.467 ± 1.009
3.762LeuPro: 3.762 ± 0.81
2.586LeuGln: 2.586 ± 0.614
6.584LeuArg: 6.584 ± 1.21
4.938LeuSer: 4.938 ± 0.728
5.878LeuThr: 5.878 ± 1.628
3.762LeuVal: 3.762 ± 0.925
1.176LeuTrp: 1.176 ± 0.258
2.822LeuTyr: 2.822 ± 0.965
0.0LeuXaa: 0.0 ± 0.0
Met
3.762MetAla: 3.762 ± 0.628
1.176MetCys: 1.176 ± 0.675
3.057MetAsp: 3.057 ± 1.147
4.703MetGlu: 4.703 ± 0.924
1.176MetPhe: 1.176 ± 0.755
2.351MetGly: 2.351 ± 0.92
0.235MetHis: 0.235 ± 0.201
2.351MetIle: 2.351 ± 0.588
2.822MetLys: 2.822 ± 0.888
1.646MetLeu: 1.646 ± 0.406
1.646MetMet: 1.646 ± 0.548
0.941MetAsn: 0.941 ± 0.581
0.47MetPro: 0.47 ± 0.279
0.941MetGln: 0.941 ± 0.332
2.116MetArg: 2.116 ± 0.542
1.881MetSer: 1.881 ± 0.437
2.586MetThr: 2.586 ± 0.571
3.292MetVal: 3.292 ± 1.15
0.235MetTrp: 0.235 ± 0.201
1.176MetTyr: 1.176 ± 0.393
0.0MetXaa: 0.0 ± 0.0
Asn
4.938AsnAla: 4.938 ± 1.201
0.235AsnCys: 0.235 ± 0.227
2.586AsnAsp: 2.586 ± 0.39
3.762AsnGlu: 3.762 ± 0.825
1.646AsnPhe: 1.646 ± 0.534
4.232AsnGly: 4.232 ± 1.115
0.235AsnHis: 0.235 ± 0.185
2.116AsnIle: 2.116 ± 0.212
3.057AsnLys: 3.057 ± 0.545
3.292AsnLeu: 3.292 ± 0.425
2.822AsnMet: 2.822 ± 0.591
2.586AsnAsn: 2.586 ± 0.996
4.467AsnPro: 4.467 ± 0.613
2.586AsnGln: 2.586 ± 0.65
3.057AsnArg: 3.057 ± 0.651
3.057AsnSer: 3.057 ± 0.697
5.408AsnThr: 5.408 ± 1.083
2.822AsnVal: 2.822 ± 0.94
1.176AsnTrp: 1.176 ± 0.563
0.941AsnTyr: 0.941 ± 0.38
0.0AsnXaa: 0.0 ± 0.0
Pro
2.586ProAla: 2.586 ± 0.93
0.47ProCys: 0.47 ± 0.242
1.411ProAsp: 1.411 ± 0.445
2.822ProGlu: 2.822 ± 0.516
2.116ProPhe: 2.116 ± 0.35
3.292ProGly: 3.292 ± 0.659
0.235ProHis: 0.235 ± 0.189
2.586ProIle: 2.586 ± 0.435
2.822ProLys: 2.822 ± 0.62
3.292ProLeu: 3.292 ± 0.858
0.941ProMet: 0.941 ± 0.552
3.292ProAsn: 3.292 ± 0.729
1.881ProPro: 1.881 ± 0.446
1.176ProGln: 1.176 ± 0.686
2.116ProArg: 2.116 ± 0.658
3.057ProSer: 3.057 ± 0.746
1.881ProThr: 1.881 ± 0.575
1.646ProVal: 1.646 ± 0.541
0.235ProTrp: 0.235 ± 0.189
0.941ProTyr: 0.941 ± 0.391
0.0ProXaa: 0.0 ± 0.0
Gln
2.351GlnAla: 2.351 ± 0.933
0.705GlnCys: 0.705 ± 0.463
1.176GlnAsp: 1.176 ± 0.534
1.881GlnGlu: 1.881 ± 0.661
0.705GlnPhe: 0.705 ± 0.444
3.057GlnGly: 3.057 ± 0.835
0.47GlnHis: 0.47 ± 0.302
3.997GlnIle: 3.997 ± 0.813
2.351GlnLys: 2.351 ± 0.851
2.586GlnLeu: 2.586 ± 0.477
2.822GlnMet: 2.822 ± 0.904
2.822GlnAsn: 2.822 ± 0.684
0.705GlnPro: 0.705 ± 0.389
1.411GlnGln: 1.411 ± 0.355
3.527GlnArg: 3.527 ± 1.133
3.527GlnSer: 3.527 ± 1.037
2.351GlnThr: 2.351 ± 0.777
1.881GlnVal: 1.881 ± 0.515
0.47GlnTrp: 0.47 ± 0.403
0.941GlnTyr: 0.941 ± 0.253
0.0GlnXaa: 0.0 ± 0.0
Arg
4.467ArgAla: 4.467 ± 0.905
0.705ArgCys: 0.705 ± 0.314
2.822ArgAsp: 2.822 ± 0.647
3.057ArgGlu: 3.057 ± 0.859
2.822ArgPhe: 2.822 ± 0.71
7.054ArgGly: 7.054 ± 1.107
0.941ArgHis: 0.941 ± 0.358
4.232ArgIle: 4.232 ± 0.682
2.822ArgLys: 2.822 ± 0.605
4.467ArgLeu: 4.467 ± 0.579
3.762ArgMet: 3.762 ± 1.605
4.703ArgAsn: 4.703 ± 0.833
2.822ArgPro: 2.822 ± 0.608
3.527ArgGln: 3.527 ± 0.469
7.054ArgArg: 7.054 ± 0.816
4.703ArgSer: 4.703 ± 1.102
6.348ArgThr: 6.348 ± 0.833
2.822ArgVal: 2.822 ± 0.983
0.235ArgTrp: 0.235 ± 0.185
2.116ArgTyr: 2.116 ± 0.415
0.0ArgXaa: 0.0 ± 0.0
Ser
3.997SerAla: 3.997 ± 1.143
1.646SerCys: 1.646 ± 0.392
2.822SerAsp: 2.822 ± 0.578
3.762SerGlu: 3.762 ± 0.693
4.703SerPhe: 4.703 ± 0.789
6.584SerGly: 6.584 ± 1.489
1.646SerHis: 1.646 ± 0.738
3.762SerIle: 3.762 ± 0.81
2.822SerLys: 2.822 ± 0.786
5.408SerLeu: 5.408 ± 1.104
2.822SerMet: 2.822 ± 0.947
4.232SerAsn: 4.232 ± 1.13
2.822SerPro: 2.822 ± 0.625
3.762SerGln: 3.762 ± 0.855
4.232SerArg: 4.232 ± 0.889
6.819SerSer: 6.819 ± 1.207
4.232SerThr: 4.232 ± 0.773
3.057SerVal: 3.057 ± 0.946
0.941SerTrp: 0.941 ± 0.588
2.116SerTyr: 2.116 ± 0.677
0.0SerXaa: 0.0 ± 0.0
Thr
4.232ThrAla: 4.232 ± 0.397
1.646ThrCys: 1.646 ± 0.806
3.057ThrAsp: 3.057 ± 0.771
4.703ThrGlu: 4.703 ± 1.058
2.351ThrPhe: 2.351 ± 0.386
5.173ThrGly: 5.173 ± 1.022
1.881ThrHis: 1.881 ± 0.712
5.643ThrIle: 5.643 ± 1.152
4.467ThrLys: 4.467 ± 0.54
4.232ThrLeu: 4.232 ± 0.906
2.351ThrMet: 2.351 ± 0.435
3.057ThrAsn: 3.057 ± 0.792
1.646ThrPro: 1.646 ± 0.635
2.351ThrGln: 2.351 ± 0.728
4.703ThrArg: 4.703 ± 0.814
3.292ThrSer: 3.292 ± 0.803
4.938ThrThr: 4.938 ± 1.374
5.173ThrVal: 5.173 ± 1.2
0.941ThrTrp: 0.941 ± 0.434
2.822ThrTyr: 2.822 ± 0.612
0.0ThrXaa: 0.0 ± 0.0
Val
3.527ValAla: 3.527 ± 0.689
1.881ValCys: 1.881 ± 0.566
3.292ValAsp: 3.292 ± 0.92
3.762ValGlu: 3.762 ± 0.638
2.822ValPhe: 2.822 ± 0.629
3.527ValGly: 3.527 ± 0.833
0.941ValHis: 0.941 ± 0.585
1.411ValIle: 1.411 ± 0.404
3.292ValLys: 3.292 ± 0.748
5.173ValLeu: 5.173 ± 1.936
2.116ValMet: 2.116 ± 0.601
3.527ValAsn: 3.527 ± 0.901
2.116ValPro: 2.116 ± 0.761
2.351ValGln: 2.351 ± 0.862
4.232ValArg: 4.232 ± 1.551
5.408ValSer: 5.408 ± 0.812
2.116ValThr: 2.116 ± 0.467
3.762ValVal: 3.762 ± 0.723
0.705ValTrp: 0.705 ± 0.327
1.411ValTyr: 1.411 ± 0.318
0.0ValXaa: 0.0 ± 0.0
Trp
0.705TrpAla: 0.705 ± 0.294
0.0TrpCys: 0.0 ± 0.0
0.47TrpAsp: 0.47 ± 0.258
1.646TrpGlu: 1.646 ± 0.499
0.941TrpPhe: 0.941 ± 0.322
0.705TrpGly: 0.705 ± 0.264
0.47TrpHis: 0.47 ± 0.357
0.941TrpIle: 0.941 ± 0.377
0.47TrpLys: 0.47 ± 0.379
0.941TrpLeu: 0.941 ± 0.496
0.705TrpMet: 0.705 ± 0.448
0.941TrpAsn: 0.941 ± 0.322
0.235TrpPro: 0.235 ± 0.189
0.235TrpGln: 0.235 ± 0.205
0.705TrpArg: 0.705 ± 0.546
1.176TrpSer: 1.176 ± 0.563
1.176TrpThr: 1.176 ± 0.412
0.47TrpVal: 0.47 ± 0.242
0.705TrpTrp: 0.705 ± 0.266
0.235TrpTyr: 0.235 ± 0.227
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.941TyrAla: 0.941 ± 0.307
0.235TyrCys: 0.235 ± 0.189
2.116TyrAsp: 2.116 ± 0.757
1.176TyrGlu: 1.176 ± 0.391
1.646TyrPhe: 1.646 ± 0.386
1.881TyrGly: 1.881 ± 0.356
0.235TyrHis: 0.235 ± 0.227
1.411TyrIle: 1.411 ± 0.409
1.411TyrLys: 1.411 ± 0.672
1.646TyrLeu: 1.646 ± 0.359
0.47TyrMet: 0.47 ± 0.214
1.646TyrAsn: 1.646 ± 0.507
0.705TyrPro: 0.705 ± 0.405
1.411TyrGln: 1.411 ± 0.369
1.881TyrArg: 1.881 ± 0.905
2.351TyrSer: 2.351 ± 0.353
1.881TyrThr: 1.881 ± 0.642
2.116TyrVal: 2.116 ± 0.96
0.705TyrTrp: 0.705 ± 0.29
0.47TyrTyr: 0.47 ± 0.242
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (4254 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski