Amino acid dipeptide frequency for Influenza A virus (A/New York/1051/2006(H1N1))

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred at random and with equal probability in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
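The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names, the one-letter alphabet, and the toy sequences are assumptions, and the source does not state whether pairs spanning two proteins are counted (here they are not).

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; X stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Expected frequency if all 400 standard dipeptides were equally likely.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (assumption).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2 within one protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not the influenza proteome).
toy = ["MKAAL", "AAK"]
freqs = dipeptide_per_mille(toy)
print(round(freqs["AA"], 3), classify(freqs["AA"]))
```

With real data, the threshold of 2.5‰ plays the role of the red/blue marking: values above it are overrepresented and values below it are underrepresented relative to the uniform expectation.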

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.
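As a small worked example of the unit conversion, the sketch below reads one table entry of the form "AlaAla: 3.771 ± 1.002" and converts the value and its error from per mille to plain fractions. The regular expression and the function name are illustrative assumptions about the entry layout, not part of the database.

```python
import re

# Matches e.g. "AlaAla: 3.771 ± 1.002" anywhere in a line (assumed layout).
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)"
)


def parse_entry(text):
    """Return (first, second, fraction, error_fraction) for one table entry."""
    match = ENTRY.search(text)
    if match is None:
        raise ValueError(f"not a dipeptide entry: {text!r}")
    first, second, value, error = match.groups()
    # Per mille to fraction: multiply by 10^-3.
    return first, second, float(value) * 1e-3, float(error) * 1e-3


first, second, fraction, error = parse_entry("AlaAla: 3.771 ± 1.002")
print(first, second, fraction, error)
```

So a table value of 3.771‰ corresponds to a fraction of about 0.003771, or roughly 0.38% of all dipeptides.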

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.771 ± 1.002
AlaCys: 0.838 ± 0.414
AlaAsp: 2.933 ± 0.568
AlaGlu: 3.771 ± 0.842
AlaPhe: 1.676 ± 0.663
AlaGly: 4.4 ± 0.999
AlaHis: 0.629 ± 0.423
AlaIle: 5.447 ± 0.932
AlaLys: 2.514 ± 0.625
AlaLeu: 5.238 ± 1.029
AlaMet: 2.933 ± 0.899
AlaAsn: 3.352 ± 0.673
AlaPro: 2.514 ± 0.608
AlaGln: 1.257 ± 0.55
AlaArg: 2.514 ± 0.686
AlaSer: 5.028 ± 1.067
AlaThr: 3.771 ± 0.774
AlaVal: 3.352 ± 0.86
AlaTrp: 0.838 ± 0.365
AlaTyr: 1.048 ± 0.294
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.629 ± 0.38
CysCys: 0.21 ± 0.176
CysAsp: 0.21 ± 0.2
CysGlu: 0.838 ± 0.278
CysPhe: 1.886 ± 0.728
CysGly: 0.21 ± 0.192
CysHis: 0.629 ± 0.237
CysIle: 1.048 ± 0.406
CysLys: 0.838 ± 0.367
CysLeu: 1.048 ± 0.347
CysMet: 0.838 ± 0.305
CysAsn: 2.095 ± 0.569
CysPro: 0.629 ± 0.335
CysGln: 0.21 ± 0.2
CysArg: 1.467 ± 0.668
CysSer: 1.886 ± 0.952
CysThr: 0.838 ± 0.314
CysVal: 1.676 ± 0.658
CysTrp: 0.21 ± 0.186
CysTyr: 0.419 ± 0.284
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.514 ± 0.566
AspCys: 1.257 ± 0.341
AspAsp: 1.676 ± 0.572
AspGlu: 3.143 ± 0.467
AspPhe: 2.514 ± 0.766
AspGly: 3.562 ± 1.262
AspHis: 0.629 ± 0.33
AspIle: 1.048 ± 0.42
AspLys: 1.886 ± 0.634
AspLeu: 3.981 ± 1.046
AspMet: 1.467 ± 0.434
AspAsn: 2.514 ± 0.728
AspPro: 2.933 ± 0.738
AspGln: 2.305 ± 0.852
AspArg: 2.724 ± 0.532
AspSer: 3.771 ± 0.803
AspThr: 2.933 ± 0.768
AspVal: 2.514 ± 0.659
AspTrp: 0.629 ± 0.294
AspTyr: 2.095 ± 0.566
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.724 ± 0.536
GluCys: 1.886 ± 1.01
GluAsp: 4.4 ± 0.878
GluGlu: 6.914 ± 1.285
GluPhe: 1.886 ± 0.539
GluGly: 3.981 ± 1.409
GluHis: 0.838 ± 0.302
GluIle: 5.447 ± 0.882
GluLys: 6.076 ± 1.418
GluLeu: 5.238 ± 0.917
GluMet: 2.514 ± 0.649
GluAsn: 4.19 ± 0.809
GluPro: 2.724 ± 1.214
GluGln: 3.981 ± 1.021
GluArg: 3.981 ± 0.984
GluSer: 6.704 ± 0.906
GluThr: 3.562 ± 0.587
GluVal: 5.238 ± 1.073
GluTrp: 0.838 ± 0.387
GluTyr: 1.467 ± 0.444
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.095 ± 0.555
PheCys: 0.21 ± 0.192
PheAsp: 1.467 ± 0.626
PheGlu: 5.447 ± 1.246
PhePhe: 1.257 ± 0.405
PheGly: 1.886 ± 0.325
PheHis: 1.048 ± 0.391
PheIle: 1.886 ± 0.592
PheLys: 1.048 ± 0.414
PheLeu: 4.19 ± 0.751
PheMet: 0.838 ± 0.423
PheAsn: 2.095 ± 0.626
PhePro: 1.048 ± 0.37
PheGln: 2.514 ± 0.618
PheArg: 1.257 ± 0.461
PheSer: 3.562 ± 0.762
PheThr: 3.771 ± 0.581
PheVal: 3.143 ± 0.822
PheTrp: 0.629 ± 0.326
PheTyr: 1.048 ± 0.478
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.352 ± 1.218
GlyCys: 0.21 ± 0.2
GlyAsp: 3.352 ± 0.568
GlyGlu: 3.981 ± 1.364
GlyPhe: 2.933 ± 0.574
GlyGly: 2.724 ± 0.777
GlyHis: 0.838 ± 0.47
GlyIle: 4.4 ± 0.674
GlyLys: 4.819 ± 0.685
GlyLeu: 4.4 ± 0.96
GlyMet: 2.095 ± 0.455
GlyAsn: 3.143 ± 0.936
GlyPro: 3.562 ± 0.694
GlyGln: 1.886 ± 0.663
GlyArg: 4.19 ± 1.045
GlySer: 4.609 ± 1.491
GlyThr: 5.238 ± 0.992
GlyVal: 5.657 ± 0.657
GlyTrp: 1.467 ± 0.825
GlyTyr: 2.724 ± 0.751
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.629 ± 0.264
HisCys: 0.21 ± 0.189
HisAsp: 0.419 ± 0.284
HisGlu: 0.838 ± 0.447
HisPhe: 1.257 ± 0.423
HisGly: 1.257 ± 0.42
HisHis: 0.419 ± 0.4
HisIle: 2.095 ± 0.885
HisLys: 1.676 ± 0.479
HisLeu: 1.257 ± 0.455
HisMet: 0.21 ± 0.186
HisAsn: 0.419 ± 0.4
HisPro: 1.257 ± 0.442
HisGln: 0.629 ± 0.235
HisArg: 1.257 ± 0.561
HisSer: 1.467 ± 0.445
HisThr: 1.676 ± 0.82
HisVal: 0.21 ± 0.227
HisTrp: 0.0 ± 0.0
HisTyr: 0.838 ± 0.339
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.143 ± 0.783
IleCys: 2.514 ± 0.546
IleAsp: 2.305 ± 0.935
IleGlu: 6.914 ± 2.164
IlePhe: 1.676 ± 0.541
IleGly: 6.076 ± 1.226
IleHis: 0.629 ± 0.294
IleIle: 4.19 ± 1.014
IleLys: 3.981 ± 0.97
IleLeu: 5.238 ± 1.342
IleMet: 2.305 ± 0.481
IleAsn: 3.771 ± 0.693
IlePro: 1.676 ± 0.515
IleGln: 2.305 ± 0.457
IleArg: 5.028 ± 1.152
IleSer: 3.771 ± 1.169
IleThr: 3.562 ± 0.896
IleVal: 3.771 ± 0.736
IleTrp: 1.467 ± 0.66
IleTyr: 1.676 ± 0.385
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.19 ± 1.157
LysCys: 1.048 ± 0.512
LysAsp: 2.724 ± 0.646
LysGlu: 5.657 ± 1.253
LysPhe: 1.886 ± 0.686
LysGly: 3.143 ± 0.883
LysHis: 1.257 ± 0.335
LysIle: 3.771 ± 0.738
LysLys: 4.19 ± 1.649
LysLeu: 5.866 ± 1.203
LysMet: 3.352 ± 0.604
LysAsn: 3.143 ± 1.016
LysPro: 0.629 ± 0.426
LysGln: 1.886 ± 0.749
LysArg: 4.819 ± 1.387
LysSer: 3.771 ± 0.912
LysThr: 3.771 ± 0.92
LysVal: 2.514 ± 0.93
LysTrp: 1.676 ± 0.582
LysTyr: 1.886 ± 0.423
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.771 ± 0.648
LeuCys: 1.048 ± 0.504
LeuAsp: 1.886 ± 0.579
LeuGlu: 6.914 ± 1.133
LeuPhe: 2.095 ± 0.589
LeuGly: 4.4 ± 1.175
LeuHis: 0.629 ± 0.347
LeuIle: 5.866 ± 0.966
LeuLys: 7.333 ± 1.507
LeuLeu: 7.542 ± 1.778
LeuMet: 2.514 ± 0.577
LeuAsn: 4.19 ± 0.659
LeuPro: 3.771 ± 0.558
LeuGln: 3.352 ± 0.556
LeuArg: 5.447 ± 0.911
LeuSer: 3.981 ± 0.745
LeuThr: 6.076 ± 1.721
LeuVal: 3.352 ± 1.189
LeuTrp: 1.676 ± 0.474
LeuTyr: 3.143 ± 0.964
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.981 ± 0.894
MetCys: 1.257 ± 0.608
MetAsp: 3.352 ± 0.904
MetGlu: 3.771 ± 0.719
MetPhe: 1.048 ± 0.68
MetGly: 2.305 ± 0.913
MetHis: 0.21 ± 0.186
MetIle: 3.143 ± 0.711
MetLys: 2.305 ± 0.719
MetLeu: 1.467 ± 0.354
MetMet: 1.257 ± 0.605
MetAsn: 0.838 ± 0.339
MetPro: 0.629 ± 0.334
MetGln: 1.257 ± 0.577
MetArg: 2.514 ± 0.934
MetSer: 2.305 ± 0.419
MetThr: 1.467 ± 0.433
MetVal: 2.933 ± 0.893
MetTrp: 0.419 ± 0.258
MetTyr: 0.838 ± 0.301
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.4 ± 0.827
AsnCys: 0.21 ± 0.2
AsnAsp: 2.933 ± 0.482
AsnGlu: 3.981 ± 0.743
AsnPhe: 1.676 ± 0.455
AsnGly: 5.866 ± 1.889
AsnHis: 0.629 ± 0.442
AsnIle: 2.095 ± 0.634
AsnLys: 2.724 ± 0.73
AsnLeu: 4.19 ± 0.875
AsnMet: 1.886 ± 0.679
AsnAsn: 1.886 ± 0.668
AsnPro: 4.609 ± 0.645
AsnGln: 2.095 ± 0.681
AsnArg: 3.143 ± 0.743
AsnSer: 3.771 ± 0.778
AsnThr: 3.562 ± 0.55
AsnVal: 2.095 ± 0.747
AsnTrp: 0.838 ± 0.473
AsnTyr: 1.048 ± 0.338
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.305 ± 1.147
ProCys: 0.419 ± 0.244
ProAsp: 1.676 ± 0.608
ProGlu: 3.562 ± 0.566
ProPhe: 2.305 ± 0.553
ProGly: 2.095 ± 0.453
ProHis: 0.419 ± 0.377
ProIle: 2.095 ± 0.6
ProLys: 3.562 ± 0.879
ProLeu: 3.143 ± 0.786
ProMet: 1.467 ± 0.643
ProAsn: 3.771 ± 0.839
ProPro: 1.257 ± 0.363
ProGln: 0.629 ± 0.24
ProArg: 2.095 ± 0.665
ProSer: 3.143 ± 0.8
ProThr: 1.048 ± 0.361
ProVal: 1.676 ± 0.656
ProTrp: 0.629 ± 0.322
ProTyr: 0.838 ± 0.465
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.095 ± 0.958
GlnCys: 0.838 ± 0.488
GlnAsp: 1.467 ± 0.401
GlnGlu: 1.676 ± 0.456
GlnPhe: 0.629 ± 0.316
GlnGly: 3.352 ± 0.616
GlnHis: 0.629 ± 0.392
GlnIle: 3.143 ± 0.501
GlnLys: 2.724 ± 0.976
GlnLeu: 3.771 ± 1.096
GlnMet: 2.305 ± 1.004
GlnAsn: 2.933 ± 0.529
GlnPro: 0.629 ± 0.39
GlnGln: 1.048 ± 0.341
GlnArg: 3.352 ± 0.814
GlnSer: 2.514 ± 0.856
GlnThr: 2.514 ± 0.819
GlnVal: 1.676 ± 0.605
GlnTrp: 0.419 ± 0.372
GlnTyr: 0.629 ± 0.239
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.609 ± 0.796
ArgCys: 0.838 ± 0.409
ArgAsp: 2.933 ± 0.62
ArgGlu: 3.352 ± 0.607
ArgPhe: 3.143 ± 0.667
ArgGly: 5.866 ± 1.305
ArgHis: 0.629 ± 0.367
ArgIle: 4.609 ± 0.659
ArgLys: 2.305 ± 0.459
ArgLeu: 3.981 ± 0.695
ArgMet: 3.771 ± 1.061
ArgAsn: 4.609 ± 1.137
ArgPro: 2.305 ± 0.597
ArgGln: 2.724 ± 0.757
ArgArg: 5.238 ± 0.802
ArgSer: 4.19 ± 1.182
ArgThr: 5.657 ± 1.015
ArgVal: 2.305 ± 0.975
ArgTrp: 0.419 ± 0.337
ArgTyr: 1.257 ± 0.51
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.771 ± 1.074
SerCys: 2.305 ± 0.919
SerAsp: 2.724 ± 0.519
SerGlu: 2.933 ± 0.781
SerPhe: 5.657 ± 0.957
SerGly: 4.819 ± 1.201
SerHis: 2.724 ± 0.703
SerIle: 6.285 ± 1.509
SerLys: 3.352 ± 0.718
SerLeu: 6.076 ± 0.958
SerMet: 2.095 ± 0.689
SerAsn: 3.352 ± 0.904
SerPro: 2.724 ± 0.713
SerGln: 3.562 ± 0.767
SerArg: 3.352 ± 0.783
SerSer: 7.123 ± 1.272
SerThr: 4.609 ± 1.137
SerVal: 3.981 ± 1.161
SerTrp: 1.676 ± 0.628
SerTyr: 2.095 ± 0.678
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.562 ± 0.377
ThrCys: 1.048 ± 0.349
ThrAsp: 2.724 ± 0.721
ThrGlu: 3.981 ± 0.868
ThrPhe: 2.933 ± 0.558
ThrGly: 5.028 ± 0.983
ThrHis: 2.305 ± 0.723
ThrIle: 4.819 ± 0.923
ThrLys: 4.609 ± 0.887
ThrLeu: 4.4 ± 0.658
ThrMet: 2.305 ± 0.513
ThrAsn: 2.305 ± 0.528
ThrPro: 1.886 ± 0.588
ThrGln: 3.352 ± 0.798
ThrArg: 4.19 ± 1.043
ThrSer: 3.562 ± 0.548
ThrThr: 3.981 ± 1.124
ThrVal: 4.819 ± 0.985
ThrTrp: 0.629 ± 0.264
ThrTyr: 2.933 ± 0.687
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.19 ± 0.802
ValCys: 1.257 ± 0.48
ValAsp: 3.771 ± 0.713
ValGlu: 3.771 ± 0.637
ValPhe: 2.095 ± 0.663
ValGly: 2.305 ± 0.768
ValHis: 1.467 ± 0.502
ValIle: 1.257 ± 0.365
ValLys: 3.352 ± 1.034
ValLeu: 5.866 ± 1.417
ValMet: 1.467 ± 0.552
ValAsn: 3.143 ± 0.675
ValPro: 1.467 ± 0.588
ValGln: 1.676 ± 0.776
ValArg: 5.028 ± 1.332
ValSer: 5.866 ± 0.706
ValThr: 3.562 ± 0.998
ValVal: 3.143 ± 0.732
ValTrp: 0.419 ± 0.266
ValTyr: 1.467 ± 0.366
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.838 ± 0.408
TrpCys: 0.0 ± 0.0
TrpAsp: 1.048 ± 0.415
TrpGlu: 1.257 ± 0.546
TrpPhe: 0.629 ± 0.267
TrpGly: 0.629 ± 0.264
TrpHis: 0.629 ± 0.344
TrpIle: 1.467 ± 0.476
TrpLys: 0.629 ± 0.391
TrpLeu: 0.838 ± 0.399
TrpMet: 0.838 ± 0.299
TrpAsn: 0.629 ± 0.27
TrpPro: 0.629 ± 0.298
TrpGln: 0.0 ± 0.0
TrpArg: 0.838 ± 0.476
TrpSer: 1.257 ± 0.628
TrpThr: 1.886 ± 0.664
TrpVal: 1.257 ± 0.545
TrpTrp: 0.419 ± 0.22
TrpTyr: 0.21 ± 0.2
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.257 ± 0.584
TyrCys: 0.21 ± 0.189
TyrAsp: 2.095 ± 0.661
TyrGlu: 1.886 ± 0.672
TyrPhe: 1.257 ± 0.285
TyrGly: 1.676 ± 0.257
TyrHis: 0.838 ± 0.801
TyrIle: 1.886 ± 0.404
TyrLys: 1.257 ± 0.378
TyrLeu: 1.257 ± 0.424
TyrMet: 0.629 ± 0.28
TyrAsn: 1.467 ± 0.466
TyrPro: 1.257 ± 0.645
TyrGln: 1.676 ± 0.415
TyrArg: 2.305 ± 0.982
TyrSer: 2.933 ± 0.436
TyrThr: 1.886 ± 0.639
TyrVal: 1.257 ± 0.508
TyrTrp: 0.629 ± 0.282
TyrTyr: 0.629 ± 0.311
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (4774 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
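A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting values serves as the error. This is only an illustration under stated assumptions: the source does not say which spread measure is reported (the sample standard deviation is used here), and the function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(sequences, dipeptide):
    """Per mille frequency of one dipeptide (one-letter code) in the sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, not the influenza proteins).
proteins = ["MKAAL", "AAKAA", "MLLK"]
print(dipeptide_freq(proteins, "AA"), bootstrap_error(proteins, "AA"))
```

Resampling whole proteins rather than individual residues keeps the pairs within each protein intact, which is one plausible reading of "at the protein level".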
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski