Amino acid dipeptide frequency for Influenza A virus (A/duck/Jiangxi/5460/2014(mixed))

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
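As a minimal sketch of how such frequencies might be computed (the database's own pipeline is not described here, so this is an assumption), the Python snippet below counts overlapping residue pairs within each protein, pools every non-standard letter as X (Xaa), and reports each dipeptide's share in per mille. The function name `dipeptide_permille` and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard amino acids (one-letter codes)

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumption: pairs are counted within each protein only (never across
    protein boundaries) and non-standard letters are pooled as 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation over the 400 standard dipeptides: 1/400 = 2.5 per mille
EXPECTED_PERMILLE = 1000.0 / 400

toy = ["MKAAL", "AAG"]  # hypothetical sequences, not the real proteome
freqs = dipeptide_permille(toy)
# Dipeptides above the uniform expectation (the ones a table would mark in red)
over = sorted(p for p, f in freqs.items() if f > EXPECTED_PERMILLE)
```

With only six pairs in the toy input every observed dipeptide exceeds 2.5‰; on a real proteome the same comparison separates overrepresented from underrepresented pairs.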

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
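The unit conversion can be illustrated with a short Python sketch; the helper names are hypothetical, and the AlaLeu value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

ala_leu = 5.996  # AlaLeu frequency from the table, in per mille
fraction = permille_to_fraction(ala_leu)  # about 0.005996
percent = permille_to_percent(ala_leu)    # about 0.5996 %
uniform = permille_to_fraction(2.5)       # uniform expectation, i.e. 1/400
```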

For more information, see the sequence space article on Wikipedia.

Residue order (first and second position): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.426 ± 0.873
AlaCys: 1.542 ± 0.667
AlaAsp: 3.084 ± 0.763
AlaGlu: 3.769 ± 0.684
AlaPhe: 1.885 ± 0.666
AlaGly: 3.426 ± 0.92
AlaHis: 0.514 ± 0.401
AlaIle: 3.769 ± 0.743
AlaLys: 2.056 ± 0.581
AlaLeu: 5.996 ± 0.796
AlaMet: 2.57 ± 0.792
AlaAsn: 2.056 ± 0.682
AlaPro: 2.227 ± 0.415
AlaGln: 1.542 ± 0.38
AlaArg: 2.57 ± 0.482
AlaSer: 4.797 ± 1.129
AlaThr: 3.94 ± 0.826
AlaVal: 2.741 ± 0.555
AlaTrp: 1.028 ± 0.444
AlaTyr: 0.857 ± 0.296
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.685 ± 0.329
CysCys: 0.171 ± 0.15
CysAsp: 1.199 ± 0.495
CysGlu: 0.685 ± 0.269
CysPhe: 2.398 ± 0.762
CysGly: 0.514 ± 0.28
CysHis: 0.514 ± 0.253
CysIle: 2.056 ± 0.675
CysLys: 0.857 ± 0.308
CysLeu: 1.885 ± 0.52
CysMet: 0.685 ± 0.294
CysAsn: 1.542 ± 0.434
CysPro: 0.514 ± 0.228
CysGln: 0.685 ± 0.339
CysArg: 1.199 ± 0.576
CysSer: 1.542 ± 0.523
CysThr: 1.199 ± 0.449
CysVal: 1.199 ± 0.253
CysTrp: 0.343 ± 0.238
CysTyr: 0.857 ± 0.403
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.741 ± 0.36
AspCys: 1.542 ± 0.446
AspAsp: 1.542 ± 0.481
AspGlu: 2.741 ± 0.701
AspPhe: 1.885 ± 0.708
AspGly: 3.426 ± 0.791
AspHis: 1.028 ± 0.33
AspIle: 1.713 ± 0.461
AspLys: 2.056 ± 0.413
AspLeu: 4.112 ± 0.757
AspMet: 1.542 ± 0.452
AspAsn: 3.426 ± 0.719
AspPro: 3.255 ± 0.656
AspGln: 1.713 ± 0.678
AspArg: 2.398 ± 0.482
AspSer: 3.598 ± 0.647
AspThr: 2.056 ± 0.507
AspVal: 3.769 ± 0.487
AspTrp: 0.857 ± 0.303
AspTyr: 1.542 ± 0.479
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.741 ± 0.62
GluCys: 1.542 ± 0.77
GluAsp: 4.283 ± 0.709
GluGlu: 5.311 ± 1.151
GluPhe: 1.713 ± 0.536
GluGly: 4.626 ± 0.97
GluHis: 0.514 ± 0.246
GluIle: 4.797 ± 0.821
GluLys: 6.168 ± 1.25
GluLeu: 4.797 ± 0.788
GluMet: 2.398 ± 0.529
GluAsn: 3.94 ± 0.741
GluPro: 2.57 ± 1.046
GluGln: 3.084 ± 0.921
GluArg: 4.112 ± 1.117
GluSer: 5.311 ± 1.262
GluThr: 4.283 ± 0.518
GluVal: 4.454 ± 0.897
GluTrp: 0.857 ± 0.391
GluTyr: 1.542 ± 0.404
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.056 ± 0.448
PheCys: 0.514 ± 0.303
PheAsp: 1.371 ± 0.405
PheGlu: 4.626 ± 0.933
PhePhe: 1.542 ± 0.416
PheGly: 1.713 ± 0.277
PheHis: 1.371 ± 0.494
PheIle: 2.57 ± 0.774
PheLys: 1.028 ± 0.41
PheLeu: 3.426 ± 0.752
PheMet: 1.028 ± 0.4
PheAsn: 1.885 ± 0.587
PhePro: 1.028 ± 0.344
PheGln: 2.57 ± 0.602
PheArg: 1.885 ± 0.292
PheSer: 3.426 ± 0.565
PheThr: 2.57 ± 0.629
PheVal: 2.912 ± 0.698
PheTrp: 0.514 ± 0.376
PheTyr: 0.857 ± 0.31
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.084 ± 0.883
GlyCys: 0.685 ± 0.25
GlyAsp: 2.57 ± 0.374
GlyGlu: 3.598 ± 1.191
GlyPhe: 4.283 ± 0.994
GlyGly: 2.57 ± 0.696
GlyHis: 1.371 ± 0.467
GlyIle: 3.598 ± 0.787
GlyLys: 4.797 ± 0.733
GlyLeu: 4.283 ± 1.051
GlyMet: 1.885 ± 0.467
GlyAsn: 4.112 ± 1.046
GlyPro: 3.084 ± 0.548
GlyGln: 2.57 ± 0.466
GlyArg: 4.797 ± 1.06
GlySer: 4.797 ± 1.241
GlyThr: 5.311 ± 0.603
GlyVal: 5.311 ± 0.315
GlyTrp: 1.371 ± 0.493
GlyTyr: 2.227 ± 0.675
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.857 ± 0.231
HisCys: 0.514 ± 0.256
HisAsp: 0.514 ± 0.376
HisGlu: 0.857 ± 0.345
HisPhe: 1.028 ± 0.301
HisGly: 0.685 ± 0.334
HisHis: 0.685 ± 0.434
HisIle: 1.885 ± 0.855
HisLys: 1.542 ± 0.415
HisLeu: 1.028 ± 0.39
HisMet: 0.171 ± 0.167
HisAsn: 0.171 ± 0.176
HisPro: 0.857 ± 0.346
HisGln: 0.857 ± 0.332
HisArg: 1.199 ± 0.555
HisSer: 1.713 ± 0.599
HisThr: 0.514 ± 0.281
HisVal: 0.685 ± 0.36
HisTrp: 0.171 ± 0.166
HisTyr: 0.343 ± 0.206
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.426 ± 0.768
IleCys: 2.398 ± 0.75
IleAsp: 4.283 ± 1.149
IleGlu: 6.853 ± 1.896
IlePhe: 1.371 ± 0.328
IleGly: 4.797 ± 1.095
IleHis: 0.685 ± 0.296
IleIle: 3.426 ± 0.763
IleLys: 3.598 ± 0.906
IleLeu: 5.654 ± 1.121
IleMet: 1.885 ± 0.466
IleAsn: 3.769 ± 0.593
IlePro: 2.227 ± 0.448
IleGln: 2.056 ± 0.4
IleArg: 5.311 ± 1.207
IleSer: 2.056 ± 0.737
IleThr: 4.626 ± 0.767
IleVal: 3.426 ± 0.896
IleTrp: 0.857 ± 0.428
IleTyr: 1.713 ± 0.522
IleXaa: 0.343 ± 0.421
Lys
LysAla: 3.426 ± 0.726
LysCys: 1.542 ± 0.584
LysAsp: 2.912 ± 0.473
LysGlu: 4.454 ± 0.916
LysPhe: 1.542 ± 0.342
LysGly: 3.084 ± 0.51
LysHis: 0.857 ± 0.303
LysIle: 4.283 ± 0.793
LysLys: 3.255 ± 1.181
LysLeu: 4.626 ± 1.064
LysMet: 2.57 ± 0.541
LysAsn: 1.542 ± 0.544
LysPro: 1.199 ± 0.452
LysGln: 2.398 ± 0.874
LysArg: 3.769 ± 0.884
LysSer: 3.426 ± 0.756
LysThr: 4.283 ± 0.908
LysVal: 1.885 ± 0.586
LysTrp: 1.713 ± 0.439
LysTyr: 1.713 ± 0.362
LysXaa: 0.171 ± 0.167
Leu
LeuAla: 4.283 ± 0.705
LeuCys: 1.199 ± 0.451
LeuAsp: 1.371 ± 0.551
LeuGlu: 5.825 ± 1.206
LeuPhe: 2.398 ± 0.668
LeuGly: 4.283 ± 0.673
LeuHis: 0.685 ± 0.332
LeuIle: 5.996 ± 1.12
LeuLys: 5.996 ± 1.266
LeuLeu: 6.168 ± 1.161
LeuMet: 2.056 ± 0.455
LeuAsn: 3.769 ± 1.06
LeuPro: 3.94 ± 0.81
LeuGln: 2.227 ± 0.57
LeuArg: 4.968 ± 0.826
LeuSer: 4.112 ± 0.787
LeuThr: 4.283 ± 1.06
LeuVal: 4.283 ± 0.933
LeuTrp: 1.542 ± 0.36
LeuTyr: 2.741 ± 0.782
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.255 ± 0.749
MetCys: 0.685 ± 0.594
MetAsp: 2.741 ± 0.942
MetGlu: 3.598 ± 0.901
MetPhe: 0.857 ± 0.611
MetGly: 2.227 ± 0.797
MetHis: 0.171 ± 0.144
MetIle: 2.227 ± 0.48
MetLys: 2.398 ± 0.718
MetLeu: 1.542 ± 0.386
MetMet: 1.371 ± 0.541
MetAsn: 1.028 ± 0.512
MetPro: 0.685 ± 0.287
MetGln: 1.199 ± 0.452
MetArg: 2.912 ± 0.65
MetSer: 2.056 ± 0.478
MetThr: 2.056 ± 0.582
MetVal: 1.713 ± 0.75
MetTrp: 0.514 ± 0.232
MetTyr: 0.514 ± 0.262
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.283 ± 1.076
AsnCys: 0.685 ± 0.397
AsnAsp: 3.598 ± 0.867
AsnGlu: 3.769 ± 0.692
AsnPhe: 1.199 ± 0.484
AsnGly: 6.168 ± 1.595
AsnHis: 0.343 ± 0.194
AsnIle: 2.741 ± 0.452
AsnLys: 2.741 ± 0.525
AsnLeu: 2.741 ± 0.552
AsnMet: 2.056 ± 0.558
AsnAsn: 2.57 ± 0.785
AsnPro: 3.255 ± 0.582
AsnGln: 2.056 ± 0.474
AsnArg: 3.426 ± 0.782
AsnSer: 2.912 ± 0.511
AsnThr: 4.797 ± 0.754
AsnVal: 1.885 ± 0.511
AsnTrp: 1.371 ± 0.461
AsnTyr: 0.857 ± 0.328
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.912 ± 0.755
ProCys: 0.685 ± 0.331
ProAsp: 1.713 ± 0.439
ProGlu: 2.57 ± 0.544
ProPhe: 2.398 ± 0.482
ProGly: 2.398 ± 0.374
ProHis: 0.685 ± 0.411
ProIle: 1.885 ± 0.436
ProLys: 2.398 ± 0.463
ProLeu: 2.398 ± 0.76
ProMet: 1.028 ± 0.589
ProAsn: 4.112 ± 0.868
ProPro: 1.199 ± 0.356
ProGln: 0.857 ± 0.48
ProArg: 2.398 ± 0.632
ProSer: 3.255 ± 0.716
ProThr: 1.713 ± 0.49
ProVal: 1.713 ± 0.414
ProTrp: 0.685 ± 0.273
ProTyr: 1.028 ± 0.413
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.741 ± 0.882
GlnCys: 0.857 ± 0.399
GlnAsp: 1.542 ± 0.581
GlnGlu: 2.227 ± 0.749
GlnPhe: 1.028 ± 0.425
GlnGly: 2.57 ± 0.882
GlnHis: 0.685 ± 0.317
GlnIle: 3.769 ± 0.777
GlnLys: 1.885 ± 0.757
GlnLeu: 2.57 ± 0.786
GlnMet: 2.056 ± 0.802
GlnAsn: 2.741 ± 0.794
GlnPro: 0.685 ± 0.337
GlnGln: 1.885 ± 0.462
GlnArg: 2.912 ± 0.902
GlnSer: 3.255 ± 0.79
GlnThr: 2.741 ± 0.696
GlnVal: 1.542 ± 0.464
GlnTrp: 0.685 ± 0.411
GlnTyr: 0.685 ± 0.227
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.426 ± 0.973
ArgCys: 0.685 ± 0.258
ArgAsp: 3.084 ± 0.514
ArgGlu: 3.255 ± 0.841
ArgPhe: 2.912 ± 0.773
ArgGly: 6.682 ± 1.0
ArgHis: 0.685 ± 0.331
ArgIle: 3.769 ± 0.448
ArgLys: 2.056 ± 0.75
ArgLeu: 3.94 ± 0.773
ArgMet: 2.741 ± 1.322
ArgAsn: 3.94 ± 0.787
ArgPro: 2.57 ± 0.52
ArgGln: 3.598 ± 0.734
ArgArg: 5.14 ± 0.924
ArgSer: 4.454 ± 0.909
ArgThr: 6.339 ± 1.013
ArgVal: 2.57 ± 0.717
ArgTrp: 0.514 ± 0.304
ArgTyr: 1.199 ± 0.448
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.912 ± 1.092
SerCys: 2.57 ± 0.847
SerAsp: 2.912 ± 0.507
SerGlu: 3.255 ± 0.713
SerPhe: 4.626 ± 1.051
SerGly: 6.339 ± 1.289
SerHis: 1.371 ± 0.605
SerIle: 4.454 ± 0.641
SerLys: 2.912 ± 0.66
SerLeu: 5.311 ± 0.998
SerMet: 2.056 ± 0.814
SerAsn: 4.283 ± 0.98
SerPro: 2.912 ± 0.857
SerGln: 3.255 ± 0.734
SerArg: 3.255 ± 0.659
SerSer: 6.682 ± 0.69
SerThr: 4.283 ± 0.888
SerVal: 2.227 ± 0.781
SerTrp: 1.371 ± 0.61
SerTyr: 1.713 ± 0.544
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.255 ± 0.365
ThrCys: 0.857 ± 0.31
ThrAsp: 2.741 ± 0.618
ThrGlu: 4.454 ± 0.857
ThrPhe: 2.227 ± 0.432
ThrGly: 5.482 ± 1.029
ThrHis: 1.885 ± 0.623
ThrIle: 5.996 ± 1.035
ThrLys: 4.283 ± 0.507
ThrLeu: 4.626 ± 0.878
ThrMet: 1.885 ± 0.511
ThrAsn: 3.598 ± 0.721
ThrPro: 1.885 ± 0.489
ThrGln: 3.084 ± 1.027
ThrArg: 4.112 ± 0.947
ThrSer: 2.398 ± 0.483
ThrThr: 3.255 ± 1.09
ThrVal: 5.482 ± 1.03
ThrTrp: 0.685 ± 0.327
ThrTyr: 2.398 ± 0.582
ThrXaa: 0.171 ± 0.167
Val
ValAla: 2.741 ± 0.58
ValCys: 1.713 ± 0.485
ValAsp: 2.741 ± 0.672
ValGlu: 4.797 ± 1.006
ValPhe: 1.885 ± 0.501
ValGly: 2.227 ± 0.776
ValHis: 1.199 ± 0.333
ValIle: 3.084 ± 0.948
ValLys: 2.398 ± 0.534
ValLeu: 4.283 ± 0.992
ValMet: 1.713 ± 0.541
ValAsn: 3.084 ± 0.627
ValPro: 2.398 ± 0.707
ValGln: 2.227 ± 0.718
ValArg: 3.598 ± 0.746
ValSer: 5.311 ± 0.707
ValThr: 3.255 ± 0.786
ValVal: 3.426 ± 0.711
ValTrp: 0.685 ± 0.298
ValTyr: 1.199 ± 0.321
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.685 ± 0.271
TrpCys: 0.0 ± 0.0
TrpAsp: 0.685 ± 0.283
TrpGlu: 1.371 ± 0.402
TrpPhe: 0.685 ± 0.3
TrpGly: 0.685 ± 0.237
TrpHis: 0.685 ± 0.403
TrpIle: 1.542 ± 0.464
TrpLys: 0.685 ± 0.357
TrpLeu: 1.371 ± 0.516
TrpMet: 1.371 ± 0.573
TrpAsn: 0.514 ± 0.27
TrpPro: 0.343 ± 0.204
TrpGln: 0.0 ± 0.0
TrpArg: 1.028 ± 0.513
TrpSer: 1.542 ± 0.543
TrpThr: 2.056 ± 0.603
TrpVal: 0.857 ± 0.547
TrpTrp: 0.343 ± 0.195
TrpTyr: 0.343 ± 0.217
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.857 ± 0.252
TyrCys: 0.343 ± 0.204
TyrAsp: 2.056 ± 0.631
TyrGlu: 1.028 ± 0.55
TyrPhe: 1.028 ± 0.334
TyrGly: 2.227 ± 0.447
TyrHis: 0.343 ± 0.217
TyrIle: 1.371 ± 0.391
TyrLys: 1.542 ± 0.533
TyrLeu: 1.199 ± 0.404
TyrMet: 0.343 ± 0.195
TyrAsn: 1.371 ± 0.328
TyrPro: 1.199 ± 0.378
TyrGln: 1.199 ± 0.308
TyrArg: 2.57 ± 0.855
TyrSer: 2.056 ± 0.358
TyrThr: 1.199 ± 0.586
TyrVal: 1.713 ± 0.607
TyrTrp: 0.685 ± 0.264
TyrTyr: 0.343 ± 0.228
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.171 ± 0.167
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.171 ± 0.211
XaaMet: 0.171 ± 0.211
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.171 ± 0.167
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 39.061 ± 37.996
Statistics based on 14 proteins (5838 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
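One way to read this note, sketched below under stated assumptions: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread across replicates is reported as the error. The exact procedure used by the database is not given here; the function names, the use of the population standard deviation, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples.

    Assumption: the error is the standard deviation over the replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(sample, pair))
    return statistics.pstdev(replicates)

toy = ["MKAAL", "AAG", "GLLK", "MAAAK"]  # hypothetical sequences
estimate = pair_permille(toy, "AA")
error = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects how much the estimate depends on which proteins happen to be in the set.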

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski