Amino acid dipepetide frequency for Influenza A virus (A/mallard/MN/51/1998(H2N3))

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.693AlaAla: 3.693 ± 1.081
0.923AlaCys: 0.923 ± 0.441
2.539AlaAsp: 2.539 ± 0.484
3.232AlaGlu: 3.232 ± 0.795
1.385AlaPhe: 1.385 ± 0.664
4.848AlaGly: 4.848 ± 1.128
0.693AlaHis: 0.693 ± 0.424
4.617AlaIle: 4.617 ± 0.982
3.001AlaLys: 3.001 ± 0.573
5.309AlaLeu: 5.309 ± 1.124
2.77AlaMet: 2.77 ± 0.771
3.001AlaAsn: 3.001 ± 0.729
2.308AlaPro: 2.308 ± 0.437
1.847AlaGln: 1.847 ± 0.603
3.001AlaArg: 3.001 ± 0.645
4.386AlaSer: 4.386 ± 1.421
4.848AlaThr: 4.848 ± 0.961
3.001AlaVal: 3.001 ± 0.722
0.923AlaTrp: 0.923 ± 0.487
0.923AlaTyr: 0.923 ± 0.349
0.0AlaXaa: 0.0 ± 0.0
Cys
0.693CysAla: 0.693 ± 0.318
0.231CysCys: 0.231 ± 0.192
0.923CysAsp: 0.923 ± 0.556
0.923CysGlu: 0.923 ± 0.288
1.616CysPhe: 1.616 ± 0.722
0.231CysGly: 0.231 ± 0.211
0.693CysHis: 0.693 ± 0.281
1.616CysIle: 1.616 ± 0.71
0.923CysLys: 0.923 ± 0.382
1.385CysLeu: 1.385 ± 0.521
1.154CysMet: 1.154 ± 0.31
0.923CysAsn: 0.923 ± 0.383
0.462CysPro: 0.462 ± 0.278
0.462CysGln: 0.462 ± 0.278
1.385CysArg: 1.385 ± 0.681
1.847CysSer: 1.847 ± 0.681
0.462CysThr: 0.462 ± 0.256
0.923CysVal: 0.923 ± 0.324
0.462CysTrp: 0.462 ± 0.256
0.462CysTyr: 0.462 ± 0.278
0.0CysXaa: 0.0 ± 0.0
Asp
2.77AspAla: 2.77 ± 0.511
1.616AspCys: 1.616 ± 0.303
1.847AspAsp: 1.847 ± 0.428
3.232AspGlu: 3.232 ± 0.632
2.078AspPhe: 2.078 ± 0.786
3.001AspGly: 3.001 ± 1.091
0.923AspHis: 0.923 ± 0.479
1.847AspIle: 1.847 ± 0.554
2.078AspLys: 2.078 ± 0.53
2.77AspLeu: 2.77 ± 0.675
1.847AspMet: 1.847 ± 0.523
3.463AspAsn: 3.463 ± 1.151
3.463AspPro: 3.463 ± 1.013
2.308AspGln: 2.308 ± 0.92
2.308AspArg: 2.308 ± 0.47
2.308AspSer: 2.308 ± 0.467
2.308AspThr: 2.308 ± 0.589
4.155AspVal: 4.155 ± 0.572
0.693AspTrp: 0.693 ± 0.315
1.616AspTyr: 1.616 ± 0.532
0.0AspXaa: 0.0 ± 0.0
Glu
2.539GluAla: 2.539 ± 0.426
1.385GluCys: 1.385 ± 0.717
4.155GluAsp: 4.155 ± 0.75
6.233GluGlu: 6.233 ± 1.059
2.308GluPhe: 2.308 ± 0.624
4.386GluGly: 4.386 ± 1.339
0.923GluHis: 0.923 ± 0.623
5.078GluIle: 5.078 ± 0.688
6.233GluLys: 6.233 ± 1.342
5.309GluLeu: 5.309 ± 0.711
2.078GluMet: 2.078 ± 0.641
4.617GluAsn: 4.617 ± 1.146
2.539GluPro: 2.539 ± 1.093
3.693GluGln: 3.693 ± 1.008
5.078GluArg: 5.078 ± 1.115
6.233GluSer: 6.233 ± 1.295
4.155GluThr: 4.155 ± 0.547
4.155GluVal: 4.155 ± 1.362
1.385GluTrp: 1.385 ± 0.45
1.385GluTyr: 1.385 ± 0.268
0.0GluXaa: 0.0 ± 0.0
Phe
1.847PheAla: 1.847 ± 0.49
0.231PheCys: 0.231 ± 0.211
1.385PheAsp: 1.385 ± 0.435
5.078PheGlu: 5.078 ± 0.999
1.616PhePhe: 1.616 ± 0.461
1.847PheGly: 1.847 ± 0.266
1.385PheHis: 1.385 ± 0.569
1.616PheIle: 1.616 ± 0.56
1.154PheLys: 1.154 ± 0.442
3.924PheLeu: 3.924 ± 0.762
0.693PheMet: 0.693 ± 0.345
2.539PheAsn: 2.539 ± 0.722
1.154PhePro: 1.154 ± 0.513
2.308PheGln: 2.308 ± 0.719
1.847PheArg: 1.847 ± 0.296
3.001PheSer: 3.001 ± 0.55
2.308PheThr: 2.308 ± 0.497
2.308PheVal: 2.308 ± 0.7
0.462PheTrp: 0.462 ± 0.293
1.154PheTyr: 1.154 ± 0.441
0.0PheXaa: 0.0 ± 0.0
Gly
2.539GlyAla: 2.539 ± 0.62
0.462GlyCys: 0.462 ± 0.269
3.001GlyAsp: 3.001 ± 0.254
3.924GlyGlu: 3.924 ± 1.346
3.924GlyPhe: 3.924 ± 0.876
3.693GlyGly: 3.693 ± 0.81
0.462GlyHis: 0.462 ± 0.309
4.848GlyIle: 4.848 ± 1.048
3.693GlyLys: 3.693 ± 0.462
4.155GlyLeu: 4.155 ± 0.816
1.847GlyMet: 1.847 ± 0.463
4.155GlyAsn: 4.155 ± 1.072
3.001GlyPro: 3.001 ± 0.813
2.308GlyGln: 2.308 ± 0.584
5.54GlyArg: 5.54 ± 1.121
5.078GlySer: 5.078 ± 1.496
5.771GlyThr: 5.771 ± 0.697
4.617GlyVal: 4.617 ± 0.55
1.154GlyTrp: 1.154 ± 0.659
2.539GlyTyr: 2.539 ± 0.758
0.0GlyXaa: 0.0 ± 0.0
His
1.154HisAla: 1.154 ± 0.391
0.462HisCys: 0.462 ± 0.363
0.231HisAsp: 0.231 ± 0.225
1.385HisGlu: 1.385 ± 0.394
1.154HisPhe: 1.154 ± 0.341
0.923HisGly: 0.923 ± 0.394
0.462HisHis: 0.462 ± 0.45
1.385HisIle: 1.385 ± 0.847
1.616HisLys: 1.616 ± 0.469
1.154HisLeu: 1.154 ± 0.364
0.462HisMet: 0.462 ± 0.28
0.462HisAsn: 0.462 ± 0.45
1.616HisPro: 1.616 ± 0.501
0.693HisGln: 0.693 ± 0.279
1.154HisArg: 1.154 ± 0.475
1.616HisSer: 1.616 ± 0.482
0.462HisThr: 0.462 ± 0.33
0.462HisVal: 0.462 ± 0.284
0.0HisTrp: 0.0 ± 0.0
0.231HisTyr: 0.231 ± 0.22
0.0HisXaa: 0.0 ± 0.0
Ile
4.386IleAla: 4.386 ± 1.035
2.078IleCys: 2.078 ± 0.504
3.463IleAsp: 3.463 ± 0.934
6.002IleGlu: 6.002 ± 2.132
0.923IlePhe: 0.923 ± 0.32
5.309IleGly: 5.309 ± 1.005
0.923IleHis: 0.923 ± 0.465
3.693IleIle: 3.693 ± 1.282
4.617IleLys: 4.617 ± 0.709
6.002IleLeu: 6.002 ± 1.55
2.308IleMet: 2.308 ± 0.566
4.155IleAsn: 4.155 ± 0.575
2.078IlePro: 2.078 ± 0.629
2.078IleGln: 2.078 ± 0.461
5.309IleArg: 5.309 ± 1.374
2.308IleSer: 2.308 ± 0.412
4.155IleThr: 4.155 ± 1.082
3.693IleVal: 3.693 ± 0.698
0.693IleTrp: 0.693 ± 0.455
1.154IleTyr: 1.154 ± 0.375
0.0IleXaa: 0.0 ± 0.0
Lys
4.386LysAla: 4.386 ± 1.061
1.154LysCys: 1.154 ± 0.519
3.693LysAsp: 3.693 ± 1.094
5.54LysGlu: 5.54 ± 0.778
1.385LysPhe: 1.385 ± 0.595
3.463LysGly: 3.463 ± 0.66
0.923LysHis: 0.923 ± 0.25
3.924LysIle: 3.924 ± 0.694
3.693LysLys: 3.693 ± 1.713
4.386LysLeu: 4.386 ± 0.967
3.463LysMet: 3.463 ± 0.513
1.847LysAsn: 1.847 ± 0.593
0.462LysPro: 0.462 ± 0.399
1.847LysGln: 1.847 ± 1.17
5.54LysArg: 5.54 ± 1.62
3.001LysSer: 3.001 ± 0.825
4.617LysThr: 4.617 ± 1.058
2.77LysVal: 2.77 ± 0.742
1.847LysTrp: 1.847 ± 0.59
1.847LysTyr: 1.847 ± 0.316
0.0LysXaa: 0.0 ± 0.0
Leu
5.078LeuAla: 5.078 ± 0.699
0.923LeuCys: 0.923 ± 0.353
1.616LeuAsp: 1.616 ± 0.699
7.618LeuGlu: 7.618 ± 1.476
1.847LeuPhe: 1.847 ± 0.536
4.386LeuGly: 4.386 ± 0.724
0.462LeuHis: 0.462 ± 0.254
6.233LeuIle: 6.233 ± 0.865
5.771LeuLys: 5.771 ± 1.33
6.925LeuLeu: 6.925 ± 1.555
2.078LeuMet: 2.078 ± 0.36
3.693LeuAsn: 3.693 ± 1.062
4.155LeuPro: 4.155 ± 0.612
2.77LeuGln: 2.77 ± 0.484
5.771LeuArg: 5.771 ± 1.462
5.309LeuSer: 5.309 ± 0.796
6.233LeuThr: 6.233 ± 1.826
3.693LeuVal: 3.693 ± 0.953
0.923LeuTrp: 0.923 ± 0.289
2.77LeuTyr: 2.77 ± 0.989
0.0LeuXaa: 0.0 ± 0.0
Met
3.463MetAla: 3.463 ± 0.693
1.154MetCys: 1.154 ± 0.63
3.232MetAsp: 3.232 ± 0.992
5.309MetGlu: 5.309 ± 0.784
1.154MetPhe: 1.154 ± 0.746
1.847MetGly: 1.847 ± 0.862
0.693MetHis: 0.693 ± 0.363
3.001MetIle: 3.001 ± 0.572
2.539MetLys: 2.539 ± 0.805
1.616MetLeu: 1.616 ± 0.409
1.847MetMet: 1.847 ± 0.755
0.693MetAsn: 0.693 ± 0.46
0.693MetPro: 0.693 ± 0.315
1.154MetGln: 1.154 ± 0.318
2.078MetArg: 2.078 ± 0.635
2.078MetSer: 2.078 ± 0.497
2.308MetThr: 2.308 ± 0.597
3.232MetVal: 3.232 ± 1.045
0.231MetTrp: 0.231 ± 0.22
0.693MetTyr: 0.693 ± 0.232
0.0MetXaa: 0.0 ± 0.0
Asn
3.693AsnAla: 3.693 ± 0.891
0.693AsnCys: 0.693 ± 0.439
3.693AsnAsp: 3.693 ± 0.857
4.155AsnGlu: 4.155 ± 0.869
1.847AsnPhe: 1.847 ± 0.458
4.386AsnGly: 4.386 ± 1.075
0.462AsnHis: 0.462 ± 0.279
2.308AsnIle: 2.308 ± 0.397
3.463AsnLys: 3.463 ± 0.612
3.924AsnLeu: 3.924 ± 0.632
3.001AsnMet: 3.001 ± 0.592
2.77AsnAsn: 2.77 ± 1.005
4.617AsnPro: 4.617 ± 0.614
1.616AsnGln: 1.616 ± 0.586
2.539AsnArg: 2.539 ± 0.616
3.232AsnSer: 3.232 ± 0.704
4.386AsnThr: 4.386 ± 0.805
2.77AsnVal: 2.77 ± 0.934
0.923AsnTrp: 0.923 ± 0.511
1.154AsnTyr: 1.154 ± 0.364
0.0AsnXaa: 0.0 ± 0.0
Pro
3.463ProAla: 3.463 ± 1.046
0.462ProCys: 0.462 ± 0.291
0.923ProAsp: 0.923 ± 0.398
3.463ProGlu: 3.463 ± 0.759
2.77ProPhe: 2.77 ± 0.482
2.539ProGly: 2.539 ± 0.411
0.462ProHis: 0.462 ± 0.399
2.539ProIle: 2.539 ± 0.266
3.463ProLys: 3.463 ± 0.692
4.155ProLeu: 4.155 ± 0.684
0.923ProMet: 0.923 ± 0.556
3.693ProAsn: 3.693 ± 1.068
1.616ProPro: 1.616 ± 0.523
0.693ProGln: 0.693 ± 0.284
1.847ProArg: 1.847 ± 0.62
3.232ProSer: 3.232 ± 0.646
1.847ProThr: 1.847 ± 0.755
1.385ProVal: 1.385 ± 0.643
0.693ProTrp: 0.693 ± 0.411
0.923ProTyr: 0.923 ± 0.558
0.0ProXaa: 0.0 ± 0.0
Gln
2.078GlnAla: 2.078 ± 0.879
0.923GlnCys: 0.923 ± 0.465
1.385GlnAsp: 1.385 ± 0.63
1.616GlnGlu: 1.616 ± 0.726
0.462GlnPhe: 0.462 ± 0.271
3.001GlnGly: 3.001 ± 0.868
0.693GlnHis: 0.693 ± 0.328
3.232GlnIle: 3.232 ± 0.494
2.539GlnLys: 2.539 ± 0.792
2.77GlnLeu: 2.77 ± 0.516
2.77GlnMet: 2.77 ± 0.848
2.078GlnAsn: 2.078 ± 0.655
1.154GlnPro: 1.154 ± 0.463
0.923GlnGln: 0.923 ± 0.225
4.155GlnArg: 4.155 ± 0.841
2.77GlnSer: 2.77 ± 1.036
3.001GlnThr: 3.001 ± 0.865
1.616GlnVal: 1.616 ± 0.599
0.923GlnTrp: 0.923 ± 0.503
0.693GlnTyr: 0.693 ± 0.232
0.0GlnXaa: 0.0 ± 0.0
Arg
3.924ArgAla: 3.924 ± 0.865
0.462ArgCys: 0.462 ± 0.242
3.001ArgAsp: 3.001 ± 0.867
2.539ArgGlu: 2.539 ± 0.831
2.77ArgPhe: 2.77 ± 0.695
6.694ArgGly: 6.694 ± 1.199
0.462ArgHis: 0.462 ± 0.322
3.924ArgIle: 3.924 ± 0.84
2.539ArgLys: 2.539 ± 0.62
5.309ArgLeu: 5.309 ± 0.545
3.924ArgMet: 3.924 ± 1.26
5.078ArgAsn: 5.078 ± 1.196
3.001ArgPro: 3.001 ± 0.856
2.77ArgGln: 2.77 ± 0.707
5.771ArgArg: 5.771 ± 1.054
4.386ArgSer: 4.386 ± 1.212
6.233ArgThr: 6.233 ± 0.856
3.001ArgVal: 3.001 ± 1.189
0.231ArgTrp: 0.231 ± 0.279
1.385ArgTyr: 1.385 ± 0.547
0.0ArgXaa: 0.0 ± 0.0
Ser
3.001SerAla: 3.001 ± 1.022
1.847SerCys: 1.847 ± 0.868
3.001SerAsp: 3.001 ± 0.579
2.77SerGlu: 2.77 ± 0.853
3.924SerPhe: 3.924 ± 0.588
5.771SerGly: 5.771 ± 1.056
1.616SerHis: 1.616 ± 0.695
4.617SerIle: 4.617 ± 0.634
3.001SerLys: 3.001 ± 0.663
5.771SerLeu: 5.771 ± 1.199
2.078SerMet: 2.078 ± 0.925
4.386SerAsn: 4.386 ± 1.41
2.539SerPro: 2.539 ± 0.686
3.693SerGln: 3.693 ± 1.029
3.001SerArg: 3.001 ± 0.693
6.464SerSer: 6.464 ± 1.434
5.078SerThr: 5.078 ± 0.854
3.924SerVal: 3.924 ± 1.131
1.616SerTrp: 1.616 ± 0.683
1.847SerTyr: 1.847 ± 0.679
0.0SerXaa: 0.0 ± 0.0
Thr
3.693ThrAla: 3.693 ± 0.651
0.923ThrCys: 0.923 ± 0.322
2.539ThrAsp: 2.539 ± 0.793
4.848ThrGlu: 4.848 ± 0.903
2.539ThrPhe: 2.539 ± 0.348
4.617ThrGly: 4.617 ± 0.973
2.77ThrHis: 2.77 ± 0.73
5.54ThrIle: 5.54 ± 1.37
4.617ThrLys: 4.617 ± 0.634
5.54ThrLeu: 5.54 ± 1.158
2.308ThrMet: 2.308 ± 0.551
2.539ThrAsn: 2.539 ± 0.51
2.078ThrPro: 2.078 ± 0.566
3.693ThrGln: 3.693 ± 1.204
4.386ThrArg: 4.386 ± 0.787
3.463ThrSer: 3.463 ± 0.685
4.617ThrThr: 4.617 ± 1.021
5.309ThrVal: 5.309 ± 1.183
0.693ThrTrp: 0.693 ± 0.251
3.001ThrTyr: 3.001 ± 0.772
0.0ThrXaa: 0.0 ± 0.0
Val
3.232ValAla: 3.232 ± 0.69
1.385ValCys: 1.385 ± 0.584
3.001ValAsp: 3.001 ± 0.824
2.77ValGlu: 2.77 ± 0.573
2.308ValPhe: 2.308 ± 0.698
3.232ValGly: 3.232 ± 0.88
0.923ValHis: 0.923 ± 0.615
2.078ValIle: 2.078 ± 0.738
3.232ValLys: 3.232 ± 0.745
5.54ValLeu: 5.54 ± 1.684
1.385ValMet: 1.385 ± 0.662
3.232ValAsn: 3.232 ± 0.632
2.539ValPro: 2.539 ± 0.625
2.078ValGln: 2.078 ± 0.831
4.155ValArg: 4.155 ± 1.403
5.771ValSer: 5.771 ± 0.647
3.232ValThr: 3.232 ± 1.019
3.001ValVal: 3.001 ± 0.565
0.923ValTrp: 0.923 ± 0.481
1.847ValTyr: 1.847 ± 0.325
0.0ValXaa: 0.0 ± 0.0
Trp
0.462TrpAla: 0.462 ± 0.256
0.0TrpCys: 0.0 ± 0.0
0.693TrpAsp: 0.693 ± 0.284
1.385TrpGlu: 1.385 ± 0.604
0.693TrpPhe: 0.693 ± 0.311
0.923TrpGly: 0.923 ± 0.333
0.462TrpHis: 0.462 ± 0.306
1.385TrpIle: 1.385 ± 0.491
0.923TrpLys: 0.923 ± 0.551
1.154TrpLeu: 1.154 ± 0.534
1.385TrpMet: 1.385 ± 0.515
0.693TrpAsn: 0.693 ± 0.311
0.462TrpPro: 0.462 ± 0.291
0.231TrpGln: 0.231 ± 0.225
0.693TrpArg: 0.693 ± 0.453
1.385TrpSer: 1.385 ± 0.611
2.078TrpThr: 2.078 ± 0.808
0.231TrpVal: 0.231 ± 0.2
0.462TrpTrp: 0.462 ± 0.254
0.231TrpTyr: 0.231 ± 0.225
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.154TyrAla: 1.154 ± 0.458
0.462TyrCys: 0.462 ± 0.291
2.308TyrAsp: 2.308 ± 0.735
1.847TyrGlu: 1.847 ± 0.617
1.154TyrPhe: 1.154 ± 0.327
1.616TyrGly: 1.616 ± 0.379
0.693TyrHis: 0.693 ± 0.676
1.847TyrIle: 1.847 ± 0.296
0.693TyrLys: 0.693 ± 0.271
1.154TyrLeu: 1.154 ± 0.43
0.462TyrMet: 0.462 ± 0.254
1.616TyrAsn: 1.616 ± 0.471
1.154TyrPro: 1.154 ± 0.705
1.616TyrGln: 1.616 ± 0.469
1.847TyrArg: 1.847 ± 0.907
2.078TyrSer: 2.078 ± 0.53
1.616TyrThr: 1.616 ± 0.721
1.847TyrVal: 1.847 ± 0.767
0.693TyrTrp: 0.693 ± 0.316
0.231TyrTyr: 0.231 ± 0.2
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (4333 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski