Amino acid dipepetide frequency for Influenza A virus (strain A/Gull/Minnesota/945/1980 H13N6)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.188AlaAla: 4.188 ± 1.077
1.047AlaCys: 1.047 ± 0.543
2.723AlaAsp: 2.723 ± 0.672
3.351AlaGlu: 3.351 ± 0.722
2.094AlaPhe: 2.094 ± 0.883
3.56AlaGly: 3.56 ± 1.032
0.838AlaHis: 0.838 ± 0.439
3.979AlaIle: 3.979 ± 0.631
2.513AlaLys: 2.513 ± 0.534
5.864AlaLeu: 5.864 ± 0.905
2.723AlaMet: 2.723 ± 0.712
4.188AlaAsn: 4.188 ± 0.992
2.932AlaPro: 2.932 ± 0.613
1.466AlaGln: 1.466 ± 0.501
2.723AlaArg: 2.723 ± 0.598
4.817AlaSer: 4.817 ± 1.285
5.026AlaThr: 5.026 ± 0.598
2.932AlaVal: 2.932 ± 0.747
0.838AlaTrp: 0.838 ± 0.467
0.838AlaTyr: 0.838 ± 0.297
0.0AlaXaa: 0.0 ± 0.0
Cys
0.628CysAla: 0.628 ± 0.338
0.419CysCys: 0.419 ± 0.368
0.419CysAsp: 0.419 ± 0.364
0.838CysGlu: 0.838 ± 0.289
1.675CysPhe: 1.675 ± 0.634
0.209CysGly: 0.209 ± 0.182
1.047CysHis: 1.047 ± 0.392
1.675CysIle: 1.675 ± 0.664
0.838CysLys: 0.838 ± 0.361
1.047CysLeu: 1.047 ± 0.388
1.047CysMet: 1.047 ± 0.318
1.885CysAsn: 1.885 ± 0.625
0.628CysPro: 0.628 ± 0.299
0.209CysGln: 0.209 ± 0.2
1.675CysArg: 1.675 ± 0.685
1.257CysSer: 1.257 ± 0.519
1.047CysThr: 1.047 ± 0.319
1.257CysVal: 1.257 ± 0.404
0.209CysTrp: 0.209 ± 0.169
0.419CysTyr: 0.419 ± 0.244
0.0CysXaa: 0.0 ± 0.0
Asp
3.141AspAla: 3.141 ± 0.366
1.047AspCys: 1.047 ± 0.271
1.257AspAsp: 1.257 ± 0.364
3.56AspGlu: 3.56 ± 0.607
2.094AspPhe: 2.094 ± 0.789
2.723AspGly: 2.723 ± 0.814
0.419AspHis: 0.419 ± 0.272
1.675AspIle: 1.675 ± 0.484
2.094AspLys: 2.094 ± 0.505
3.77AspLeu: 3.77 ± 0.822
1.885AspMet: 1.885 ± 0.383
3.141AspAsn: 3.141 ± 0.831
4.607AspPro: 4.607 ± 1.053
1.885AspGln: 1.885 ± 0.622
2.723AspArg: 2.723 ± 0.518
3.141AspSer: 3.141 ± 0.776
1.675AspThr: 1.675 ± 0.51
3.351AspVal: 3.351 ± 0.537
0.419AspTrp: 0.419 ± 0.28
1.466AspTyr: 1.466 ± 0.565
0.0AspXaa: 0.0 ± 0.0
Glu
2.513GluAla: 2.513 ± 0.689
1.257GluCys: 1.257 ± 0.776
4.817GluAsp: 4.817 ± 0.743
6.492GluGlu: 6.492 ± 0.776
1.675GluPhe: 1.675 ± 0.565
4.817GluGly: 4.817 ± 1.23
0.838GluHis: 0.838 ± 0.543
4.607GluIle: 4.607 ± 0.697
5.026GluLys: 5.026 ± 1.404
5.654GluLeu: 5.654 ± 0.939
2.932GluMet: 2.932 ± 0.905
3.77GluAsn: 3.77 ± 0.961
2.513GluPro: 2.513 ± 0.986
4.188GluGln: 4.188 ± 1.168
4.817GluArg: 4.817 ± 1.192
7.539GluSer: 7.539 ± 1.36
3.979GluThr: 3.979 ± 0.596
5.654GluVal: 5.654 ± 1.182
0.838GluTrp: 0.838 ± 0.359
1.885GluTyr: 1.885 ± 0.437
0.0GluXaa: 0.0 ± 0.0
Phe
2.304PheAla: 2.304 ± 0.983
0.0PheCys: 0.0 ± 0.0
1.047PheAsp: 1.047 ± 0.471
5.026PheGlu: 5.026 ± 1.092
1.257PhePhe: 1.257 ± 0.395
1.675PheGly: 1.675 ± 0.38
0.628PheHis: 0.628 ± 0.338
2.094PheIle: 2.094 ± 0.633
1.047PheLys: 1.047 ± 0.493
3.979PheLeu: 3.979 ± 0.858
0.838PheMet: 0.838 ± 0.425
2.094PheAsn: 2.094 ± 0.646
0.628PhePro: 0.628 ± 0.382
2.932PheGln: 2.932 ± 0.74
1.466PheArg: 1.466 ± 0.342
3.77PheSer: 3.77 ± 0.525
2.723PheThr: 2.723 ± 0.55
2.932PheVal: 2.932 ± 0.781
0.209PheTrp: 0.209 ± 0.237
1.047PheTyr: 1.047 ± 0.417
0.0PheXaa: 0.0 ± 0.0
Gly
2.932GlyAla: 2.932 ± 0.981
0.628GlyCys: 0.628 ± 0.281
2.932GlyAsp: 2.932 ± 0.528
5.236GlyGlu: 5.236 ± 1.439
2.513GlyPhe: 2.513 ± 0.595
3.77GlyGly: 3.77 ± 0.815
0.628GlyHis: 0.628 ± 0.428
5.864GlyIle: 5.864 ± 1.101
3.141GlyLys: 3.141 ± 0.457
5.026GlyLeu: 5.026 ± 1.099
1.885GlyMet: 1.885 ± 0.472
3.141GlyAsn: 3.141 ± 0.983
3.56GlyPro: 3.56 ± 0.835
2.304GlyGln: 2.304 ± 0.468
5.445GlyArg: 5.445 ± 1.338
2.304GlySer: 2.304 ± 0.739
5.654GlyThr: 5.654 ± 0.648
4.398GlyVal: 4.398 ± 0.656
1.257GlyTrp: 1.257 ± 0.605
2.723GlyTyr: 2.723 ± 0.877
0.0GlyXaa: 0.0 ± 0.0
His
0.628HisAla: 0.628 ± 0.267
0.209HisCys: 0.209 ± 0.185
0.838HisAsp: 0.838 ± 0.546
1.257HisGlu: 1.257 ± 0.385
1.257HisPhe: 1.257 ± 0.497
1.047HisGly: 1.047 ± 0.43
0.209HisHis: 0.209 ± 0.2
2.304HisIle: 2.304 ± 0.972
1.257HisLys: 1.257 ± 0.451
1.885HisLeu: 1.885 ± 0.563
0.209HisMet: 0.209 ± 0.169
0.0HisAsn: 0.0 ± 0.0
1.257HisPro: 1.257 ± 0.543
0.838HisGln: 0.838 ± 0.249
1.675HisArg: 1.675 ± 0.668
1.047HisSer: 1.047 ± 0.436
0.838HisThr: 0.838 ± 0.416
0.209HisVal: 0.209 ± 0.23
0.0HisTrp: 0.0 ± 0.0
0.628HisTyr: 0.628 ± 0.304
0.0HisXaa: 0.0 ± 0.0
Ile
3.77IleAla: 3.77 ± 0.812
2.513IleCys: 2.513 ± 0.681
4.188IleAsp: 4.188 ± 1.01
7.33IleGlu: 7.33 ± 1.703
1.047IlePhe: 1.047 ± 0.319
4.188IleGly: 4.188 ± 1.047
0.838IleHis: 0.838 ± 0.327
4.398IleIle: 4.398 ± 1.06
4.188IleLys: 4.188 ± 0.925
6.073IleLeu: 6.073 ± 1.843
1.885IleMet: 1.885 ± 0.445
3.77IleAsn: 3.77 ± 0.999
1.675IlePro: 1.675 ± 0.603
1.885IleGln: 1.885 ± 0.609
6.492IleArg: 6.492 ± 1.25
2.932IleSer: 2.932 ± 0.867
4.188IleThr: 4.188 ± 0.868
3.77IleVal: 3.77 ± 0.647
0.628IleTrp: 0.628 ± 0.49
1.675IleTyr: 1.675 ± 0.513
0.0IleXaa: 0.0 ± 0.0
Lys
4.817LysAla: 4.817 ± 0.89
1.466LysCys: 1.466 ± 0.442
2.304LysAsp: 2.304 ± 0.456
5.026LysGlu: 5.026 ± 1.11
1.885LysPhe: 1.885 ± 0.681
3.351LysGly: 3.351 ± 0.699
0.628LysHis: 0.628 ± 0.317
4.398LysIle: 4.398 ± 0.953
3.77LysLys: 3.77 ± 1.684
5.236LysLeu: 5.236 ± 1.297
3.141LysMet: 3.141 ± 0.644
1.885LysAsn: 1.885 ± 0.581
0.838LysPro: 0.838 ± 0.451
1.257LysGln: 1.257 ± 0.602
4.398LysArg: 4.398 ± 1.308
2.304LysSer: 2.304 ± 0.566
3.979LysThr: 3.979 ± 1.122
1.885LysVal: 1.885 ± 0.422
1.885LysTrp: 1.885 ± 0.55
2.094LysTyr: 2.094 ± 0.43
0.0LysXaa: 0.0 ± 0.0
Leu
4.817LeuAla: 4.817 ± 0.923
1.257LeuCys: 1.257 ± 0.567
2.513LeuAsp: 2.513 ± 0.909
6.702LeuGlu: 6.702 ± 1.073
2.094LeuPhe: 2.094 ± 0.617
3.56LeuGly: 3.56 ± 0.645
1.466LeuHis: 1.466 ± 0.492
6.702LeuIle: 6.702 ± 1.003
7.539LeuLys: 7.539 ± 1.295
7.33LeuLeu: 7.33 ± 1.76
1.885LeuMet: 1.885 ± 0.434
3.351LeuAsn: 3.351 ± 1.017
3.351LeuPro: 3.351 ± 0.821
2.932LeuGln: 2.932 ± 0.644
5.654LeuArg: 5.654 ± 1.179
5.026LeuSer: 5.026 ± 0.831
5.026LeuThr: 5.026 ± 1.58
4.817LeuVal: 4.817 ± 1.067
1.466LeuTrp: 1.466 ± 0.457
2.723LeuTyr: 2.723 ± 0.945
0.0LeuXaa: 0.0 ± 0.0
Met
3.77MetAla: 3.77 ± 0.843
0.838MetCys: 0.838 ± 0.684
3.351MetAsp: 3.351 ± 0.866
4.398MetGlu: 4.398 ± 0.795
1.257MetPhe: 1.257 ± 0.675
2.094MetGly: 2.094 ± 0.802
0.628MetHis: 0.628 ± 0.392
2.723MetIle: 2.723 ± 0.656
2.513MetLys: 2.513 ± 0.889
1.466MetLeu: 1.466 ± 0.415
1.675MetMet: 1.675 ± 0.592
0.838MetAsn: 0.838 ± 0.386
0.209MetPro: 0.209 ± 0.184
1.257MetGln: 1.257 ± 0.524
1.885MetArg: 1.885 ± 0.647
2.723MetSer: 2.723 ± 0.572
1.885MetThr: 1.885 ± 0.63
2.932MetVal: 2.932 ± 1.021
0.838MetTrp: 0.838 ± 0.42
0.838MetTyr: 0.838 ± 0.303
0.0MetXaa: 0.0 ± 0.0
Asn
4.188AsnAla: 4.188 ± 0.871
0.838AsnCys: 0.838 ± 0.378
2.932AsnAsp: 2.932 ± 0.596
3.56AsnGlu: 3.56 ± 1.014
1.885AsnPhe: 1.885 ± 0.409
4.817AsnGly: 4.817 ± 1.284
0.628AsnHis: 0.628 ± 0.439
3.141AsnIle: 3.141 ± 0.891
3.141AsnLys: 3.141 ± 0.662
3.141AsnLeu: 3.141 ± 0.495
2.304AsnMet: 2.304 ± 0.573
3.351AsnAsn: 3.351 ± 1.443
4.188AsnPro: 4.188 ± 0.738
1.675AsnGln: 1.675 ± 0.553
2.932AsnArg: 2.932 ± 0.728
3.979AsnSer: 3.979 ± 0.994
5.026AsnThr: 5.026 ± 0.856
2.513AsnVal: 2.513 ± 0.814
0.838AsnTrp: 0.838 ± 0.424
0.838AsnTyr: 0.838 ± 0.387
0.0AsnXaa: 0.0 ± 0.0
Pro
3.141ProAla: 3.141 ± 0.962
0.628ProCys: 0.628 ± 0.324
1.466ProAsp: 1.466 ± 0.57
2.932ProGlu: 2.932 ± 0.694
1.885ProPhe: 1.885 ± 0.579
2.723ProGly: 2.723 ± 0.678
0.628ProHis: 0.628 ± 0.389
2.723ProIle: 2.723 ± 0.366
3.141ProLys: 3.141 ± 0.836
2.932ProLeu: 2.932 ± 0.828
1.257ProMet: 1.257 ± 0.711
3.979ProAsn: 3.979 ± 1.147
1.047ProPro: 1.047 ± 0.391
0.628ProGln: 0.628 ± 0.248
1.675ProArg: 1.675 ± 0.656
3.141ProSer: 3.141 ± 0.73
1.047ProThr: 1.047 ± 0.483
2.304ProVal: 2.304 ± 0.682
0.419ProTrp: 0.419 ± 0.277
1.047ProTyr: 1.047 ± 0.463
0.0ProXaa: 0.0 ± 0.0
Gln
2.513GlnAla: 2.513 ± 0.78
0.419GlnCys: 0.419 ± 0.294
1.047GlnAsp: 1.047 ± 0.36
2.723GlnGlu: 2.723 ± 0.853
0.419GlnPhe: 0.419 ± 0.286
2.513GlnGly: 2.513 ± 0.732
0.628GlnHis: 0.628 ± 0.312
2.723GlnIle: 2.723 ± 0.612
2.304GlnLys: 2.304 ± 0.689
3.56GlnLeu: 3.56 ± 0.962
2.304GlnMet: 2.304 ± 0.797
2.932GlnAsn: 2.932 ± 0.731
0.628GlnPro: 0.628 ± 0.388
1.257GlnGln: 1.257 ± 0.374
3.979GlnArg: 3.979 ± 0.854
3.351GlnSer: 3.351 ± 0.878
2.304GlnThr: 2.304 ± 0.888
2.094GlnVal: 2.094 ± 0.697
0.419GlnTrp: 0.419 ± 0.338
0.628GlnTyr: 0.628 ± 0.245
0.0GlnXaa: 0.0 ± 0.0
Arg
4.398ArgAla: 4.398 ± 0.876
1.047ArgCys: 1.047 ± 0.442
3.56ArgAsp: 3.56 ± 0.619
3.351ArgGlu: 3.351 ± 0.891
2.932ArgPhe: 2.932 ± 0.686
6.073ArgGly: 6.073 ± 1.201
1.047ArgHis: 1.047 ± 0.382
4.817ArgIle: 4.817 ± 0.869
2.723ArgLys: 2.723 ± 0.704
4.817ArgLeu: 4.817 ± 0.858
3.979ArgMet: 3.979 ± 1.474
4.817ArgAsn: 4.817 ± 1.127
2.723ArgPro: 2.723 ± 0.787
3.351ArgGln: 3.351 ± 0.745
5.445ArgArg: 5.445 ± 1.092
4.188ArgSer: 4.188 ± 1.014
6.492ArgThr: 6.492 ± 1.066
2.094ArgVal: 2.094 ± 0.827
0.209ArgTrp: 0.209 ± 0.243
1.466ArgTyr: 1.466 ± 0.633
0.0ArgXaa: 0.0 ± 0.0
Ser
3.141SerAla: 3.141 ± 0.8
1.885SerCys: 1.885 ± 0.6
2.932SerAsp: 2.932 ± 0.575
3.77SerGlu: 3.77 ± 0.967
4.607SerPhe: 4.607 ± 0.994
5.445SerGly: 5.445 ± 0.806
1.675SerHis: 1.675 ± 0.643
5.864SerIle: 5.864 ± 0.82
3.56SerLys: 3.56 ± 0.721
5.026SerLeu: 5.026 ± 1.016
2.304SerMet: 2.304 ± 0.867
2.932SerAsn: 2.932 ± 0.833
2.723SerPro: 2.723 ± 0.549
4.398SerGln: 4.398 ± 0.638
3.979SerArg: 3.979 ± 0.786
7.33SerSer: 7.33 ± 1.363
4.607SerThr: 4.607 ± 1.196
2.513SerVal: 2.513 ± 0.774
2.723SerTrp: 2.723 ± 1.155
1.466SerTyr: 1.466 ± 0.61
0.0SerXaa: 0.0 ± 0.0
Thr
3.56ThrAla: 3.56 ± 0.724
1.047ThrCys: 1.047 ± 0.453
2.094ThrAsp: 2.094 ± 0.464
4.188ThrGlu: 4.188 ± 0.781
2.932ThrPhe: 2.932 ± 0.537
5.445ThrGly: 5.445 ± 1.185
2.932ThrHis: 2.932 ± 0.7
3.56ThrIle: 3.56 ± 0.607
4.188ThrLys: 4.188 ± 0.583
4.817ThrLeu: 4.817 ± 0.617
2.304ThrMet: 2.304 ± 0.543
3.56ThrAsn: 3.56 ± 0.696
1.675ThrPro: 1.675 ± 0.483
2.723ThrGln: 2.723 ± 0.798
5.026ThrArg: 5.026 ± 1.077
5.445ThrSer: 5.445 ± 1.057
4.817ThrThr: 4.817 ± 0.992
3.141ThrVal: 3.141 ± 1.098
0.419ThrTrp: 0.419 ± 0.238
2.723ThrTyr: 2.723 ± 0.79
0.0ThrXaa: 0.0 ± 0.0
Val
2.723ValAla: 2.723 ± 0.695
1.466ValCys: 1.466 ± 0.402
2.723ValAsp: 2.723 ± 1.12
3.77ValGlu: 3.77 ± 0.841
1.885ValPhe: 1.885 ± 0.585
3.77ValGly: 3.77 ± 1.155
1.047ValHis: 1.047 ± 0.389
1.466ValIle: 1.466 ± 0.466
1.675ValLys: 1.675 ± 0.399
5.864ValLeu: 5.864 ± 1.217
2.094ValMet: 2.094 ± 0.476
3.77ValAsn: 3.77 ± 0.541
2.304ValPro: 2.304 ± 0.672
2.304ValGln: 2.304 ± 0.698
4.817ValArg: 4.817 ± 1.064
4.607ValSer: 4.607 ± 0.811
2.723ValThr: 2.723 ± 0.6
2.723ValVal: 2.723 ± 0.705
0.628ValTrp: 0.628 ± 0.337
1.257ValTyr: 1.257 ± 0.274
0.0ValXaa: 0.0 ± 0.0
Trp
0.838TrpAla: 0.838 ± 0.462
0.0TrpCys: 0.0 ± 0.0
0.838TrpAsp: 0.838 ± 0.394
1.466TrpGlu: 1.466 ± 0.548
0.838TrpPhe: 0.838 ± 0.29
1.047TrpGly: 1.047 ± 0.382
0.838TrpHis: 0.838 ± 0.522
1.047TrpIle: 1.047 ± 0.332
0.628TrpLys: 0.628 ± 0.408
0.628TrpLeu: 0.628 ± 0.355
0.838TrpMet: 0.838 ± 0.33
0.628TrpAsn: 0.628 ± 0.303
0.419TrpPro: 0.419 ± 0.248
0.0TrpGln: 0.0 ± 0.0
0.628TrpArg: 0.628 ± 0.531
1.675TrpSer: 1.675 ± 0.667
1.466TrpThr: 1.466 ± 0.438
0.838TrpVal: 0.838 ± 0.381
0.419TrpTrp: 0.419 ± 0.231
0.419TrpTyr: 0.419 ± 0.244
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.628TyrAla: 0.628 ± 0.27
0.419TyrCys: 0.419 ± 0.248
2.094TyrAsp: 2.094 ± 0.672
1.047TyrGlu: 1.047 ± 0.47
1.466TyrPhe: 1.466 ± 0.387
2.513TyrGly: 2.513 ± 0.522
0.0TyrHis: 0.0 ± 0.0
1.675TyrIle: 1.675 ± 0.415
1.257TyrLys: 1.257 ± 0.388
2.094TyrLeu: 2.094 ± 0.538
0.419TyrMet: 0.419 ± 0.231
1.885TyrAsn: 1.885 ± 0.763
0.838TyrPro: 0.838 ± 0.451
1.257TyrGln: 1.257 ± 0.421
2.094TyrArg: 2.094 ± 0.865
2.304TyrSer: 2.304 ± 0.341
2.094TyrThr: 2.094 ± 0.627
1.257TyrVal: 1.257 ± 0.463
0.838TyrTrp: 0.838 ± 0.332
0.419TyrTyr: 0.419 ± 0.294
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (4776 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski