Amino acid dipepetide frequency for Influenza A virus (strain A/Turkey/Minnesota/833/1980 H4N2)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.955AlaAla: 3.955 ± 1.028
1.249AlaCys: 1.249 ± 0.507
2.498AlaAsp: 2.498 ± 0.517
4.163AlaGlu: 4.163 ± 1.034
1.665AlaPhe: 1.665 ± 0.668
3.747AlaGly: 3.747 ± 1.053
0.624AlaHis: 0.624 ± 0.419
3.955AlaIle: 3.955 ± 0.812
1.873AlaLys: 1.873 ± 0.591
5.828AlaLeu: 5.828 ± 0.918
2.706AlaMet: 2.706 ± 0.674
3.331AlaAsn: 3.331 ± 0.79
2.082AlaPro: 2.082 ± 0.515
1.665AlaGln: 1.665 ± 0.492
3.122AlaArg: 3.122 ± 0.505
4.788AlaSer: 4.788 ± 1.367
5.204AlaThr: 5.204 ± 0.762
2.914AlaVal: 2.914 ± 0.798
1.249AlaTrp: 1.249 ± 0.561
1.041AlaTyr: 1.041 ± 0.311
0.0AlaXaa: 0.0 ± 0.0
Cys
0.624CysAla: 0.624 ± 0.331
0.208CysCys: 0.208 ± 0.191
0.833CysAsp: 0.833 ± 0.506
0.833CysGlu: 0.833 ± 0.279
1.873CysPhe: 1.873 ± 0.688
0.208CysGly: 0.208 ± 0.186
1.041CysHis: 1.041 ± 0.246
1.665CysIle: 1.665 ± 0.642
1.249CysLys: 1.249 ± 0.385
1.249CysLeu: 1.249 ± 0.437
0.624CysMet: 0.624 ± 0.281
1.041CysAsn: 1.041 ± 0.397
0.624CysPro: 0.624 ± 0.355
0.833CysGln: 0.833 ± 0.408
1.041CysArg: 1.041 ± 0.665
1.457CysSer: 1.457 ± 0.574
0.833CysThr: 0.833 ± 0.368
1.457CysVal: 1.457 ± 0.53
0.208CysTrp: 0.208 ± 0.181
0.624CysTyr: 0.624 ± 0.382
0.0CysXaa: 0.0 ± 0.0
Asp
2.29AspAla: 2.29 ± 0.557
1.249AspCys: 1.249 ± 0.391
1.665AspAsp: 1.665 ± 0.401
3.331AspGlu: 3.331 ± 0.642
2.29AspPhe: 2.29 ± 0.806
3.331AspGly: 3.331 ± 0.809
0.833AspHis: 0.833 ± 0.41
2.706AspIle: 2.706 ± 0.701
1.457AspLys: 1.457 ± 0.461
4.163AspLeu: 4.163 ± 0.808
1.665AspMet: 1.665 ± 0.483
2.914AspAsn: 2.914 ± 0.839
3.747AspPro: 3.747 ± 0.924
1.873AspGln: 1.873 ± 0.829
2.706AspArg: 2.706 ± 0.599
3.747AspSer: 3.747 ± 0.958
2.29AspThr: 2.29 ± 0.626
3.122AspVal: 3.122 ± 0.631
0.416AspTrp: 0.416 ± 0.291
2.082AspTyr: 2.082 ± 0.557
0.0AspXaa: 0.0 ± 0.0
Glu
2.498GluAla: 2.498 ± 0.557
1.041GluCys: 1.041 ± 0.598
4.788GluAsp: 4.788 ± 0.733
6.453GluGlu: 6.453 ± 1.014
2.082GluPhe: 2.082 ± 0.614
4.371GluGly: 4.371 ± 1.159
0.833GluHis: 0.833 ± 0.552
4.996GluIle: 4.996 ± 0.92
6.245GluLys: 6.245 ± 1.432
5.412GluLeu: 5.412 ± 0.665
2.706GluMet: 2.706 ± 0.628
3.955GluAsn: 3.955 ± 1.21
3.122GluPro: 3.122 ± 1.259
3.955GluGln: 3.955 ± 1.323
4.788GluArg: 4.788 ± 1.02
6.453GluSer: 6.453 ± 1.177
3.747GluThr: 3.747 ± 0.644
5.204GluVal: 5.204 ± 1.166
1.041GluTrp: 1.041 ± 0.419
1.665GluTyr: 1.665 ± 0.432
0.0GluXaa: 0.0 ± 0.0
Phe
2.082PheAla: 2.082 ± 0.579
0.208PheCys: 0.208 ± 0.186
1.457PheAsp: 1.457 ± 0.421
5.204PheGlu: 5.204 ± 1.287
1.457PhePhe: 1.457 ± 0.47
1.457PheGly: 1.457 ± 0.287
1.041PheHis: 1.041 ± 0.37
2.706PheIle: 2.706 ± 0.811
0.833PheLys: 0.833 ± 0.447
4.163PheLeu: 4.163 ± 0.734
1.041PheMet: 1.041 ± 0.425
2.29PheAsn: 2.29 ± 0.715
0.624PhePro: 0.624 ± 0.392
2.914PheGln: 2.914 ± 0.741
1.665PheArg: 1.665 ± 0.33
3.539PheSer: 3.539 ± 0.49
2.498PheThr: 2.498 ± 0.505
2.706PheVal: 2.706 ± 0.784
0.208PheTrp: 0.208 ± 0.21
1.041PheTyr: 1.041 ± 0.395
0.0PheXaa: 0.0 ± 0.0
Gly
2.706GlyAla: 2.706 ± 0.806
0.624GlyCys: 0.624 ± 0.358
3.747GlyAsp: 3.747 ± 0.453
3.747GlyGlu: 3.747 ± 1.508
2.498GlyPhe: 2.498 ± 0.51
2.914GlyGly: 2.914 ± 0.774
1.041GlyHis: 1.041 ± 0.538
3.747GlyIle: 3.747 ± 0.913
3.955GlyLys: 3.955 ± 0.562
4.371GlyLeu: 4.371 ± 1.028
2.082GlyMet: 2.082 ± 0.452
3.331GlyAsn: 3.331 ± 0.872
2.498GlyPro: 2.498 ± 0.791
2.29GlyGln: 2.29 ± 0.49
5.204GlyArg: 5.204 ± 0.868
4.788GlySer: 4.788 ± 1.309
6.037GlyThr: 6.037 ± 1.037
4.58GlyVal: 4.58 ± 0.512
1.249GlyTrp: 1.249 ± 0.509
1.873GlyTyr: 1.873 ± 0.576
0.0GlyXaa: 0.0 ± 0.0
His
0.624HisAla: 0.624 ± 0.238
0.208HisCys: 0.208 ± 0.192
0.624HisAsp: 0.624 ± 0.382
0.833HisGlu: 0.833 ± 0.38
1.457HisPhe: 1.457 ± 0.451
0.833HisGly: 0.833 ± 0.403
0.416HisHis: 0.416 ± 0.335
2.29HisIle: 2.29 ± 1.034
1.041HisLys: 1.041 ± 0.463
1.873HisLeu: 1.873 ± 0.508
0.208HisMet: 0.208 ± 0.181
0.0HisAsn: 0.0 ± 0.0
1.041HisPro: 1.041 ± 0.369
1.041HisGln: 1.041 ± 0.494
1.665HisArg: 1.665 ± 0.698
1.249HisSer: 1.249 ± 0.453
0.624HisThr: 0.624 ± 0.439
0.416HisVal: 0.416 ± 0.292
0.0HisTrp: 0.0 ± 0.0
0.624HisTyr: 0.624 ± 0.324
0.0HisXaa: 0.0 ± 0.0
Ile
4.163IleAla: 4.163 ± 0.627
2.082IleCys: 2.082 ± 0.525
3.539IleAsp: 3.539 ± 0.992
7.494IleGlu: 7.494 ± 1.924
1.457IlePhe: 1.457 ± 0.336
4.996IleGly: 4.996 ± 0.8
0.624IleHis: 0.624 ± 0.345
3.331IleIle: 3.331 ± 0.933
3.331IleLys: 3.331 ± 1.051
6.245IleLeu: 6.245 ± 1.578
2.498IleMet: 2.498 ± 0.461
3.955IleAsn: 3.955 ± 0.826
2.082IlePro: 2.082 ± 0.639
2.082IleGln: 2.082 ± 0.454
5.828IleArg: 5.828 ± 1.183
3.122IleSer: 3.122 ± 0.886
3.539IleThr: 3.539 ± 0.935
4.371IleVal: 4.371 ± 0.92
0.833IleTrp: 0.833 ± 0.449
1.457IleTyr: 1.457 ± 0.546
0.0IleXaa: 0.0 ± 0.0
Lys
4.371LysAla: 4.371 ± 0.888
1.665LysCys: 1.665 ± 0.667
2.706LysAsp: 2.706 ± 0.374
4.58LysGlu: 4.58 ± 1.114
1.873LysPhe: 1.873 ± 0.678
2.706LysGly: 2.706 ± 0.603
0.833LysHis: 0.833 ± 0.345
3.539LysIle: 3.539 ± 0.798
3.331LysLys: 3.331 ± 1.544
4.788LysLeu: 4.788 ± 1.188
2.498LysMet: 2.498 ± 0.734
1.665LysAsn: 1.665 ± 0.601
1.041LysPro: 1.041 ± 0.453
2.706LysGln: 2.706 ± 1.031
4.996LysArg: 4.996 ± 1.477
3.539LysSer: 3.539 ± 0.773
4.163LysThr: 4.163 ± 1.195
1.873LysVal: 1.873 ± 0.517
1.873LysTrp: 1.873 ± 0.647
1.873LysTyr: 1.873 ± 0.515
0.0LysXaa: 0.0 ± 0.0
Leu
4.788LeuAla: 4.788 ± 0.756
0.624LeuCys: 0.624 ± 0.339
1.457LeuAsp: 1.457 ± 0.63
6.453LeuGlu: 6.453 ± 1.207
2.706LeuPhe: 2.706 ± 0.597
3.955LeuGly: 3.955 ± 0.766
1.249LeuHis: 1.249 ± 0.464
6.245LeuIle: 6.245 ± 1.241
7.494LeuLys: 7.494 ± 1.586
6.453LeuLeu: 6.453 ± 1.43
2.498LeuMet: 2.498 ± 0.535
4.163LeuAsn: 4.163 ± 1.169
3.539LeuPro: 3.539 ± 0.746
2.914LeuGln: 2.914 ± 0.755
5.62LeuArg: 5.62 ± 1.326
4.58LeuSer: 4.58 ± 0.824
5.828LeuThr: 5.828 ± 1.454
3.747LeuVal: 3.747 ± 0.898
1.873LeuTrp: 1.873 ± 0.578
2.914LeuTyr: 2.914 ± 1.013
0.0LeuXaa: 0.0 ± 0.0
Met
3.955MetAla: 3.955 ± 0.684
0.833MetCys: 0.833 ± 0.601
3.331MetAsp: 3.331 ± 0.937
4.371MetGlu: 4.371 ± 0.885
1.041MetPhe: 1.041 ± 0.736
1.873MetGly: 1.873 ± 0.842
0.416MetHis: 0.416 ± 0.313
2.498MetIle: 2.498 ± 0.566
2.29MetLys: 2.29 ± 0.91
1.665MetLeu: 1.665 ± 0.417
1.457MetMet: 1.457 ± 0.6
0.624MetAsn: 0.624 ± 0.382
0.833MetPro: 0.833 ± 0.341
1.457MetGln: 1.457 ± 0.638
2.29MetArg: 2.29 ± 0.813
2.29MetSer: 2.29 ± 0.566
1.665MetThr: 1.665 ± 0.621
3.122MetVal: 3.122 ± 1.008
0.416MetTrp: 0.416 ± 0.262
0.833MetTyr: 0.833 ± 0.283
0.0MetXaa: 0.0 ± 0.0
Asn
4.163AsnAla: 4.163 ± 0.964
0.208AsnCys: 0.208 ± 0.186
2.29AsnAsp: 2.29 ± 0.397
3.955AsnGlu: 3.955 ± 0.953
1.665AsnPhe: 1.665 ± 0.465
5.62AsnGly: 5.62 ± 1.758
0.208AsnHis: 0.208 ± 0.191
3.122AsnIle: 3.122 ± 0.693
2.914AsnLys: 2.914 ± 0.674
3.122AsnLeu: 3.122 ± 0.601
2.498AsnMet: 2.498 ± 0.775
2.914AsnAsn: 2.914 ± 1.031
4.163AsnPro: 4.163 ± 0.588
1.873AsnGln: 1.873 ± 0.56
3.539AsnArg: 3.539 ± 0.79
3.122AsnSer: 3.122 ± 0.611
3.955AsnThr: 3.955 ± 0.765
1.665AsnVal: 1.665 ± 0.71
1.249AsnTrp: 1.249 ± 0.615
0.833AsnTyr: 0.833 ± 0.376
0.0AsnXaa: 0.0 ± 0.0
Pro
2.706ProAla: 2.706 ± 1.012
0.624ProCys: 0.624 ± 0.345
1.457ProAsp: 1.457 ± 0.459
2.914ProGlu: 2.914 ± 0.544
2.29ProPhe: 2.29 ± 0.422
2.498ProGly: 2.498 ± 0.533
0.624ProHis: 0.624 ± 0.416
2.498ProIle: 2.498 ± 0.333
2.914ProLys: 2.914 ± 0.811
3.539ProLeu: 3.539 ± 0.814
1.249ProMet: 1.249 ± 0.669
3.539ProAsn: 3.539 ± 0.806
1.249ProPro: 1.249 ± 0.523
0.833ProGln: 0.833 ± 0.401
2.082ProArg: 2.082 ± 0.737
3.122ProSer: 3.122 ± 0.804
1.873ProThr: 1.873 ± 0.617
1.457ProVal: 1.457 ± 0.567
0.624ProTrp: 0.624 ± 0.333
0.624ProTyr: 0.624 ± 0.398
0.0ProXaa: 0.0 ± 0.0
Gln
2.29GlnAla: 2.29 ± 1.005
0.833GlnCys: 0.833 ± 0.342
1.665GlnAsp: 1.665 ± 0.548
2.706GlnGlu: 2.706 ± 0.842
0.624GlnPhe: 0.624 ± 0.315
2.914GlnGly: 2.914 ± 0.784
0.833GlnHis: 0.833 ± 0.409
3.539GlnIle: 3.539 ± 0.613
2.914GlnLys: 2.914 ± 0.918
3.331GlnLeu: 3.331 ± 1.081
2.29GlnMet: 2.29 ± 0.893
3.539GlnAsn: 3.539 ± 1.035
0.624GlnPro: 0.624 ± 0.391
1.249GlnGln: 1.249 ± 0.428
3.122GlnArg: 3.122 ± 0.981
3.122GlnSer: 3.122 ± 0.921
3.331GlnThr: 3.331 ± 0.996
2.706GlnVal: 2.706 ± 0.744
0.833GlnTrp: 0.833 ± 0.444
0.624GlnTyr: 0.624 ± 0.243
0.0GlnXaa: 0.0 ± 0.0
Arg
4.371ArgAla: 4.371 ± 0.741
1.041ArgCys: 1.041 ± 0.388
3.331ArgAsp: 3.331 ± 0.739
3.122ArgGlu: 3.122 ± 0.831
2.914ArgPhe: 2.914 ± 0.669
7.077ArgGly: 7.077 ± 1.055
0.833ArgHis: 0.833 ± 0.423
4.371ArgIle: 4.371 ± 0.742
2.29ArgLys: 2.29 ± 0.714
5.412ArgLeu: 5.412 ± 0.634
3.539ArgMet: 3.539 ± 1.537
4.996ArgAsn: 4.996 ± 0.808
2.498ArgPro: 2.498 ± 0.501
3.539ArgGln: 3.539 ± 0.736
5.62ArgArg: 5.62 ± 1.003
3.747ArgSer: 3.747 ± 1.145
5.828ArgThr: 5.828 ± 1.061
3.331ArgVal: 3.331 ± 0.971
0.416ArgTrp: 0.416 ± 0.371
1.457ArgTyr: 1.457 ± 0.531
0.0ArgXaa: 0.0 ± 0.0
Ser
2.914SerAla: 2.914 ± 0.973
2.29SerCys: 2.29 ± 0.75
2.914SerAsp: 2.914 ± 0.808
3.122SerGlu: 3.122 ± 0.747
4.58SerPhe: 4.58 ± 0.915
5.204SerGly: 5.204 ± 1.204
1.665SerHis: 1.665 ± 0.658
6.037SerIle: 6.037 ± 0.897
3.539SerLys: 3.539 ± 1.138
5.828SerLeu: 5.828 ± 1.317
2.082SerMet: 2.082 ± 0.761
2.914SerAsn: 2.914 ± 0.879
2.914SerPro: 2.914 ± 0.577
4.996SerGln: 4.996 ± 0.602
3.539SerArg: 3.539 ± 0.649
7.494SerSer: 7.494 ± 1.552
4.371SerThr: 4.371 ± 1.106
3.122SerVal: 3.122 ± 0.742
1.457SerTrp: 1.457 ± 0.674
1.665SerTyr: 1.665 ± 0.607
0.0SerXaa: 0.0 ± 0.0
Thr
4.788ThrAla: 4.788 ± 0.435
1.249ThrCys: 1.249 ± 0.371
2.706ThrAsp: 2.706 ± 0.783
4.371ThrGlu: 4.371 ± 1.138
2.498ThrPhe: 2.498 ± 0.635
4.788ThrGly: 4.788 ± 0.944
2.082ThrHis: 2.082 ± 0.692
4.996ThrIle: 4.996 ± 0.885
4.371ThrLys: 4.371 ± 0.608
4.58ThrLeu: 4.58 ± 0.919
2.29ThrMet: 2.29 ± 0.445
2.706ThrAsn: 2.706 ± 0.422
1.457ThrPro: 1.457 ± 0.481
3.331ThrGln: 3.331 ± 1.17
5.204ThrArg: 5.204 ± 0.842
3.331ThrSer: 3.331 ± 0.7
3.747ThrThr: 3.747 ± 0.96
4.58ThrVal: 4.58 ± 1.134
0.416ThrTrp: 0.416 ± 0.233
2.29ThrTyr: 2.29 ± 0.7
0.0ThrXaa: 0.0 ± 0.0
Val
2.706ValAla: 2.706 ± 0.851
2.082ValCys: 2.082 ± 0.963
3.747ValAsp: 3.747 ± 1.024
4.788ValGlu: 4.788 ± 0.915
2.498ValPhe: 2.498 ± 0.523
2.29ValGly: 2.29 ± 0.789
1.249ValHis: 1.249 ± 0.39
2.082ValIle: 2.082 ± 0.674
2.706ValLys: 2.706 ± 0.793
4.371ValLeu: 4.371 ± 1.6
1.873ValMet: 1.873 ± 0.537
3.331ValAsn: 3.331 ± 0.76
2.914ValPro: 2.914 ± 0.914
1.873ValGln: 1.873 ± 0.838
4.371ValArg: 4.371 ± 0.891
5.412ValSer: 5.412 ± 0.727
2.706ValThr: 2.706 ± 0.649
3.539ValVal: 3.539 ± 0.74
0.624ValTrp: 0.624 ± 0.405
1.249ValTyr: 1.249 ± 0.314
0.0ValXaa: 0.0 ± 0.0
Trp
1.041TrpAla: 1.041 ± 0.416
0.0TrpCys: 0.0 ± 0.0
0.833TrpAsp: 0.833 ± 0.297
1.457TrpGlu: 1.457 ± 0.57
0.624TrpPhe: 0.624 ± 0.283
0.624TrpGly: 0.624 ± 0.238
0.416TrpHis: 0.416 ± 0.31
1.249TrpIle: 1.249 ± 0.376
1.041TrpLys: 1.041 ± 0.649
1.041TrpLeu: 1.041 ± 0.481
0.833TrpMet: 0.833 ± 0.337
0.833TrpAsn: 0.833 ± 0.353
0.416TrpPro: 0.416 ± 0.258
0.208TrpGln: 0.208 ± 0.168
0.833TrpArg: 0.833 ± 0.545
1.457TrpSer: 1.457 ± 0.775
1.665TrpThr: 1.665 ± 0.565
0.833TrpVal: 0.833 ± 0.306
0.624TrpTrp: 0.624 ± 0.274
0.208TrpTyr: 0.208 ± 0.168
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.624TyrAla: 0.624 ± 0.254
0.208TyrCys: 0.208 ± 0.192
2.29TyrAsp: 2.29 ± 0.687
1.457TyrGlu: 1.457 ± 0.572
1.041TyrPhe: 1.041 ± 0.333
1.873TyrGly: 1.873 ± 0.388
0.208TyrHis: 0.208 ± 0.168
1.457TyrIle: 1.457 ± 0.362
1.041TyrLys: 1.041 ± 0.356
1.665TyrLeu: 1.665 ± 0.455
0.416TyrMet: 0.416 ± 0.238
1.041TyrAsn: 1.041 ± 0.388
1.249TyrPro: 1.249 ± 0.509
1.457TyrGln: 1.457 ± 0.354
2.498TyrArg: 2.498 ± 1.054
2.29TyrSer: 2.29 ± 0.487
1.873TyrThr: 1.873 ± 0.692
1.665TyrVal: 1.665 ± 0.647
0.624TyrTrp: 0.624 ± 0.281
0.416TyrTyr: 0.416 ± 0.28
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (4805 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski