Amino acid dipeptide frequency for Influenza A virus (A/mallard/Wisconsin/08OS2271/2008(H11N9))

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent amino acids occurs.
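As a minimal sketch of how such frequencies can be computed, the Python snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies` and the toy one-letter sequences are illustrative assumptions, not the code used to build this page; in particular, how the database treats pairs at protein boundaries is not stated here.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of one-letter sequences.

    Hypothetical helper: counts overlapping pairs inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every adjacent residue pair in the sequence
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not real influenza proteins)
freqs = dipeptide_frequencies(["MAAL", "AAK"])
```

The result is a dictionary keyed by the two-letter dipeptide, with frequencies that sum to 1000 per mille.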

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰); to obtain fractions, multiply them by 10^-3. On this scale, a frequency of 0.25% corresponds to 2.5‰.
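The comparison against the uniform expectation can be sketched as follows. The names `expected_per_mille`, `classify` and `per_mille_to_fraction` are hypothetical, and the simple above/below rule is an assumption; the page does not state the exact threshold behind the red and blue marking.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)

def classify(value_per_mille, alphabet_size=20):
    """Label a per-mille value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"

def per_mille_to_fraction(value_per_mille):
    """Convert per mille to a plain fraction by multiplying by 10^-3."""
    return value_per_mille * 1e-3

print(expected_per_mille(20))  # 2.5 per mille, i.e. 0.25%
print(classify(7.866))         # AlaSer value from the table -> "over"
print(classify(0.492))         # AlaGln value from the table -> "under"
```

Passing `alphabet_size=21` gives the expectation when Xaa is counted as a 21st amino acid.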

For more information see sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by first residue and listed as dipeptide: frequency (‰) ± error.
Ala
AlaAla: 3.441 ± 1.849
AlaCys: 1.967 ± 0.575
AlaAsp: 1.475 ± 0.78
AlaGlu: 4.425 ± 1.111
AlaPhe: 1.967 ± 1.352
AlaGly: 3.441 ± 1.862
AlaHis: 0.983 ± 0.827
AlaIle: 4.425 ± 1.472
AlaLys: 1.475 ± 0.832
AlaLeu: 6.391 ± 0.96
AlaMet: 1.967 ± 0.407
AlaAsn: 2.95 ± 1.101
AlaPro: 1.475 ± 0.78
AlaGln: 0.492 ± 0.413
AlaArg: 3.441 ± 0.43
AlaSer: 7.866 ± 0.902
AlaThr: 4.916 ± 1.461
AlaVal: 3.441 ± 1.907
AlaTrp: 0.983 ± 0.574
AlaTyr: 0.983 ± 0.409
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.492 ± 0.37
CysCys: 0.0 ± 0.0
CysAsp: 0.492 ± 0.483
CysGlu: 0.492 ± 0.37
CysPhe: 1.967 ± 0.607
CysGly: 0.0 ± 0.0
CysHis: 1.475 ± 0.922
CysIle: 1.967 ± 0.832
CysLys: 0.492 ± 0.37
CysLeu: 1.475 ± 0.662
CysMet: 0.492 ± 0.37
CysAsn: 2.458 ± 0.676
CysPro: 0.492 ± 0.483
CysGln: 0.0 ± 0.0
CysArg: 2.458 ± 1.015
CysSer: 2.95 ± 1.624
CysThr: 2.458 ± 1.149
CysVal: 1.475 ± 0.291
CysTrp: 0.0 ± 0.0
CysTyr: 0.983 ± 0.966
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.458 ± 0.842
AspCys: 0.983 ± 0.574
AspAsp: 0.983 ± 0.426
AspGlu: 4.425 ± 0.991
AspPhe: 2.458 ± 1.341
AspGly: 3.441 ± 1.56
AspHis: 0.0 ± 0.0
AspIle: 1.475 ± 0.74
AspLys: 1.475 ± 0.702
AspLeu: 3.441 ± 1.311
AspMet: 2.458 ± 0.842
AspAsn: 3.441 ± 1.092
AspPro: 5.9 ± 1.132
AspGln: 2.458 ± 1.628
AspArg: 2.95 ± 1.046
AspSer: 3.441 ± 1.265
AspThr: 0.492 ± 0.37
AspVal: 2.95 ± 0.586
AspTrp: 0.983 ± 0.679
AspTyr: 1.967 ± 0.407
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.967 ± 0.876
GluCys: 2.95 ± 2.242
GluAsp: 6.391 ± 1.243
GluGlu: 8.85 ± 1.802
GluPhe: 2.458 ± 1.015
GluGly: 7.375 ± 1.712
GluHis: 0.983 ± 0.409
GluIle: 6.391 ± 1.332
GluLys: 5.408 ± 2.897
GluLeu: 5.408 ± 1.465
GluMet: 1.967 ± 1.078
GluAsn: 5.408 ± 1.383
GluPro: 3.933 ± 1.894
GluGln: 2.458 ± 1.615
GluArg: 4.916 ± 2.173
GluSer: 5.9 ± 1.924
GluThr: 5.408 ± 1.228
GluVal: 3.933 ± 1.448
GluTrp: 0.492 ± 0.569
GluTyr: 1.967 ± 0.407
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.475 ± 1.081
PheCys: 0.0 ± 0.0
PheAsp: 1.967 ± 0.842
PheGlu: 5.9 ± 2.128
PhePhe: 1.475 ± 0.662
PheGly: 0.983 ± 0.409
PheHis: 0.492 ± 0.37
PheIle: 1.967 ± 0.991
PheLys: 0.492 ± 0.483
PheLeu: 4.916 ± 1.121
PheMet: 1.967 ± 0.712
PheAsn: 2.458 ± 1.291
PhePro: 0.0 ± 0.0
PheGln: 2.95 ± 1.234
PheArg: 1.475 ± 0.662
PheSer: 4.425 ± 0.764
PheThr: 2.95 ± 0.586
PheVal: 0.983 ± 0.74
PheTrp: 0.492 ± 0.413
PheTyr: 1.475 ± 0.922
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.967 ± 0.931
GlyCys: 0.492 ± 0.37
GlyAsp: 2.458 ± 0.765
GlyGlu: 7.375 ± 1.663
GlyPhe: 1.967 ± 0.947
GlyGly: 2.95 ± 0.989
GlyHis: 0.492 ± 0.569
GlyIle: 3.441 ± 0.811
GlyLys: 5.408 ± 1.267
GlyLeu: 4.425 ± 1.01
GlyMet: 1.475 ± 0.599
GlyAsn: 2.458 ± 1.307
GlyPro: 1.967 ± 0.685
GlyGln: 2.458 ± 0.766
GlyArg: 6.883 ± 1.662
GlySer: 4.916 ± 1.437
GlyThr: 4.916 ± 1.203
GlyVal: 5.408 ± 0.457
GlyTrp: 0.983 ± 0.426
GlyTyr: 1.967 ± 1.053
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.492 ± 0.37
HisCys: 0.492 ± 0.37
HisAsp: 0.983 ± 0.966
HisGlu: 0.983 ± 0.74
HisPhe: 0.492 ± 0.37
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 2.95 ± 1.269
HisLys: 1.475 ± 0.735
HisLeu: 1.475 ± 0.634
HisMet: 0.492 ± 0.413
HisAsn: 0.492 ± 0.483
HisPro: 0.492 ± 0.413
HisGln: 0.0 ± 0.0
HisArg: 0.983 ± 0.624
HisSer: 1.967 ± 0.929
HisThr: 0.983 ± 0.679
HisVal: 0.492 ± 0.569
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.95 ± 0.751
IleCys: 2.458 ± 1.23
IleAsp: 1.475 ± 0.291
IleGlu: 9.833 ± 3.536
IlePhe: 1.475 ± 0.59
IleGly: 6.883 ± 1.588
IleHis: 0.983 ± 0.426
IleIle: 2.95 ± 0.468
IleLys: 2.95 ± 1.7
IleLeu: 4.916 ± 2.12
IleMet: 1.967 ± 0.709
IleAsn: 3.441 ± 0.999
IlePro: 1.475 ± 1.109
IleGln: 2.458 ± 0.766
IleArg: 6.391 ± 2.544
IleSer: 3.441 ± 1.56
IleThr: 2.458 ± 1.055
IleVal: 4.916 ± 0.947
IleTrp: 1.475 ± 0.814
IleTyr: 1.475 ± 0.291
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.441 ± 0.917
LysCys: 1.475 ± 0.634
LysAsp: 1.967 ± 0.407
LysGlu: 4.916 ± 1.981
LysPhe: 2.458 ± 1.027
LysGly: 3.933 ± 0.814
LysHis: 1.475 ± 0.832
LysIle: 3.441 ± 1.421
LysLys: 0.983 ± 0.409
LysLeu: 2.458 ± 1.027
LysMet: 1.967 ± 1.206
LysAsn: 1.475 ± 1.081
LysPro: 1.967 ± 0.851
LysGln: 1.967 ± 1.629
LysArg: 1.967 ± 1.389
LysSer: 2.95 ± 0.991
LysThr: 3.933 ± 2.427
LysVal: 2.458 ± 0.734
LysTrp: 2.95 ± 0.778
LysTyr: 0.983 ± 0.409
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.441 ± 1.644
LeuCys: 1.475 ± 1.032
LeuAsp: 2.458 ± 0.99
LeuGlu: 6.883 ± 2.546
LeuPhe: 0.492 ± 0.37
LeuGly: 4.425 ± 1.628
LeuHis: 0.983 ± 0.746
LeuIle: 7.866 ± 1.689
LeuLys: 6.391 ± 1.053
LeuLeu: 6.883 ± 3.365
LeuMet: 2.458 ± 0.628
LeuAsn: 2.95 ± 0.718
LeuPro: 4.425 ± 1.771
LeuGln: 3.441 ± 1.265
LeuArg: 6.391 ± 1.184
LeuSer: 4.425 ± 1.136
LeuThr: 4.916 ± 1.185
LeuVal: 1.967 ± 1.179
LeuTrp: 0.983 ± 0.574
LeuTyr: 3.933 ± 2.037
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.933 ± 1.345
MetCys: 1.967 ± 1.179
MetAsp: 4.916 ± 1.539
MetGlu: 3.933 ± 1.2
MetPhe: 0.492 ± 0.37
MetGly: 0.492 ± 0.558
MetHis: 0.0 ± 0.0
MetIle: 2.95 ± 1.042
MetLys: 2.458 ± 1.849
MetLeu: 0.983 ± 0.574
MetMet: 1.475 ± 1.24
MetAsn: 0.492 ± 0.483
MetPro: 0.492 ± 0.569
MetGln: 1.475 ± 1.138
MetArg: 2.95 ± 1.323
MetSer: 2.458 ± 0.581
MetThr: 1.967 ± 1.01
MetVal: 1.475 ± 1.24
MetTrp: 0.0 ± 0.0
MetTyr: 0.492 ± 0.37
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 6.391 ± 0.851
AsnCys: 0.983 ± 0.966
AsnAsp: 3.441 ± 0.43
AsnGlu: 2.95 ± 0.989
AsnPhe: 2.458 ± 0.949
AsnGly: 4.425 ± 1.307
AsnHis: 0.0 ± 0.0
AsnIle: 3.441 ± 1.56
AsnLys: 2.458 ± 1.341
AsnLeu: 2.95 ± 0.582
AsnMet: 1.967 ± 0.774
AsnAsn: 3.933 ± 3.274
AsnPro: 3.441 ± 0.811
AsnGln: 0.983 ± 0.526
AsnArg: 1.967 ± 1.292
AsnSer: 4.916 ± 1.257
AsnThr: 5.408 ± 1.371
AsnVal: 0.0 ± 0.0
AsnTrp: 0.983 ± 0.966
AsnTyr: 1.475 ± 0.634
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.458 ± 1.103
ProCys: 0.492 ± 0.37
ProAsp: 1.475 ± 0.832
ProGlu: 2.458 ± 0.721
ProPhe: 1.967 ± 0.842
ProGly: 1.967 ± 0.412
ProHis: 0.983 ± 0.74
ProIle: 1.967 ± 0.819
ProLys: 3.933 ± 1.087
ProLeu: 2.95 ± 1.054
ProMet: 1.475 ± 1.109
ProAsn: 5.408 ± 1.833
ProPro: 2.458 ± 0.99
ProGln: 0.492 ± 0.37
ProArg: 2.95 ± 0.847
ProSer: 1.967 ± 0.842
ProThr: 1.475 ± 0.922
ProVal: 2.458 ± 2.416
ProTrp: 0.492 ± 0.37
ProTyr: 0.983 ± 0.966
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.458 ± 0.581
GlnCys: 0.492 ± 0.37
GlnAsp: 0.983 ± 0.624
GlnGlu: 1.967 ± 0.712
GlnPhe: 0.492 ± 0.558
GlnGly: 2.95 ± 1.579
GlnHis: 0.0 ± 0.0
GlnIle: 3.441 ± 0.826
GlnLys: 2.458 ± 1.754
GlnLeu: 4.425 ± 2.204
GlnMet: 1.967 ± 0.575
GlnAsn: 1.475 ± 1.24
GlnPro: 0.983 ± 0.526
GlnGln: 0.983 ± 0.409
GlnArg: 3.441 ± 1.475
GlnSer: 2.95 ± 1.324
GlnThr: 1.475 ± 0.922
GlnVal: 2.458 ± 0.842
GlnTrp: 0.492 ± 0.483
GlnTyr: 1.475 ± 0.832
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.391 ± 1.307
ArgCys: 0.492 ± 0.37
ArgAsp: 3.441 ± 0.892
ArgGlu: 3.441 ± 1.067
ArgPhe: 3.441 ± 0.902
ArgGly: 6.883 ± 2.155
ArgHis: 0.492 ± 0.558
ArgIle: 2.95 ± 0.462
ArgLys: 1.967 ± 1.148
ArgLeu: 4.425 ± 1.355
ArgMet: 4.916 ± 2.499
ArgAsn: 3.933 ± 1.211
ArgPro: 4.425 ± 1.236
ArgGln: 2.95 ± 1.323
ArgArg: 6.391 ± 1.843
ArgSer: 5.9 ± 1.215
ArgThr: 7.375 ± 1.389
ArgVal: 2.95 ± 1.426
ArgTrp: 0.492 ± 0.558
ArgTyr: 0.983 ± 0.695
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.425 ± 1.826
SerCys: 2.95 ± 1.101
SerAsp: 3.933 ± 0.77
SerGlu: 3.933 ± 1.319
SerPhe: 5.408 ± 1.498
SerGly: 4.425 ± 1.804
SerHis: 1.475 ± 1.109
SerIle: 2.95 ± 0.725
SerLys: 2.95 ± 1.324
SerLeu: 7.375 ± 2.295
SerMet: 1.967 ± 0.947
SerAsn: 4.425 ± 1.947
SerPro: 2.458 ± 0.771
SerGln: 6.391 ± 2.059
SerArg: 5.408 ± 0.622
SerSer: 5.9 ± 1.495
SerThr: 3.441 ± 1.579
SerVal: 4.425 ± 1.643
SerTrp: 2.458 ± 1.015
SerTyr: 1.475 ± 0.814
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.916 ± 1.022
ThrCys: 0.492 ± 0.483
ThrAsp: 4.425 ± 1.561
ThrGlu: 4.425 ± 1.341
ThrPhe: 2.458 ± 1.151
ThrGly: 3.933 ± 0.737
ThrHis: 2.458 ± 0.721
ThrIle: 6.391 ± 2.657
ThrLys: 2.95 ± 1.072
ThrLeu: 3.933 ± 2.169
ThrMet: 1.967 ± 0.819
ThrAsn: 3.441 ± 0.603
ThrPro: 0.492 ± 0.37
ThrGln: 1.967 ± 1.564
ThrArg: 4.916 ± 1.461
ThrSer: 3.933 ± 1.369
ThrThr: 1.475 ± 0.634
ThrVal: 2.458 ± 1.103
ThrTrp: 1.967 ± 0.851
ThrTyr: 0.492 ± 0.413
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.458 ± 0.635
ValCys: 1.967 ± 0.851
ValAsp: 1.475 ± 0.78
ValGlu: 3.933 ± 1.132
ValPhe: 1.967 ± 0.575
ValGly: 2.95 ± 1.625
ValHis: 0.492 ± 0.37
ValIle: 2.458 ± 1.255
ValLys: 2.458 ± 0.346
ValLeu: 4.916 ± 1.668
ValMet: 1.475 ± 0.801
ValAsn: 1.475 ± 1.109
ValPro: 1.967 ± 0.931
ValGln: 2.458 ± 0.958
ValArg: 4.425 ± 1.086
ValSer: 5.408 ± 0.622
ValThr: 1.967 ± 0.984
ValVal: 1.967 ± 0.407
ValTrp: 0.983 ± 0.526
ValTyr: 1.967 ± 0.685
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.967 ± 0.685
TrpCys: 0.0 ± 0.0
TrpAsp: 0.492 ± 0.37
TrpGlu: 0.983 ± 0.426
TrpPhe: 0.983 ± 0.574
TrpGly: 0.492 ± 0.37
TrpHis: 1.475 ± 0.702
TrpIle: 0.983 ± 0.74
TrpLys: 0.983 ± 0.74
TrpLeu: 0.983 ± 0.746
TrpMet: 0.492 ± 0.413
TrpAsn: 0.983 ± 0.679
TrpPro: 1.475 ± 0.832
TrpGln: 0.492 ± 0.483
TrpArg: 1.967 ± 1.067
TrpSer: 0.983 ± 0.966
TrpThr: 0.983 ± 0.426
TrpVal: 0.983 ± 0.409
TrpTrp: 0.492 ± 0.483
TrpTyr: 0.492 ± 0.483
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.983 ± 0.426
TyrCys: 0.492 ± 0.37
TyrAsp: 2.95 ± 1.228
TyrGlu: 2.458 ± 0.958
TyrPhe: 1.967 ± 0.685
TyrGly: 2.458 ± 0.6
TyrHis: 0.0 ± 0.0
TyrIle: 1.967 ± 0.407
TyrLys: 0.492 ± 0.37
TyrLeu: 2.458 ± 0.346
TyrMet: 0.0 ± 0.0
TyrAsn: 1.475 ± 1.449
TyrPro: 0.492 ± 0.483
TyrGln: 0.492 ± 0.413
TyrArg: 1.967 ± 1.06
TyrSer: 1.475 ± 0.291
TyrThr: 0.492 ± 0.37
TyrVal: 1.967 ± 0.851
TyrTrp: 0.983 ± 0.526
TyrTyr: 1.475 ± 0.832
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2035 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
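A protein-level bootstrap of this kind can be sketched as below: proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the replicates serves as the error. This is an illustration under assumptions, not the database's implementation: the function names `pair_frequency` and `bootstrap_error`, the use of the standard deviation as the error measure, and the fixed seed are all hypothetical choices.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_frequency(sample, pair))
    return statistics.pstdev(replicates)

# Toy sequences, not the five proteins of this proteome
toy = ["MAALKAA", "GAAST", "MKLVAA"]
err = bootstrap_error(toy, "AA")
```

With only a handful of proteins, as here, each resample can differ substantially from the original set, which is one reason the reported errors can be large relative to the frequencies.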

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski