Amino acid dipeptide frequency for Influenza A virus (A/northern shoveler/California/JN1447/2007(H7N2))

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly and with equal probability in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
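The counting described above can be illustrated with a minimal Python sketch. It is not the database's own code: the function name `dipeptide_per_mille`, the toy sequences, and the choice to normalize by the total number of overlapping pairs are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 400 standard pairs.

    Assumption: frequencies are normalized by the total number of
    overlapping residue pairs found in the input sequences.
    """
    counts = Counter()
    for seq in sequences:
        # Pairs are counted inside each protein, never across protein boundaries.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


# Hypothetical toy sequences, not taken from the proteome above.
toy_proteins = ["MKTAYIAKQR", "MSSGLLA"]
freqs = dipeptide_per_mille(toy_proteins)

# Uniform expectation: 1/400 of all dipeptides, i.e. 2.5 per mille.
expected = 1000.0 / len(AMINO_ACIDS) ** 2
over = sorted(dp for dp, f in freqs.items() if f > expected)
under = sorted(dp for dp, f in freqs.items() if f < expected)
print(f"expected = {expected} per mille, over = {len(over)}, under = {len(under)}")
```

With only 15 pairs in the toy input, almost every dipeptide is absent and therefore counts as underrepresented; a real proteome gives a much more informative split.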

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
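As a small worked example of this unit conversion, the sketch below turns one table value (SerSer, 10.72‰) into a fraction and a percentage, and compares it with the uniform expectation of 2.5‰. The helper name `per_mille_to_fraction` is hypothetical.

```python
import math


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ser_ser = 10.72  # SerSer value from the table, in per mille
fraction = per_mille_to_fraction(ser_ser)
percent = fraction * 100.0

# Ratio to the uniform expectation of 1/400 (2.5 per mille).
fold_over_uniform = ser_ser / 2.5

print(f"fraction = {fraction:.5f}, percent = {percent:.3f}%")
print(f"fold over uniform expectation = {fold_over_uniform:.3f}")
assert math.isclose(fraction, 0.01072)
```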

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.355AlaAla: 4.355 ± 1.198
1.34AlaCys: 1.34 ± 0.65
3.015AlaAsp: 3.015 ± 0.605
2.68AlaGlu: 2.68 ± 1.037
2.68AlaPhe: 2.68 ± 0.855
5.025AlaGly: 5.025 ± 1.171
0.67AlaHis: 0.67 ± 0.551
4.02AlaIle: 4.02 ± 1.245
2.01AlaLys: 2.01 ± 0.898
4.69AlaLeu: 4.69 ± 1.308
3.015AlaMet: 3.015 ± 1.172
2.68AlaAsn: 2.68 ± 0.521
1.675AlaPro: 1.675 ± 0.537
1.675AlaGln: 1.675 ± 0.345
2.68AlaArg: 2.68 ± 0.636
5.36AlaSer: 5.36 ± 1.543
7.035AlaThr: 7.035 ± 0.647
4.02AlaVal: 4.02 ± 0.886
1.005AlaTrp: 1.005 ± 0.67
1.005AlaTyr: 1.005 ± 0.439
0.0AlaXaa: 0.0 ± 0.0
Cys
0.67CysAla: 0.67 ± 0.466
0.0CysCys: 0.0 ± 0.0
0.67CysAsp: 0.67 ± 0.546
1.005CysGlu: 1.005 ± 0.492
2.01CysPhe: 2.01 ± 0.903
1.005CysGly: 1.005 ± 0.626
1.005CysHis: 1.005 ± 0.37
2.345CysIle: 2.345 ± 1.057
0.67CysLys: 0.67 ± 0.515
0.67CysLeu: 0.67 ± 0.325
1.34CysMet: 1.34 ± 0.613
0.335CysAsn: 0.335 ± 0.276
0.67CysPro: 0.67 ± 0.361
0.335CysGln: 0.335 ± 0.321
1.675CysArg: 1.675 ± 1.038
1.34CysSer: 1.34 ± 0.825
1.005CysThr: 1.005 ± 0.534
1.34CysVal: 1.34 ± 1.008
0.335CysTrp: 0.335 ± 0.257
1.005CysTyr: 1.005 ± 0.626
0.0CysXaa: 0.0 ± 0.0
Asp
2.345AspAla: 2.345 ± 0.697
1.34AspCys: 1.34 ± 0.623
2.01AspAsp: 2.01 ± 0.741
2.345AspGlu: 2.345 ± 0.966
0.67AspPhe: 0.67 ± 0.325
3.015AspGly: 3.015 ± 1.266
0.67AspHis: 0.67 ± 0.313
3.015AspIle: 3.015 ± 0.75
2.345AspLys: 2.345 ± 0.764
4.02AspLeu: 4.02 ± 1.084
1.34AspMet: 1.34 ± 0.546
3.35AspAsn: 3.35 ± 1.112
3.685AspPro: 3.685 ± 0.604
3.015AspGln: 3.015 ± 1.193
3.35AspArg: 3.35 ± 0.661
5.025AspSer: 5.025 ± 1.073
1.34AspThr: 1.34 ± 0.546
3.35AspVal: 3.35 ± 0.633
0.335AspTrp: 0.335 ± 0.34
1.005AspTyr: 1.005 ± 0.408
0.0AspXaa: 0.0 ± 0.0
Glu
2.345GluAla: 2.345 ± 0.993
1.34GluCys: 1.34 ± 1.063
4.02GluAsp: 4.02 ± 1.031
6.365GluGlu: 6.365 ± 1.625
1.34GluPhe: 1.34 ± 0.626
4.69GluGly: 4.69 ± 0.903
0.335GluHis: 0.335 ± 0.276
5.025GluIle: 5.025 ± 1.108
2.345GluLys: 2.345 ± 1.128
5.025GluLeu: 5.025 ± 0.885
2.345GluMet: 2.345 ± 0.909
3.685GluAsn: 3.685 ± 1.546
1.34GluPro: 1.34 ± 0.664
5.025GluGln: 5.025 ± 1.724
6.03GluArg: 6.03 ± 1.023
4.69GluSer: 4.69 ± 1.187
3.685GluThr: 3.685 ± 1.097
5.025GluVal: 5.025 ± 1.938
0.67GluTrp: 0.67 ± 0.497
1.005GluTyr: 1.005 ± 0.501
0.0GluXaa: 0.0 ± 0.0
Phe
2.01PheAla: 2.01 ± 0.772
0.335PheCys: 0.335 ± 0.321
2.01PheAsp: 2.01 ± 0.448
3.685PheGlu: 3.685 ± 1.535
1.005PhePhe: 1.005 ± 0.586
2.01PheGly: 2.01 ± 0.545
1.675PheHis: 1.675 ± 0.86
2.01PheIle: 2.01 ± 0.868
1.34PheLys: 1.34 ± 0.657
3.015PheLeu: 3.015 ± 0.8
1.005PheMet: 1.005 ± 0.556
0.67PheAsn: 0.67 ± 0.313
0.67PhePro: 0.67 ± 0.313
3.35PheGln: 3.35 ± 1.176
1.675PheArg: 1.675 ± 0.408
3.685PheSer: 3.685 ± 0.578
3.015PheThr: 3.015 ± 0.973
1.675PheVal: 1.675 ± 1.004
0.335PheTrp: 0.335 ± 0.276
1.005PheTyr: 1.005 ± 0.408
0.0PheXaa: 0.0 ± 0.0
Gly
4.69GlyAla: 4.69 ± 1.357
0.67GlyCys: 0.67 ± 0.546
3.685GlyAsp: 3.685 ± 0.563
3.015GlyGlu: 3.015 ± 0.758
2.68GlyPhe: 2.68 ± 0.748
4.355GlyGly: 4.355 ± 0.829
1.34GlyHis: 1.34 ± 0.631
4.355GlyIle: 4.355 ± 1.147
5.36GlyLys: 5.36 ± 0.943
5.025GlyLeu: 5.025 ± 1.812
2.345GlyMet: 2.345 ± 0.413
3.685GlyAsn: 3.685 ± 0.609
3.015GlyPro: 3.015 ± 0.705
2.68GlyGln: 2.68 ± 0.527
5.025GlyArg: 5.025 ± 1.455
7.37GlySer: 7.37 ± 1.573
8.04GlyThr: 8.04 ± 1.456
4.355GlyVal: 4.355 ± 1.047
1.675GlyTrp: 1.675 ± 0.88
2.345GlyTyr: 2.345 ± 0.918
0.0GlyXaa: 0.0 ± 0.0
His
0.67HisAla: 0.67 ± 0.313
0.0HisCys: 0.0 ± 0.0
0.67HisAsp: 0.67 ± 0.641
0.67HisGlu: 0.67 ± 0.412
1.34HisPhe: 1.34 ± 0.612
0.67HisGly: 0.67 ± 0.412
0.67HisHis: 0.67 ± 0.546
0.67HisIle: 0.67 ± 0.641
1.005HisLys: 1.005 ± 0.537
1.34HisLeu: 1.34 ± 0.528
0.335HisMet: 0.335 ± 0.257
0.0HisAsn: 0.0 ± 0.0
0.67HisPro: 0.67 ± 0.501
0.67HisGln: 0.67 ± 0.313
1.34HisArg: 1.34 ± 0.875
2.345HisSer: 2.345 ± 0.6
1.005HisThr: 1.005 ± 0.631
0.67HisVal: 0.67 ± 0.424
0.335HisTrp: 0.335 ± 0.273
0.335HisTyr: 0.335 ± 0.257
0.0HisXaa: 0.0 ± 0.0
Ile
5.36IleAla: 5.36 ± 0.956
2.01IleCys: 2.01 ± 0.972
4.355IleAsp: 4.355 ± 1.843
5.025IleGlu: 5.025 ± 0.913
1.005IlePhe: 1.005 ± 0.409
4.69IleGly: 4.69 ± 0.738
0.335IleHis: 0.335 ± 0.321
4.355IleIle: 4.355 ± 0.941
2.68IleLys: 2.68 ± 0.75
8.04IleLeu: 8.04 ± 1.503
2.345IleMet: 2.345 ± 0.497
3.685IleAsn: 3.685 ± 1.065
2.01IlePro: 2.01 ± 0.673
1.675IleGln: 1.675 ± 0.5
7.035IleArg: 7.035 ± 1.568
1.675IleSer: 1.675 ± 0.63
4.69IleThr: 4.69 ± 1.117
4.69IleVal: 4.69 ± 1.204
1.675IleTrp: 1.675 ± 0.665
1.34IleTyr: 1.34 ± 0.772
0.0IleXaa: 0.0 ± 0.0
Lys
3.685LysAla: 3.685 ± 1.284
1.34LysCys: 1.34 ± 0.761
2.345LysAsp: 2.345 ± 0.425
3.685LysGlu: 3.685 ± 0.763
1.005LysPhe: 1.005 ± 0.511
2.345LysGly: 2.345 ± 0.639
0.67LysHis: 0.67 ± 0.332
3.015LysIle: 3.015 ± 0.882
1.34LysLys: 1.34 ± 0.419
3.685LysLeu: 3.685 ± 1.309
2.01LysMet: 2.01 ± 0.905
1.675LysAsn: 1.675 ± 0.766
0.67LysPro: 0.67 ± 0.361
2.345LysGln: 2.345 ± 1.133
5.36LysArg: 5.36 ± 1.732
2.345LysSer: 2.345 ± 0.773
2.68LysThr: 2.68 ± 0.818
1.675LysVal: 1.675 ± 0.831
1.34LysTrp: 1.34 ± 0.469
1.675LysTyr: 1.675 ± 0.54
0.0LysXaa: 0.0 ± 0.0
Leu
3.685LeuAla: 3.685 ± 0.725
0.335LeuCys: 0.335 ± 0.34
1.675LeuAsp: 1.675 ± 0.97
4.69LeuGlu: 4.69 ± 1.612
1.675LeuPhe: 1.675 ± 0.574
5.025LeuGly: 5.025 ± 1.018
1.675LeuHis: 1.675 ± 0.829
9.38LeuIle: 9.38 ± 1.687
4.69LeuLys: 4.69 ± 2.072
7.035LeuLeu: 7.035 ± 2.067
3.015LeuMet: 3.015 ± 0.703
2.68LeuAsn: 2.68 ± 0.491
3.35LeuPro: 3.35 ± 1.151
3.015LeuGln: 3.015 ± 1.006
6.365LeuArg: 6.365 ± 1.985
5.36LeuSer: 5.36 ± 1.07
6.7LeuThr: 6.7 ± 1.892
4.02LeuVal: 4.02 ± 0.575
1.34LeuTrp: 1.34 ± 0.631
2.01LeuTyr: 2.01 ± 0.611
0.0LeuXaa: 0.0 ± 0.0
Met
2.68MetAla: 2.68 ± 0.843
1.34MetCys: 1.34 ± 0.766
2.68MetAsp: 2.68 ± 1.218
4.355MetGlu: 4.355 ± 1.31
0.0MetPhe: 0.0 ± 0.0
3.35MetGly: 3.35 ± 1.064
0.335MetHis: 0.335 ± 0.257
2.01MetIle: 2.01 ± 0.875
1.34MetLys: 1.34 ± 0.736
1.675MetLeu: 1.675 ± 0.537
1.34MetMet: 1.34 ± 0.592
1.005MetAsn: 1.005 ± 0.555
0.67MetPro: 0.67 ± 0.424
2.345MetGln: 2.345 ± 1.061
3.35MetArg: 3.35 ± 1.04
2.68MetSer: 2.68 ± 0.69
2.01MetThr: 2.01 ± 0.781
3.685MetVal: 3.685 ± 1.428
0.335MetTrp: 0.335 ± 0.257
0.335MetTyr: 0.335 ± 0.257
0.0MetXaa: 0.0 ± 0.0
Asn
4.355AsnAla: 4.355 ± 1.477
0.335AsnCys: 0.335 ± 0.321
2.68AsnAsp: 2.68 ± 0.564
3.685AsnGlu: 3.685 ± 0.623
1.34AsnPhe: 1.34 ± 0.426
5.695AsnGly: 5.695 ± 1.318
0.0AsnHis: 0.0 ± 0.0
3.015AsnIle: 3.015 ± 0.856
2.345AsnLys: 2.345 ± 0.601
1.675AsnLeu: 1.675 ± 0.621
1.675AsnMet: 1.675 ± 0.498
3.015AsnAsn: 3.015 ± 1.836
4.02AsnPro: 4.02 ± 0.642
2.345AsnGln: 2.345 ± 0.518
3.015AsnArg: 3.015 ± 0.853
2.68AsnSer: 2.68 ± 0.976
3.685AsnThr: 3.685 ± 0.74
2.01AsnVal: 2.01 ± 1.007
2.01AsnTrp: 2.01 ± 0.828
0.335AsnTyr: 0.335 ± 0.257
0.0AsnXaa: 0.0 ± 0.0
Pro
2.01ProAla: 2.01 ± 0.541
0.335ProCys: 0.335 ± 0.321
1.675ProAsp: 1.675 ± 0.58
2.345ProGlu: 2.345 ± 0.877
2.68ProPhe: 2.68 ± 0.564
3.35ProGly: 3.35 ± 1.008
0.335ProHis: 0.335 ± 0.321
2.68ProIle: 2.68 ± 0.758
2.01ProLys: 2.01 ± 0.764
3.35ProLeu: 3.35 ± 1.167
0.335ProMet: 0.335 ± 0.257
2.01ProAsn: 2.01 ± 0.517
1.34ProPro: 1.34 ± 0.756
1.675ProGln: 1.675 ± 0.895
2.68ProArg: 2.68 ± 1.055
3.35ProSer: 3.35 ± 1.265
2.01ProThr: 2.01 ± 0.681
1.675ProVal: 1.675 ± 0.649
0.0ProTrp: 0.0 ± 0.0
0.335ProTyr: 0.335 ± 0.321
0.0ProXaa: 0.0 ± 0.0
Gln
2.68GlnAla: 2.68 ± 1.588
1.34GlnCys: 1.34 ± 0.796
1.34GlnAsp: 1.34 ± 0.862
2.345GlnGlu: 2.345 ± 0.833
1.675GlnPhe: 1.675 ± 0.537
3.35GlnGly: 3.35 ± 0.752
1.005GlnHis: 1.005 ± 0.492
5.025GlnIle: 5.025 ± 0.952
2.345GlnLys: 2.345 ± 1.009
3.35GlnLeu: 3.35 ± 1.776
2.68GlnMet: 2.68 ± 1.179
4.02GlnAsn: 4.02 ± 0.758
0.335GlnPro: 0.335 ± 0.276
2.01GlnGln: 2.01 ± 0.836
3.35GlnArg: 3.35 ± 1.426
3.35GlnSer: 3.35 ± 1.393
2.01GlnThr: 2.01 ± 0.678
3.015GlnVal: 3.015 ± 1.158
0.67GlnTrp: 0.67 ± 0.515
0.67GlnTyr: 0.67 ± 0.313
0.0GlnXaa: 0.0 ± 0.0
Arg
4.355ArgAla: 4.355 ± 1.057
1.34ArgCys: 1.34 ± 0.794
4.02ArgAsp: 4.02 ± 1.166
3.685ArgGlu: 3.685 ± 0.767
2.345ArgPhe: 2.345 ± 0.951
8.04ArgGly: 8.04 ± 1.073
1.34ArgHis: 1.34 ± 0.534
4.69ArgIle: 4.69 ± 1.108
2.68ArgLys: 2.68 ± 1.01
4.69ArgLeu: 4.69 ± 1.06
5.36ArgMet: 5.36 ± 1.902
6.365ArgAsn: 6.365 ± 1.122
2.68ArgPro: 2.68 ± 0.779
3.015ArgGln: 3.015 ± 0.854
5.36ArgArg: 5.36 ± 1.423
3.35ArgSer: 3.35 ± 1.358
6.03ArgThr: 6.03 ± 1.185
4.02ArgVal: 4.02 ± 1.189
0.335ArgTrp: 0.335 ± 0.422
1.675ArgTyr: 1.675 ± 0.337
0.0ArgXaa: 0.0 ± 0.0
Ser
5.025SerAla: 5.025 ± 1.06
2.345SerCys: 2.345 ± 1.232
3.015SerAsp: 3.015 ± 1.08
3.685SerGlu: 3.685 ± 0.778
4.69SerPhe: 4.69 ± 1.321
8.71SerGly: 8.71 ± 1.961
0.335SerHis: 0.335 ± 0.417
4.02SerIle: 4.02 ± 0.984
2.68SerLys: 2.68 ± 1.13
7.37SerLeu: 7.37 ± 1.399
1.34SerMet: 1.34 ± 0.715
4.355SerAsn: 4.355 ± 1.565
2.68SerPro: 2.68 ± 0.613
4.355SerGln: 4.355 ± 1.104
2.68SerArg: 2.68 ± 0.853
10.72SerSer: 10.72 ± 1.576
4.02SerThr: 4.02 ± 0.992
3.685SerVal: 3.685 ± 1.131
0.67SerTrp: 0.67 ± 0.641
2.01SerTyr: 2.01 ± 0.744
0.0SerXaa: 0.0 ± 0.0
Thr
4.02ThrAla: 4.02 ± 0.435
1.34ThrCys: 1.34 ± 0.587
2.68ThrAsp: 2.68 ± 0.909
7.035ThrGlu: 7.035 ± 1.182
3.35ThrPhe: 3.35 ± 0.776
4.02ThrGly: 4.02 ± 1.174
0.67ThrHis: 0.67 ± 0.501
5.36ThrIle: 5.36 ± 1.455
3.35ThrLys: 3.35 ± 0.704
5.695ThrLeu: 5.695 ± 1.428
1.34ThrMet: 1.34 ± 0.592
2.345ThrAsn: 2.345 ± 0.795
1.675ThrPro: 1.675 ± 0.574
3.015ThrGln: 3.015 ± 0.982
5.36ThrArg: 5.36 ± 0.942
3.35ThrSer: 3.35 ± 0.941
3.35ThrThr: 3.35 ± 1.645
5.36ThrVal: 5.36 ± 1.385
0.335ThrTrp: 0.335 ± 0.257
2.68ThrTyr: 2.68 ± 0.442
0.0ThrXaa: 0.0 ± 0.0
Val
4.69ValAla: 4.69 ± 1.865
2.345ValCys: 2.345 ± 1.511
3.35ValAsp: 3.35 ± 1.424
2.345ValGlu: 2.345 ± 0.663
3.35ValPhe: 3.35 ± 0.67
3.685ValGly: 3.685 ± 0.754
1.675ValHis: 1.675 ± 0.649
2.345ValIle: 2.345 ± 0.91
3.015ValLys: 3.015 ± 0.457
4.69ValLeu: 4.69 ± 2.358
2.345ValMet: 2.345 ± 0.789
2.68ValAsn: 2.68 ± 0.877
3.015ValPro: 3.015 ± 1.122
2.345ValGln: 2.345 ± 1.02
5.025ValArg: 5.025 ± 1.825
4.69ValSer: 4.69 ± 1.299
2.345ValThr: 2.345 ± 0.948
3.015ValVal: 3.015 ± 0.933
1.005ValTrp: 1.005 ± 0.705
1.34ValTyr: 1.34 ± 0.471
0.0ValXaa: 0.0 ± 0.0
Trp
0.67TrpAla: 0.67 ± 0.429
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.675TrpGlu: 1.675 ± 0.799
0.67TrpPhe: 0.67 ± 0.427
0.67TrpGly: 0.67 ± 0.313
0.67TrpHis: 0.67 ± 0.381
0.67TrpIle: 0.67 ± 0.521
0.335TrpLys: 0.335 ± 0.321
1.675TrpLeu: 1.675 ± 0.67
1.34TrpMet: 1.34 ± 0.562
0.67TrpAsn: 0.67 ± 0.442
0.335TrpPro: 0.335 ± 0.321
0.0TrpGln: 0.0 ± 0.0
1.34TrpArg: 1.34 ± 0.722
2.345TrpSer: 2.345 ± 1.114
1.005TrpThr: 1.005 ± 0.626
0.335TrpVal: 0.335 ± 0.321
0.67TrpTrp: 0.67 ± 0.332
0.335TrpTyr: 0.335 ± 0.273
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.335TyrAla: 0.335 ± 0.273
0.0TyrCys: 0.0 ± 0.0
1.675TyrAsp: 1.675 ± 0.834
2.01TyrGlu: 2.01 ± 0.704
0.67TyrPhe: 0.67 ± 0.296
2.01TyrGly: 2.01 ± 0.579
0.0TyrHis: 0.0 ± 0.0
0.67TyrIle: 0.67 ± 0.296
1.005TyrLys: 1.005 ± 0.528
1.005TyrLeu: 1.005 ± 0.463
0.335TyrMet: 0.335 ± 0.257
1.34TyrAsn: 1.34 ± 0.532
1.675TyrPro: 1.675 ± 0.58
1.675TyrGln: 1.675 ± 0.345
2.68TyrArg: 2.68 ± 1.307
2.68TyrSer: 2.68 ± 0.476
0.67TyrThr: 0.67 ± 0.515
1.675TyrVal: 1.675 ± 0.923
0.335TyrTrp: 0.335 ± 0.276
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2986 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski