Amino acid dipepetide frequency for Influenza A virus (A/northern pintail/Interior Alaska/9BM4994R0/2009(H10N7))

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.869AlaAla: 2.869 ± 1.082
1.043AlaCys: 1.043 ± 0.569
1.565AlaAsp: 1.565 ± 0.377
2.608AlaGlu: 2.608 ± 0.991
1.826AlaPhe: 1.826 ± 0.773
4.173AlaGly: 4.173 ± 0.944
0.782AlaHis: 0.782 ± 0.468
2.608AlaIle: 2.608 ± 0.843
2.087AlaLys: 2.087 ± 0.642
5.216AlaLeu: 5.216 ± 1.188
2.347AlaMet: 2.347 ± 0.633
1.304AlaAsn: 1.304 ± 0.346
1.043AlaPro: 1.043 ± 0.34
1.826AlaGln: 1.826 ± 0.503
2.347AlaArg: 2.347 ± 0.464
3.391AlaSer: 3.391 ± 1.442
3.912AlaThr: 3.912 ± 0.855
3.13AlaVal: 3.13 ± 0.587
0.261AlaTrp: 0.261 ± 0.304
1.043AlaTyr: 1.043 ± 0.402
0.0AlaXaa: 0.0 ± 0.0
Cys
0.261CysAla: 0.261 ± 0.237
0.261CysCys: 0.261 ± 0.21
0.782CysAsp: 0.782 ± 0.504
0.261CysGlu: 0.261 ± 0.237
1.043CysPhe: 1.043 ± 0.552
0.261CysGly: 0.261 ± 0.218
1.304CysHis: 1.304 ± 0.402
0.782CysIle: 0.782 ± 0.481
1.304CysLys: 1.304 ± 0.557
1.304CysLeu: 1.304 ± 0.505
1.043CysMet: 1.043 ± 0.758
1.043CysAsn: 1.043 ± 0.401
0.261CysPro: 0.261 ± 0.294
0.0CysGln: 0.0 ± 0.0
0.261CysArg: 0.261 ± 0.218
1.043CysSer: 1.043 ± 0.439
1.043CysThr: 1.043 ± 0.475
1.826CysVal: 1.826 ± 1.017
0.0CysTrp: 0.0 ± 0.0
0.782CysTyr: 0.782 ± 0.481
0.0CysXaa: 0.0 ± 0.0
Asp
1.826AspAla: 1.826 ± 0.568
1.043AspCys: 1.043 ± 0.34
0.522AspAsp: 0.522 ± 0.413
2.347AspGlu: 2.347 ± 0.668
0.782AspPhe: 0.782 ± 0.28
2.347AspGly: 2.347 ± 0.767
0.522AspHis: 0.522 ± 0.286
1.043AspIle: 1.043 ± 0.444
2.347AspLys: 2.347 ± 0.53
3.13AspLeu: 3.13 ± 0.96
1.826AspMet: 1.826 ± 0.601
2.608AspAsn: 2.608 ± 0.906
2.608AspPro: 2.608 ± 0.687
1.565AspGln: 1.565 ± 0.866
2.869AspArg: 2.869 ± 0.862
3.13AspSer: 3.13 ± 0.929
1.565AspThr: 1.565 ± 0.385
2.347AspVal: 2.347 ± 0.456
0.522AspTrp: 0.522 ± 0.343
1.043AspTyr: 1.043 ± 0.401
0.0AspXaa: 0.0 ± 0.0
Glu
2.869GluAla: 2.869 ± 0.64
1.043GluCys: 1.043 ± 0.822
3.652GluAsp: 3.652 ± 0.956
5.738GluGlu: 5.738 ± 1.242
1.304GluPhe: 1.304 ± 0.518
2.347GluGly: 2.347 ± 1.033
1.304GluHis: 1.304 ± 0.552
4.434GluIle: 4.434 ± 0.964
3.13GluLys: 3.13 ± 1.101
4.434GluLeu: 4.434 ± 0.825
1.304GluMet: 1.304 ± 0.514
2.347GluAsn: 2.347 ± 1.087
0.782GluPro: 0.782 ± 0.263
3.912GluGln: 3.912 ± 1.554
3.912GluArg: 3.912 ± 1.25
5.216GluSer: 5.216 ± 0.793
3.652GluThr: 3.652 ± 0.859
3.652GluVal: 3.652 ± 1.173
0.782GluTrp: 0.782 ± 0.524
1.043GluTyr: 1.043 ± 0.353
0.0GluXaa: 0.0 ± 0.0
Phe
1.565PheAla: 1.565 ± 0.587
0.261PheCys: 0.261 ± 0.235
1.043PheAsp: 1.043 ± 0.531
4.434PheGlu: 4.434 ± 1.223
1.304PhePhe: 1.304 ± 0.447
1.826PheGly: 1.826 ± 0.326
0.522PheHis: 0.522 ± 0.301
0.782PheIle: 0.782 ± 0.28
0.522PheLys: 0.522 ± 0.413
2.347PheLeu: 2.347 ± 0.647
0.522PheMet: 0.522 ± 0.486
0.261PheAsn: 0.261 ± 0.211
1.304PhePro: 1.304 ± 0.361
1.826PheGln: 1.826 ± 0.91
0.782PheArg: 0.782 ± 0.292
2.347PheSer: 2.347 ± 0.476
1.304PheThr: 1.304 ± 0.438
2.087PheVal: 2.087 ± 0.648
0.261PheTrp: 0.261 ± 0.23
1.304PheTyr: 1.304 ± 0.36
0.0PheXaa: 0.0 ± 0.0
Gly
3.652GlyAla: 3.652 ± 0.714
0.522GlyCys: 0.522 ± 0.309
2.087GlyAsp: 2.087 ± 0.526
2.087GlyGlu: 2.087 ± 0.866
2.087GlyPhe: 2.087 ± 0.577
2.087GlyGly: 2.087 ± 0.914
1.826GlyHis: 1.826 ± 0.872
3.912GlyIle: 3.912 ± 0.768
2.869GlyLys: 2.869 ± 0.663
3.912GlyLeu: 3.912 ± 1.341
1.043GlyMet: 1.043 ± 0.411
3.391GlyAsn: 3.391 ± 0.949
2.869GlyPro: 2.869 ± 0.894
1.826GlyGln: 1.826 ± 0.632
3.912GlyArg: 3.912 ± 1.414
5.216GlySer: 5.216 ± 1.958
5.216GlyThr: 5.216 ± 0.902
3.652GlyVal: 3.652 ± 0.748
0.782GlyTrp: 0.782 ± 0.626
1.565GlyTyr: 1.565 ± 0.457
0.261GlyXaa: 0.261 ± 0.235
His
0.522HisAla: 0.522 ± 0.286
0.261HisCys: 0.261 ± 0.294
0.522HisAsp: 0.522 ± 0.437
0.261HisGlu: 0.261 ± 0.237
0.522HisPhe: 0.522 ± 0.362
1.304HisGly: 1.304 ± 0.419
0.522HisHis: 0.522 ± 0.47
0.782HisIle: 0.782 ± 0.382
1.565HisLys: 1.565 ± 0.494
1.565HisLeu: 1.565 ± 0.701
0.261HisMet: 0.261 ± 0.211
0.522HisAsn: 0.522 ± 0.437
1.304HisPro: 1.304 ± 0.503
0.782HisGln: 0.782 ± 0.286
1.043HisArg: 1.043 ± 0.65
1.304HisSer: 1.304 ± 0.574
1.043HisThr: 1.043 ± 0.552
0.261HisVal: 0.261 ± 0.304
0.0HisTrp: 0.0 ± 0.0
0.261HisTyr: 0.261 ± 0.211
0.0HisXaa: 0.0 ± 0.0
Ile
2.087IleAla: 2.087 ± 0.831
1.043IleCys: 1.043 ± 0.463
2.608IleAsp: 2.608 ± 0.614
4.173IleGlu: 4.173 ± 0.925
0.782IlePhe: 0.782 ± 0.404
3.13IleGly: 3.13 ± 0.826
0.522IleHis: 0.522 ± 0.309
2.087IleIle: 2.087 ± 0.667
2.347IleLys: 2.347 ± 0.65
5.477IleLeu: 5.477 ± 1.127
2.608IleMet: 2.608 ± 0.503
3.652IleAsn: 3.652 ± 0.759
1.043IlePro: 1.043 ± 0.391
1.565IleGln: 1.565 ± 0.478
4.956IleArg: 4.956 ± 1.543
2.608IleSer: 2.608 ± 0.838
3.391IleThr: 3.391 ± 1.163
2.087IleVal: 2.087 ± 0.855
1.043IleTrp: 1.043 ± 0.569
1.043IleTyr: 1.043 ± 0.49
0.0IleXaa: 0.0 ± 0.0
Lys
2.347LysAla: 2.347 ± 0.539
1.043LysCys: 1.043 ± 0.665
2.087LysAsp: 2.087 ± 0.591
3.13LysGlu: 3.13 ± 0.771
0.782LysPhe: 0.782 ± 0.449
2.869LysGly: 2.869 ± 0.762
0.522LysHis: 0.522 ± 0.272
2.608LysIle: 2.608 ± 0.867
1.826LysLys: 1.826 ± 0.823
3.652LysLeu: 3.652 ± 0.684
1.826LysMet: 1.826 ± 0.699
1.826LysAsn: 1.826 ± 0.615
0.261LysPro: 0.261 ± 0.218
2.087LysGln: 2.087 ± 1.365
4.173LysArg: 4.173 ± 1.193
2.869LysSer: 2.869 ± 0.732
2.608LysThr: 2.608 ± 0.73
1.826LysVal: 1.826 ± 0.665
1.565LysTrp: 1.565 ± 0.429
1.304LysTyr: 1.304 ± 0.517
0.261LysXaa: 0.261 ± 0.211
Leu
2.087LeuAla: 2.087 ± 0.7
1.826LeuCys: 1.826 ± 0.88
1.565LeuAsp: 1.565 ± 0.834
4.173LeuGlu: 4.173 ± 1.696
1.565LeuPhe: 1.565 ± 0.611
3.912LeuGly: 3.912 ± 0.982
1.565LeuHis: 1.565 ± 0.662
5.738LeuIle: 5.738 ± 1.687
6.26LeuLys: 6.26 ± 1.955
4.173LeuLeu: 4.173 ± 0.921
2.347LeuMet: 2.347 ± 0.733
3.652LeuAsn: 3.652 ± 0.944
2.347LeuPro: 2.347 ± 1.087
2.087LeuGln: 2.087 ± 0.95
3.912LeuArg: 3.912 ± 0.911
4.695LeuSer: 4.695 ± 1.212
4.956LeuThr: 4.956 ± 1.286
4.434LeuVal: 4.434 ± 0.728
1.304LeuTrp: 1.304 ± 0.386
2.087LeuTyr: 2.087 ± 0.456
0.261LeuXaa: 0.261 ± 0.21
Met
2.869MetAla: 2.869 ± 0.622
0.782MetCys: 0.782 ± 0.691
2.869MetAsp: 2.869 ± 1.28
4.956MetGlu: 4.956 ± 0.806
0.0MetPhe: 0.0 ± 0.0
2.087MetGly: 2.087 ± 0.897
0.261MetHis: 0.261 ± 0.294
1.565MetIle: 1.565 ± 0.473
1.304MetLys: 1.304 ± 0.608
1.304MetLeu: 1.304 ± 0.521
1.043MetMet: 1.043 ± 0.537
0.522MetAsn: 0.522 ± 0.437
0.522MetPro: 0.522 ± 0.343
1.304MetGln: 1.304 ± 0.833
2.608MetArg: 2.608 ± 0.893
2.347MetSer: 2.347 ± 0.733
1.043MetThr: 1.043 ± 0.779
2.347MetVal: 2.347 ± 0.922
0.261MetTrp: 0.261 ± 0.211
0.782MetTyr: 0.782 ± 0.286
0.0MetXaa: 0.0 ± 0.0
Asn
3.652AsnAla: 3.652 ± 1.037
0.522AsnCys: 0.522 ± 0.47
1.826AsnAsp: 1.826 ± 0.391
2.869AsnGlu: 2.869 ± 0.613
1.304AsnPhe: 1.304 ± 0.475
3.912AsnGly: 3.912 ± 1.159
0.0AsnHis: 0.0 ± 0.0
1.826AsnIle: 1.826 ± 0.652
1.826AsnLys: 1.826 ± 0.739
2.347AsnLeu: 2.347 ± 0.474
2.087AsnMet: 2.087 ± 0.479
3.652AsnAsn: 3.652 ± 1.204
3.652AsnPro: 3.652 ± 0.611
1.043AsnGln: 1.043 ± 0.441
2.347AsnArg: 2.347 ± 0.643
3.391AsnSer: 3.391 ± 0.92
4.434AsnThr: 4.434 ± 0.98
1.565AsnVal: 1.565 ± 0.683
1.304AsnTrp: 1.304 ± 0.658
0.782AsnTyr: 0.782 ± 0.322
0.0AsnXaa: 0.0 ± 0.0
Pro
2.347ProAla: 2.347 ± 0.755
0.261ProCys: 0.261 ± 0.218
0.522ProAsp: 0.522 ± 0.272
2.347ProGlu: 2.347 ± 0.741
1.565ProPhe: 1.565 ± 0.529
1.826ProGly: 1.826 ± 0.389
0.0ProHis: 0.0 ± 0.0
2.087ProIle: 2.087 ± 0.41
1.826ProLys: 1.826 ± 0.678
2.869ProLeu: 2.869 ± 0.934
0.0ProMet: 0.0 ± 0.0
2.087ProAsn: 2.087 ± 0.738
1.304ProPro: 1.304 ± 0.589
0.522ProGln: 0.522 ± 0.271
1.043ProArg: 1.043 ± 0.691
3.13ProSer: 3.13 ± 0.9
1.565ProThr: 1.565 ± 0.521
2.087ProVal: 2.087 ± 0.524
0.261ProTrp: 0.261 ± 0.294
0.782ProTyr: 0.782 ± 0.401
0.0ProXaa: 0.0 ± 0.0
Gln
2.608GlnAla: 2.608 ± 1.197
0.0GlnCys: 0.0 ± 0.0
1.565GlnAsp: 1.565 ± 0.738
2.608GlnGlu: 2.608 ± 1.313
0.522GlnPhe: 0.522 ± 0.404
2.869GlnGly: 2.869 ± 0.735
0.522GlnHis: 0.522 ± 0.345
2.869GlnIle: 2.869 ± 0.573
2.869GlnLys: 2.869 ± 1.237
2.608GlnLeu: 2.608 ± 1.439
2.347GlnMet: 2.347 ± 0.988
2.347GlnAsn: 2.347 ± 0.807
0.261GlnPro: 0.261 ± 0.23
1.304GlnGln: 1.304 ± 0.481
2.608GlnArg: 2.608 ± 1.124
1.826GlnSer: 1.826 ± 0.758
2.869GlnThr: 2.869 ± 0.941
1.304GlnVal: 1.304 ± 0.621
0.782GlnTrp: 0.782 ± 0.46
1.304GlnTyr: 1.304 ± 0.398
0.0GlnXaa: 0.0 ± 0.0
Arg
2.347ArgAla: 2.347 ± 1.338
0.522ArgCys: 0.522 ± 0.271
2.608ArgAsp: 2.608 ± 0.814
2.347ArgGlu: 2.347 ± 0.793
2.087ArgPhe: 2.087 ± 0.695
5.477ArgGly: 5.477 ± 1.513
0.782ArgHis: 0.782 ± 0.511
2.869ArgIle: 2.869 ± 0.984
1.304ArgLys: 1.304 ± 0.596
4.173ArgLeu: 4.173 ± 1.164
4.173ArgMet: 4.173 ± 1.676
4.695ArgAsn: 4.695 ± 0.756
2.087ArgPro: 2.087 ± 0.776
2.869ArgGln: 2.869 ± 0.573
4.173ArgArg: 4.173 ± 1.268
3.13ArgSer: 3.13 ± 1.188
4.434ArgThr: 4.434 ± 1.263
2.608ArgVal: 2.608 ± 0.678
0.522ArgTrp: 0.522 ± 0.518
1.304ArgTyr: 1.304 ± 0.453
0.0ArgXaa: 0.0 ± 0.0
Ser
3.652SerAla: 3.652 ± 0.943
1.826SerCys: 1.826 ± 0.599
2.608SerAsp: 2.608 ± 0.977
3.391SerGlu: 3.391 ± 0.83
3.652SerPhe: 3.652 ± 1.194
4.434SerGly: 4.434 ± 0.729
1.043SerHis: 1.043 ± 0.637
3.912SerIle: 3.912 ± 1.184
2.347SerLys: 2.347 ± 0.653
5.477SerLeu: 5.477 ± 2.038
1.826SerMet: 1.826 ± 0.653
3.912SerAsn: 3.912 ± 1.584
2.608SerPro: 2.608 ± 1.004
3.652SerGln: 3.652 ± 0.904
2.869SerArg: 2.869 ± 0.672
7.303SerSer: 7.303 ± 1.234
4.695SerThr: 4.695 ± 0.867
2.869SerVal: 2.869 ± 0.966
0.261SerTrp: 0.261 ± 0.21
1.304SerTyr: 1.304 ± 0.591
0.0SerXaa: 0.0 ± 0.0
Thr
2.869ThrAla: 2.869 ± 0.656
1.304ThrCys: 1.304 ± 0.616
2.608ThrAsp: 2.608 ± 0.802
4.173ThrGlu: 4.173 ± 1.015
2.087ThrPhe: 2.087 ± 0.836
4.434ThrGly: 4.434 ± 1.019
1.043ThrHis: 1.043 ± 0.471
4.173ThrIle: 4.173 ± 1.057
2.347ThrLys: 2.347 ± 0.76
3.912ThrLeu: 3.912 ± 1.333
1.565ThrMet: 1.565 ± 0.548
2.608ThrAsn: 2.608 ± 0.839
1.043ThrPro: 1.043 ± 0.625
3.13ThrGln: 3.13 ± 1.032
3.912ThrArg: 3.912 ± 1.292
2.608ThrSer: 2.608 ± 0.798
4.173ThrThr: 4.173 ± 1.094
4.695ThrVal: 4.695 ± 0.923
1.043ThrTrp: 1.043 ± 0.47
3.391ThrTyr: 3.391 ± 0.985
0.0ThrXaa: 0.0 ± 0.0
Val
3.13ValAla: 3.13 ± 0.904
1.043ValCys: 1.043 ± 0.43
2.608ValAsp: 2.608 ± 0.892
2.608ValGlu: 2.608 ± 0.627
1.826ValPhe: 1.826 ± 0.633
3.391ValGly: 3.391 ± 0.8
0.782ValHis: 0.782 ± 0.444
2.869ValIle: 2.869 ± 1.064
2.087ValLys: 2.087 ± 0.495
4.695ValLeu: 4.695 ± 1.083
1.304ValMet: 1.304 ± 0.522
2.087ValAsn: 2.087 ± 0.58
2.347ValPro: 2.347 ± 0.626
2.347ValGln: 2.347 ± 0.866
2.608ValArg: 2.608 ± 0.864
4.956ValSer: 4.956 ± 0.822
2.087ValThr: 2.087 ± 0.641
3.13ValVal: 3.13 ± 0.83
0.522ValTrp: 0.522 ± 0.292
0.782ValTyr: 0.782 ± 0.292
0.0ValXaa: 0.0 ± 0.0
Trp
0.782TrpAla: 0.782 ± 0.461
0.0TrpCys: 0.0 ± 0.0
0.522TrpAsp: 0.522 ± 0.309
1.304TrpGlu: 1.304 ± 0.542
0.522TrpPhe: 0.522 ± 0.348
0.261TrpGly: 0.261 ± 0.235
0.522TrpHis: 0.522 ± 0.396
0.782TrpIle: 0.782 ± 0.361
0.261TrpLys: 0.261 ± 0.294
1.304TrpLeu: 1.304 ± 0.643
0.782TrpMet: 0.782 ± 0.453
0.522TrpAsn: 0.522 ± 0.313
0.0TrpPro: 0.0 ± 0.0
0.261TrpGln: 0.261 ± 0.218
1.043TrpArg: 1.043 ± 0.692
1.043TrpSer: 1.043 ± 0.543
1.826TrpThr: 1.826 ± 0.695
0.261TrpVal: 0.261 ± 0.218
0.522TrpTrp: 0.522 ± 0.272
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.782TyrAla: 0.782 ± 0.286
0.261TyrCys: 0.261 ± 0.235
1.826TyrAsp: 1.826 ± 0.627
1.043TyrGlu: 1.043 ± 0.671
1.043TyrPhe: 1.043 ± 0.441
1.826TyrGly: 1.826 ± 0.442
0.261TyrHis: 0.261 ± 0.235
0.522TyrIle: 0.522 ± 0.28
1.043TyrLys: 1.043 ± 0.698
1.043TyrLeu: 1.043 ± 0.582
0.0TyrMet: 0.0 ± 0.0
1.304TyrAsn: 1.304 ± 0.371
0.782TyrPro: 0.782 ± 0.444
1.826TyrGln: 1.826 ± 0.468
3.13TyrArg: 3.13 ± 1.047
2.087TyrSer: 2.087 ± 0.509
1.304TyrThr: 1.304 ± 0.57
1.304TyrVal: 1.304 ± 0.876
0.522TyrTrp: 0.522 ± 0.292
0.261TyrTyr: 0.261 ± 0.218
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.261XaaAla: 0.261 ± 0.21
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.261XaaMet: 0.261 ± 0.211
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.261XaaGln: 0.261 ± 0.235
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
198.748XaaXaa: 198.748 ± 76.967
Statistics based on 10 proteins (3835 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski