Amino acid dipepetide frequency for Influenza A virus (strain A/Duck/Hokkaido/8/1980 H3N8)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.953AlaAla: 3.953 ± 0.998
1.664AlaCys: 1.664 ± 0.83
2.496AlaAsp: 2.496 ± 0.587
3.745AlaGlu: 3.745 ± 0.843
1.664AlaPhe: 1.664 ± 0.733
3.537AlaGly: 3.537 ± 1.121
0.624AlaHis: 0.624 ± 0.466
4.369AlaIle: 4.369 ± 0.918
2.08AlaLys: 2.08 ± 0.758
5.825AlaLeu: 5.825 ± 1.249
2.704AlaMet: 2.704 ± 0.955
2.704AlaAsn: 2.704 ± 0.801
2.496AlaPro: 2.496 ± 0.577
1.664AlaGln: 1.664 ± 0.629
3.12AlaArg: 3.12 ± 0.537
5.201AlaSer: 5.201 ± 1.367
5.201AlaThr: 5.201 ± 0.885
2.912AlaVal: 2.912 ± 0.755
1.04AlaTrp: 1.04 ± 0.499
0.832AlaTyr: 0.832 ± 0.382
0.0AlaXaa: 0.0 ± 0.0
Cys
0.832CysAla: 0.832 ± 0.425
0.208CysCys: 0.208 ± 0.188
0.832CysAsp: 0.832 ± 0.52
0.624CysGlu: 0.624 ± 0.337
1.456CysPhe: 1.456 ± 0.651
0.208CysGly: 0.208 ± 0.222
0.832CysHis: 0.832 ± 0.306
1.248CysIle: 1.248 ± 0.71
1.248CysLys: 1.248 ± 0.361
1.664CysLeu: 1.664 ± 0.462
0.624CysMet: 0.624 ± 0.377
1.664CysAsn: 1.664 ± 0.444
0.208CysPro: 0.208 ± 0.216
0.416CysGln: 0.416 ± 0.292
1.248CysArg: 1.248 ± 0.731
1.248CysSer: 1.248 ± 0.472
1.248CysThr: 1.248 ± 0.47
1.248CysVal: 1.248 ± 0.4
0.208CysTrp: 0.208 ± 0.162
0.832CysTyr: 0.832 ± 0.523
0.0CysXaa: 0.0 ± 0.0
Asp
2.912AspAla: 2.912 ± 0.458
1.664AspCys: 1.664 ± 0.441
1.456AspAsp: 1.456 ± 0.395
3.745AspGlu: 3.745 ± 0.682
2.08AspPhe: 2.08 ± 0.855
3.328AspGly: 3.328 ± 1.009
0.416AspHis: 0.416 ± 0.242
2.288AspIle: 2.288 ± 0.698
2.08AspLys: 2.08 ± 0.571
4.993AspLeu: 4.993 ± 1.013
1.872AspMet: 1.872 ± 0.545
2.912AspAsn: 2.912 ± 0.785
3.745AspPro: 3.745 ± 0.965
1.872AspGln: 1.872 ± 0.711
2.08AspArg: 2.08 ± 0.397
3.12AspSer: 3.12 ± 0.83
2.08AspThr: 2.08 ± 0.607
3.953AspVal: 3.953 ± 0.736
0.624AspTrp: 0.624 ± 0.319
1.872AspTyr: 1.872 ± 0.463
0.0AspXaa: 0.0 ± 0.0
Glu
2.496GluAla: 2.496 ± 0.592
1.456GluCys: 1.456 ± 0.72
5.201GluAsp: 5.201 ± 0.906
5.617GluGlu: 5.617 ± 1.153
1.872GluPhe: 1.872 ± 0.598
4.577GluGly: 4.577 ± 1.236
0.832GluHis: 0.832 ± 0.541
4.785GluIle: 4.785 ± 1.142
6.449GluLys: 6.449 ± 1.456
5.201GluLeu: 5.201 ± 0.847
2.704GluMet: 2.704 ± 0.617
3.953GluAsn: 3.953 ± 0.982
2.496GluPro: 2.496 ± 1.213
3.745GluGln: 3.745 ± 1.201
4.161GluArg: 4.161 ± 1.243
6.449GluSer: 6.449 ± 1.448
3.953GluThr: 3.953 ± 0.51
5.409GluVal: 5.409 ± 1.282
1.04GluTrp: 1.04 ± 0.486
1.872GluTyr: 1.872 ± 0.458
0.0GluXaa: 0.0 ± 0.0
Phe
1.872PheAla: 1.872 ± 0.332
0.208PheCys: 0.208 ± 0.216
1.664PheAsp: 1.664 ± 0.546
4.993PheGlu: 4.993 ± 1.242
1.664PhePhe: 1.664 ± 0.436
1.664PheGly: 1.664 ± 0.266
0.832PheHis: 0.832 ± 0.413
2.08PheIle: 2.08 ± 0.759
1.04PheLys: 1.04 ± 0.567
3.745PheLeu: 3.745 ± 0.83
0.416PheMet: 0.416 ± 0.281
2.08PheAsn: 2.08 ± 0.794
0.624PhePro: 0.624 ± 0.377
2.912PheGln: 2.912 ± 0.805
1.872PheArg: 1.872 ± 0.414
4.161PheSer: 4.161 ± 0.647
3.12PheThr: 3.12 ± 0.442
3.328PheVal: 3.328 ± 0.715
0.416PheTrp: 0.416 ± 0.291
0.624PheTyr: 0.624 ± 0.426
0.0PheXaa: 0.0 ± 0.0
Gly
2.288GlyAla: 2.288 ± 0.668
0.416GlyCys: 0.416 ± 0.264
3.12GlyAsp: 3.12 ± 0.369
3.745GlyGlu: 3.745 ± 1.606
3.745GlyPhe: 3.745 ± 1.024
3.953GlyGly: 3.953 ± 0.977
1.248GlyHis: 1.248 ± 0.556
4.577GlyIle: 4.577 ± 0.845
4.369GlyLys: 4.369 ± 0.916
4.577GlyLeu: 4.577 ± 1.137
2.08GlyMet: 2.08 ± 0.549
2.912GlyAsn: 2.912 ± 0.867
3.328GlyPro: 3.328 ± 0.615
2.496GlyGln: 2.496 ± 0.539
4.785GlyArg: 4.785 ± 0.982
3.953GlySer: 3.953 ± 1.179
5.201GlyThr: 5.201 ± 0.644
5.201GlyVal: 5.201 ± 0.547
1.04GlyTrp: 1.04 ± 0.53
2.288GlyTyr: 2.288 ± 0.811
0.0GlyXaa: 0.0 ± 0.0
His
0.624HisAla: 0.624 ± 0.269
0.416HisCys: 0.416 ± 0.264
0.624HisAsp: 0.624 ± 0.482
1.04HisGlu: 1.04 ± 0.452
1.248HisPhe: 1.248 ± 0.386
0.832HisGly: 0.832 ± 0.438
0.416HisHis: 0.416 ± 0.431
2.08HisIle: 2.08 ± 0.939
1.248HisLys: 1.248 ± 0.47
1.248HisLeu: 1.248 ± 0.457
0.208HisMet: 0.208 ± 0.162
0.208HisAsn: 0.208 ± 0.222
1.04HisPro: 1.04 ± 0.411
0.832HisGln: 0.832 ± 0.427
1.456HisArg: 1.456 ± 0.691
1.456HisSer: 1.456 ± 0.55
0.416HisThr: 0.416 ± 0.305
0.416HisVal: 0.416 ± 0.444
0.0HisTrp: 0.0 ± 0.0
0.416HisTyr: 0.416 ± 0.249
0.0HisXaa: 0.0 ± 0.0
Ile
3.953IleAla: 3.953 ± 0.778
2.288IleCys: 2.288 ± 0.585
3.745IleAsp: 3.745 ± 1.327
6.657IleGlu: 6.657 ± 2.016
1.04IlePhe: 1.04 ± 0.288
4.161IleGly: 4.161 ± 0.918
0.416IleHis: 0.416 ± 0.281
3.745IleIle: 3.745 ± 0.922
3.537IleLys: 3.537 ± 1.107
6.033IleLeu: 6.033 ± 1.474
2.288IleMet: 2.288 ± 0.558
2.912IleAsn: 2.912 ± 0.729
2.08IlePro: 2.08 ± 0.656
2.496IleGln: 2.496 ± 0.57
6.241IleArg: 6.241 ± 1.297
2.704IleSer: 2.704 ± 0.804
4.369IleThr: 4.369 ± 1.1
3.953IleVal: 3.953 ± 0.711
0.832IleTrp: 0.832 ± 0.468
2.08IleTyr: 2.08 ± 0.736
0.0IleXaa: 0.0 ± 0.0
Lys
4.577LysAla: 4.577 ± 1.063
1.456LysCys: 1.456 ± 0.606
2.912LysAsp: 2.912 ± 0.49
4.369LysGlu: 4.369 ± 1.202
1.872LysPhe: 1.872 ± 0.628
2.704LysGly: 2.704 ± 0.452
0.832LysHis: 0.832 ± 0.33
5.201LysIle: 5.201 ± 0.832
3.537LysLys: 3.537 ± 1.483
4.369LysLeu: 4.369 ± 1.005
2.912LysMet: 2.912 ± 0.554
1.248LysAsn: 1.248 ± 0.378
1.248LysPro: 1.248 ± 0.564
1.872LysGln: 1.872 ± 0.626
4.993LysArg: 4.993 ± 1.371
3.12LysSer: 3.12 ± 0.882
4.993LysThr: 4.993 ± 1.298
1.872LysVal: 1.872 ± 0.587
1.872LysTrp: 1.872 ± 0.617
1.664LysTyr: 1.664 ± 0.431
0.0LysXaa: 0.0 ± 0.0
Leu
4.369LeuAla: 4.369 ± 0.96
0.832LeuCys: 0.832 ± 0.472
1.248LeuAsp: 1.248 ± 0.546
6.449LeuGlu: 6.449 ± 1.283
2.704LeuPhe: 2.704 ± 0.731
3.537LeuGly: 3.537 ± 0.61
1.04LeuHis: 1.04 ± 0.43
5.409LeuIle: 5.409 ± 1.282
7.697LeuLys: 7.697 ± 1.57
6.449LeuLeu: 6.449 ± 1.534
2.288LeuMet: 2.288 ± 0.571
4.161LeuAsn: 4.161 ± 1.045
3.953LeuPro: 3.953 ± 0.601
2.912LeuGln: 2.912 ± 1.5
5.409LeuArg: 5.409 ± 1.167
4.785LeuSer: 4.785 ± 1.004
5.409LeuThr: 5.409 ± 1.469
4.369LeuVal: 4.369 ± 1.079
1.664LeuTrp: 1.664 ± 0.515
2.704LeuTyr: 2.704 ± 1.064
0.0LeuXaa: 0.0 ± 0.0
Met
3.745MetAla: 3.745 ± 0.86
1.248MetCys: 1.248 ± 0.676
3.328MetAsp: 3.328 ± 0.984
4.577MetGlu: 4.577 ± 1.158
1.04MetPhe: 1.04 ± 0.72
2.288MetGly: 2.288 ± 1.073
0.208MetHis: 0.208 ± 0.162
2.912MetIle: 2.912 ± 0.459
2.496MetLys: 2.496 ± 0.959
1.248MetLeu: 1.248 ± 0.42
1.664MetMet: 1.664 ± 0.653
0.832MetAsn: 0.832 ± 0.523
0.832MetPro: 0.832 ± 0.461
1.04MetGln: 1.04 ± 0.539
2.704MetArg: 2.704 ± 0.707
2.288MetSer: 2.288 ± 0.51
1.456MetThr: 1.456 ± 0.587
3.12MetVal: 3.12 ± 1.239
0.624MetTrp: 0.624 ± 0.31
0.832MetTyr: 0.832 ± 0.283
0.0MetXaa: 0.0 ± 0.0
Asn
4.369AsnAla: 4.369 ± 1.148
0.208AsnCys: 0.208 ± 0.216
2.704AsnAsp: 2.704 ± 0.564
3.537AsnGlu: 3.537 ± 1.074
1.664AsnPhe: 1.664 ± 0.523
4.993AsnGly: 4.993 ± 1.612
0.208AsnHis: 0.208 ± 0.188
2.496AsnIle: 2.496 ± 0.649
2.704AsnLys: 2.704 ± 0.711
3.12AsnLeu: 3.12 ± 0.456
2.08AsnMet: 2.08 ± 0.706
2.08AsnAsn: 2.08 ± 0.771
3.953AsnPro: 3.953 ± 0.733
2.288AsnGln: 2.288 ± 0.756
3.537AsnArg: 3.537 ± 0.799
3.12AsnSer: 3.12 ± 0.513
3.537AsnThr: 3.537 ± 0.684
2.288AsnVal: 2.288 ± 0.804
1.04AsnTrp: 1.04 ± 0.505
0.624AsnTyr: 0.624 ± 0.432
0.0AsnXaa: 0.0 ± 0.0
Pro
2.912ProAla: 2.912 ± 1.027
0.624ProCys: 0.624 ± 0.329
1.664ProAsp: 1.664 ± 0.559
2.704ProGlu: 2.704 ± 0.528
2.496ProPhe: 2.496 ± 0.796
2.08ProGly: 2.08 ± 0.527
0.832ProHis: 0.832 ± 0.528
2.288ProIle: 2.288 ± 0.412
2.496ProLys: 2.496 ± 0.752
2.704ProLeu: 2.704 ± 0.968
1.456ProMet: 1.456 ± 0.714
3.953ProAsn: 3.953 ± 1.006
1.248ProPro: 1.248 ± 0.512
0.416ProGln: 0.416 ± 0.24
2.496ProArg: 2.496 ± 0.733
3.12ProSer: 3.12 ± 0.846
2.08ProThr: 2.08 ± 0.621
1.664ProVal: 1.664 ± 0.584
0.624ProTrp: 0.624 ± 0.335
1.04ProTyr: 1.04 ± 0.442
0.0ProXaa: 0.0 ± 0.0
Gln
3.328GlnAla: 3.328 ± 1.269
0.416GlnCys: 0.416 ± 0.281
2.08GlnAsp: 2.08 ± 0.659
2.08GlnGlu: 2.08 ± 0.771
0.624GlnPhe: 0.624 ± 0.371
3.12GlnGly: 3.12 ± 1.201
0.416GlnHis: 0.416 ± 0.347
3.12GlnIle: 3.12 ± 0.752
2.496GlnLys: 2.496 ± 0.763
3.328GlnLeu: 3.328 ± 0.918
2.288GlnMet: 2.288 ± 1.218
2.912GlnAsn: 2.912 ± 0.965
0.624GlnPro: 0.624 ± 0.426
1.456GlnGln: 1.456 ± 0.354
3.745GlnArg: 3.745 ± 1.191
3.12GlnSer: 3.12 ± 0.849
2.912GlnThr: 2.912 ± 0.892
1.664GlnVal: 1.664 ± 0.593
0.832GlnTrp: 0.832 ± 0.424
0.832GlnTyr: 0.832 ± 0.266
0.0GlnXaa: 0.0 ± 0.0
Arg
4.161ArgAla: 4.161 ± 1.063
0.624ArgCys: 0.624 ± 0.265
3.745ArgAsp: 3.745 ± 0.827
3.328ArgGlu: 3.328 ± 0.8
2.912ArgPhe: 2.912 ± 0.658
7.489ArgGly: 7.489 ± 0.968
0.624ArgHis: 0.624 ± 0.365
3.745ArgIle: 3.745 ± 0.822
2.288ArgLys: 2.288 ± 0.561
4.785ArgLeu: 4.785 ± 0.813
3.328ArgMet: 3.328 ± 1.773
4.785ArgAsn: 4.785 ± 1.066
1.872ArgPro: 1.872 ± 0.429
4.161ArgGln: 4.161 ± 0.716
6.033ArgArg: 6.033 ± 1.082
4.369ArgSer: 4.369 ± 1.174
6.449ArgThr: 6.449 ± 1.15
3.537ArgVal: 3.537 ± 0.935
0.416ArgTrp: 0.416 ± 0.376
1.456ArgTyr: 1.456 ± 0.649
0.0ArgXaa: 0.0 ± 0.0
Ser
2.912SerAla: 2.912 ± 1.259
1.872SerCys: 1.872 ± 0.806
2.912SerAsp: 2.912 ± 0.652
3.745SerGlu: 3.745 ± 1.065
4.577SerPhe: 4.577 ± 0.932
5.617SerGly: 5.617 ± 1.121
1.664SerHis: 1.664 ± 0.699
5.825SerIle: 5.825 ± 0.696
3.12SerLys: 3.12 ± 0.919
5.617SerLeu: 5.617 ± 1.162
2.288SerMet: 2.288 ± 0.893
2.912SerAsn: 2.912 ± 0.903
3.328SerPro: 3.328 ± 1.089
3.953SerGln: 3.953 ± 0.835
3.537SerArg: 3.537 ± 0.737
6.865SerSer: 6.865 ± 1.179
4.785SerThr: 4.785 ± 1.321
2.912SerVal: 2.912 ± 0.841
1.456SerTrp: 1.456 ± 0.687
1.872SerTyr: 1.872 ± 0.671
0.0SerXaa: 0.0 ± 0.0
Thr
3.328ThrAla: 3.328 ± 0.488
1.04ThrCys: 1.04 ± 0.351
2.912ThrAsp: 2.912 ± 0.902
4.993ThrGlu: 4.993 ± 1.012
2.496ThrPhe: 2.496 ± 0.78
5.409ThrGly: 5.409 ± 0.982
2.288ThrHis: 2.288 ± 0.735
4.577ThrIle: 4.577 ± 1.054
4.161ThrLys: 4.161 ± 0.684
4.577ThrLeu: 4.577 ± 0.597
2.704ThrMet: 2.704 ± 0.532
3.537ThrAsn: 3.537 ± 0.722
1.664ThrPro: 1.664 ± 0.376
3.328ThrGln: 3.328 ± 1.036
4.785ThrArg: 4.785 ± 0.905
3.745ThrSer: 3.745 ± 1.029
2.704ThrThr: 2.704 ± 1.253
4.369ThrVal: 4.369 ± 1.218
1.04ThrTrp: 1.04 ± 0.453
2.704ThrTyr: 2.704 ± 0.764
0.0ThrXaa: 0.0 ± 0.0
Val
3.537ValAla: 3.537 ± 0.984
1.248ValCys: 1.248 ± 0.471
3.328ValAsp: 3.328 ± 0.92
5.409ValGlu: 5.409 ± 1.217
2.496ValPhe: 2.496 ± 0.632
2.704ValGly: 2.704 ± 1.149
1.456ValHis: 1.456 ± 0.498
1.456ValIle: 1.456 ± 0.61
2.912ValLys: 2.912 ± 0.818
5.617ValLeu: 5.617 ± 1.253
2.08ValMet: 2.08 ± 0.634
2.912ValAsn: 2.912 ± 0.662
2.704ValPro: 2.704 ± 0.924
2.288ValGln: 2.288 ± 0.941
4.369ValArg: 4.369 ± 1.249
5.617ValSer: 5.617 ± 0.826
2.912ValThr: 2.912 ± 0.923
4.161ValVal: 4.161 ± 1.545
0.416ValTrp: 0.416 ± 0.291
1.04ValTyr: 1.04 ± 0.367
0.0ValXaa: 0.0 ± 0.0
Trp
1.04TrpAla: 1.04 ± 0.28
0.0TrpCys: 0.0 ± 0.0
0.832TrpAsp: 0.832 ± 0.306
1.664TrpGlu: 1.664 ± 0.492
0.416TrpPhe: 0.416 ± 0.24
0.832TrpGly: 0.832 ± 0.32
0.832TrpHis: 0.832 ± 0.511
1.664TrpIle: 1.664 ± 0.52
0.624TrpLys: 0.624 ± 0.473
0.832TrpLeu: 0.832 ± 0.529
1.248TrpMet: 1.248 ± 0.557
0.832TrpAsn: 0.832 ± 0.307
0.208TrpPro: 0.208 ± 0.212
0.0TrpGln: 0.0 ± 0.0
0.832TrpArg: 0.832 ± 0.599
1.248TrpSer: 1.248 ± 0.685
1.872TrpThr: 1.872 ± 0.746
0.832TrpVal: 0.832 ± 0.474
0.416TrpTrp: 0.416 ± 0.221
0.416TrpTyr: 0.416 ± 0.299
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.624TyrAla: 0.624 ± 0.265
0.208TyrCys: 0.208 ± 0.212
2.288TyrAsp: 2.288 ± 0.693
1.248TyrGlu: 1.248 ± 0.607
1.248TyrPhe: 1.248 ± 0.328
2.288TyrGly: 2.288 ± 0.489
0.208TyrHis: 0.208 ± 0.216
1.664TyrIle: 1.664 ± 0.435
1.04TyrLys: 1.04 ± 0.342
1.456TyrLeu: 1.456 ± 0.427
1.04TyrMet: 1.04 ± 0.438
1.248TyrAsn: 1.248 ± 0.403
1.248TyrPro: 1.248 ± 0.536
1.248TyrGln: 1.248 ± 0.444
2.704TyrArg: 2.704 ± 1.065
2.08TyrSer: 2.08 ± 0.462
1.664TyrThr: 1.664 ± 0.693
1.456TyrVal: 1.456 ± 0.669
1.04TyrTrp: 1.04 ± 0.484
0.416TyrTyr: 0.416 ± 0.281
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (4808 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski