Amino acid dipepetide frequency for Influenza A virus (A/chicken/Korea/S21/2004(H9N2))

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.914AlaAla: 3.914 ± 0.924
0.391AlaCys: 0.391 ± 0.324
4.305AlaAsp: 4.305 ± 1.085
1.957AlaGlu: 1.957 ± 0.838
1.957AlaPhe: 1.957 ± 0.619
5.479AlaGly: 5.479 ± 1.546
0.0AlaHis: 0.0 ± 0.0
4.697AlaIle: 4.697 ± 1.202
1.957AlaLys: 1.957 ± 0.775
5.871AlaLeu: 5.871 ± 1.379
3.523AlaMet: 3.523 ± 1.227
1.957AlaAsn: 1.957 ± 0.818
1.957AlaPro: 1.957 ± 0.536
1.566AlaGln: 1.566 ± 0.484
3.131AlaArg: 3.131 ± 0.756
5.088AlaSer: 5.088 ± 1.266
5.871AlaThr: 5.871 ± 1.13
2.74AlaVal: 2.74 ± 0.753
1.174AlaTrp: 1.174 ± 0.698
1.566AlaTyr: 1.566 ± 0.563
0.0AlaXaa: 0.0 ± 0.0
Cys
0.783CysAla: 0.783 ± 0.488
0.0CysCys: 0.0 ± 0.0
0.391CysAsp: 0.391 ± 0.324
1.174CysGlu: 1.174 ± 0.623
1.957CysPhe: 1.957 ± 1.102
0.0CysGly: 0.0 ± 0.0
0.783CysHis: 0.783 ± 0.425
2.348CysIle: 2.348 ± 0.922
1.174CysLys: 1.174 ± 0.772
0.391CysLeu: 0.391 ± 0.375
0.783CysMet: 0.783 ± 0.543
0.783CysAsn: 0.783 ± 0.441
0.783CysPro: 0.783 ± 0.465
0.783CysGln: 0.783 ± 0.465
0.0CysArg: 0.0 ± 0.0
1.566CysSer: 1.566 ± 0.718
1.566CysThr: 1.566 ± 0.518
1.566CysVal: 1.566 ± 0.812
0.391CysTrp: 0.391 ± 0.349
1.174CysTyr: 1.174 ± 0.561
0.0CysXaa: 0.0 ± 0.0
Asp
3.131AspAla: 3.131 ± 0.791
1.174AspCys: 1.174 ± 0.518
1.566AspAsp: 1.566 ± 0.719
2.348AspGlu: 2.348 ± 1.016
1.174AspPhe: 1.174 ± 0.748
3.131AspGly: 3.131 ± 1.221
0.783AspHis: 0.783 ± 0.431
2.348AspIle: 2.348 ± 1.109
1.957AspLys: 1.957 ± 0.833
3.523AspLeu: 3.523 ± 1.115
0.391AspMet: 0.391 ± 0.381
2.348AspAsn: 2.348 ± 1.291
3.131AspPro: 3.131 ± 1.217
2.348AspGln: 2.348 ± 1.183
3.914AspArg: 3.914 ± 0.733
2.348AspSer: 2.348 ± 1.158
3.131AspThr: 3.131 ± 1.5
2.74AspVal: 2.74 ± 0.849
1.174AspTrp: 1.174 ± 0.523
1.174AspTyr: 1.174 ± 0.739
0.0AspXaa: 0.0 ± 0.0
Glu
2.74GluAla: 2.74 ± 0.915
0.783GluCys: 0.783 ± 0.635
3.523GluAsp: 3.523 ± 0.983
8.219GluGlu: 8.219 ± 2.075
0.391GluPhe: 0.391 ± 0.349
4.305GluGly: 4.305 ± 1.346
0.783GluHis: 0.783 ± 0.436
5.479GluIle: 5.479 ± 1.28
4.305GluLys: 4.305 ± 1.365
5.088GluLeu: 5.088 ± 1.101
2.348GluMet: 2.348 ± 0.954
3.914GluAsn: 3.914 ± 1.029
2.348GluPro: 2.348 ± 0.878
5.479GluGln: 5.479 ± 1.795
6.262GluArg: 6.262 ± 1.711
4.697GluSer: 4.697 ± 1.118
5.088GluThr: 5.088 ± 1.302
7.045GluVal: 7.045 ± 1.946
1.174GluTrp: 1.174 ± 0.544
1.174GluTyr: 1.174 ± 0.676
0.0GluXaa: 0.0 ± 0.0
Phe
1.957PheAla: 1.957 ± 0.762
0.0PheCys: 0.0 ± 0.0
0.391PheAsp: 0.391 ± 0.333
3.523PheGlu: 3.523 ± 1.767
0.783PhePhe: 0.783 ± 0.435
2.348PheGly: 2.348 ± 0.803
1.566PheHis: 1.566 ± 0.567
1.566PheIle: 1.566 ± 0.725
1.957PheLys: 1.957 ± 0.708
3.523PheLeu: 3.523 ± 1.185
1.566PheMet: 1.566 ± 0.641
1.174PheAsn: 1.174 ± 0.594
0.783PhePro: 0.783 ± 0.433
2.74PheGln: 2.74 ± 0.955
1.566PheArg: 1.566 ± 0.782
2.74PheSer: 2.74 ± 0.763
3.131PheThr: 3.131 ± 0.988
1.957PheVal: 1.957 ± 1.071
0.391PheTrp: 0.391 ± 0.333
0.783PheTyr: 0.783 ± 0.436
0.0PheXaa: 0.0 ± 0.0
Gly
3.131GlyAla: 3.131 ± 0.874
0.391GlyCys: 0.391 ± 0.369
2.348GlyAsp: 2.348 ± 0.698
4.697GlyGlu: 4.697 ± 1.745
2.74GlyPhe: 2.74 ± 0.553
5.088GlyGly: 5.088 ± 1.411
1.174GlyHis: 1.174 ± 0.539
2.74GlyIle: 2.74 ± 0.567
5.479GlyLys: 5.479 ± 1.178
6.654GlyLeu: 6.654 ± 1.755
3.131GlyMet: 3.131 ± 0.995
3.131GlyAsn: 3.131 ± 1.159
3.914GlyPro: 3.914 ± 0.578
1.566GlyGln: 1.566 ± 0.709
6.262GlyArg: 6.262 ± 1.671
4.305GlySer: 4.305 ± 1.599
5.871GlyThr: 5.871 ± 1.594
3.131GlyVal: 3.131 ± 0.842
1.566GlyTrp: 1.566 ± 0.793
1.566GlyTyr: 1.566 ± 0.733
0.0GlyXaa: 0.0 ± 0.0
His
1.174HisAla: 1.174 ± 0.539
0.0HisCys: 0.0 ± 0.0
0.783HisAsp: 0.783 ± 0.694
1.566HisGlu: 1.566 ± 0.563
1.566HisPhe: 1.566 ± 0.516
1.174HisGly: 1.174 ± 0.473
0.391HisHis: 0.391 ± 0.324
1.566HisIle: 1.566 ± 1.085
0.391HisLys: 0.391 ± 0.324
1.566HisLeu: 1.566 ± 0.665
0.783HisMet: 0.783 ± 0.431
0.783HisAsn: 0.783 ± 0.648
1.566HisPro: 1.566 ± 0.633
0.391HisGln: 0.391 ± 0.375
1.957HisArg: 1.957 ± 0.959
1.957HisSer: 1.957 ± 0.62
0.783HisThr: 0.783 ± 0.486
0.783HisVal: 0.783 ± 0.482
0.0HisTrp: 0.0 ± 0.0
0.391HisTyr: 0.391 ± 0.349
0.0HisXaa: 0.0 ± 0.0
Ile
5.871IleAla: 5.871 ± 1.414
1.957IleCys: 1.957 ± 0.724
1.957IleAsp: 1.957 ± 0.699
6.654IleGlu: 6.654 ± 2.154
1.566IlePhe: 1.566 ± 0.565
5.088IleGly: 5.088 ± 0.67
1.566IleHis: 1.566 ± 0.5
4.305IleIle: 4.305 ± 1.492
3.523IleLys: 3.523 ± 1.715
8.219IleLeu: 8.219 ± 1.901
1.566IleMet: 1.566 ± 0.928
1.566IleAsn: 1.566 ± 0.709
2.348IlePro: 2.348 ± 0.661
1.566IleGln: 1.566 ± 0.697
5.088IleArg: 5.088 ± 1.204
0.783IleSer: 0.783 ± 0.619
6.262IleThr: 6.262 ± 1.42
2.74IleVal: 2.74 ± 0.922
1.174IleTrp: 1.174 ± 0.752
1.957IleTyr: 1.957 ± 0.815
0.0IleXaa: 0.0 ± 0.0
Lys
4.697LysAla: 4.697 ± 0.923
1.566LysCys: 1.566 ± 1.103
2.348LysAsp: 2.348 ± 0.856
5.479LysGlu: 5.479 ± 1.496
1.957LysPhe: 1.957 ± 1.085
2.348LysGly: 2.348 ± 0.952
0.783LysHis: 0.783 ± 0.502
3.131LysIle: 3.131 ± 1.111
3.523LysLys: 3.523 ± 1.574
4.305LysLeu: 4.305 ± 1.588
2.74LysMet: 2.74 ± 1.291
1.957LysAsn: 1.957 ± 0.757
1.566LysPro: 1.566 ± 0.698
1.957LysGln: 1.957 ± 0.981
5.479LysArg: 5.479 ± 1.368
1.957LysSer: 1.957 ± 0.752
3.523LysThr: 3.523 ± 1.335
1.566LysVal: 1.566 ± 0.735
1.957LysTrp: 1.957 ± 0.926
1.957LysTyr: 1.957 ± 0.898
0.0LysXaa: 0.0 ± 0.0
Leu
4.697LeuAla: 4.697 ± 0.916
1.174LeuCys: 1.174 ± 0.561
2.348LeuAsp: 2.348 ± 0.971
7.045LeuGlu: 7.045 ± 1.598
1.957LeuPhe: 1.957 ± 1.037
3.523LeuGly: 3.523 ± 1.474
2.348LeuHis: 2.348 ± 0.927
7.045LeuIle: 7.045 ± 1.3
8.219LeuLys: 8.219 ± 1.575
6.262LeuLeu: 6.262 ± 1.762
3.914LeuMet: 3.914 ± 0.909
3.523LeuAsn: 3.523 ± 1.573
4.305LeuPro: 4.305 ± 1.332
2.74LeuGln: 2.74 ± 1.207
5.479LeuArg: 5.479 ± 1.947
4.697LeuSer: 4.697 ± 1.449
5.871LeuThr: 5.871 ± 1.582
3.914LeuVal: 3.914 ± 1.558
1.566LeuTrp: 1.566 ± 0.722
2.348LeuTyr: 2.348 ± 0.786
0.0LeuXaa: 0.0 ± 0.0
Met
3.914MetAla: 3.914 ± 1.166
1.957MetCys: 1.957 ± 0.984
3.523MetAsp: 3.523 ± 1.263
4.697MetGlu: 4.697 ± 1.252
0.0MetPhe: 0.0 ± 0.0
1.957MetGly: 1.957 ± 1.257
0.0MetHis: 0.0 ± 0.0
1.957MetIle: 1.957 ± 0.933
1.566MetLys: 1.566 ± 1.004
1.174MetLeu: 1.174 ± 0.539
1.174MetMet: 1.174 ± 0.538
1.174MetAsn: 1.174 ± 0.754
0.783MetPro: 0.783 ± 0.632
2.348MetGln: 2.348 ± 1.126
2.348MetArg: 2.348 ± 1.067
2.348MetSer: 2.348 ± 0.834
1.957MetThr: 1.957 ± 1.156
4.305MetVal: 4.305 ± 1.455
0.391MetTrp: 0.391 ± 0.324
0.783MetTyr: 0.783 ± 0.502
0.0MetXaa: 0.0 ± 0.0
Asn
2.74AsnAla: 2.74 ± 0.733
0.783AsnCys: 0.783 ± 0.648
2.74AsnAsp: 2.74 ± 0.96
3.523AsnGlu: 3.523 ± 1.526
1.566AsnPhe: 1.566 ± 0.802
5.871AsnGly: 5.871 ± 1.085
0.0AsnHis: 0.0 ± 0.0
2.348AsnIle: 2.348 ± 0.764
1.174AsnLys: 1.174 ± 0.518
2.348AsnLeu: 2.348 ± 0.935
1.174AsnMet: 1.174 ± 0.512
1.566AsnAsn: 1.566 ± 0.843
3.523AsnPro: 3.523 ± 0.868
2.348AsnGln: 2.348 ± 0.937
4.305AsnArg: 4.305 ± 0.747
3.131AsnSer: 3.131 ± 1.197
5.088AsnThr: 5.088 ± 1.527
1.957AsnVal: 1.957 ± 1.273
0.0AsnTrp: 0.0 ± 0.0
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.566ProAla: 1.566 ± 0.52
0.783ProCys: 0.783 ± 0.437
1.566ProAsp: 1.566 ± 0.804
1.174ProGlu: 1.174 ± 0.565
1.957ProPhe: 1.957 ± 0.755
3.131ProGly: 3.131 ± 1.083
0.391ProHis: 0.391 ± 0.347
2.348ProIle: 2.348 ± 0.555
3.131ProLys: 3.131 ± 0.891
5.088ProLeu: 5.088 ± 1.192
0.391ProMet: 0.391 ± 0.369
2.348ProAsn: 2.348 ± 1.09
1.566ProPro: 1.566 ± 0.804
0.783ProGln: 0.783 ± 0.511
1.566ProArg: 1.566 ± 0.69
5.871ProSer: 5.871 ± 1.407
1.566ProThr: 1.566 ± 0.772
1.957ProVal: 1.957 ± 0.823
0.391ProTrp: 0.391 ± 0.324
0.391ProTyr: 0.391 ± 0.347
0.0ProXaa: 0.0 ± 0.0
Gln
3.131GlnAla: 3.131 ± 1.596
1.174GlnCys: 1.174 ± 0.484
1.957GlnAsp: 1.957 ± 0.781
2.348GlnGlu: 2.348 ± 0.796
0.783GlnPhe: 0.783 ± 0.558
3.131GlnGly: 3.131 ± 0.939
0.783GlnHis: 0.783 ± 0.488
2.74GlnIle: 2.74 ± 0.89
2.348GlnLys: 2.348 ± 1.218
2.74GlnLeu: 2.74 ± 1.894
3.914GlnMet: 3.914 ± 1.626
3.131GlnAsn: 3.131 ± 1.037
0.391GlnPro: 0.391 ± 0.375
1.174GlnGln: 1.174 ± 0.594
3.523GlnArg: 3.523 ± 1.533
3.131GlnSer: 3.131 ± 1.1
3.131GlnThr: 3.131 ± 1.184
1.174GlnVal: 1.174 ± 0.749
0.0GlnTrp: 0.0 ± 0.0
0.783GlnTyr: 0.783 ± 0.443
0.0GlnXaa: 0.0 ± 0.0
Arg
4.305ArgAla: 4.305 ± 0.857
0.0ArgCys: 0.0 ± 0.0
2.74ArgAsp: 2.74 ± 0.885
3.131ArgGlu: 3.131 ± 0.805
3.523ArgPhe: 3.523 ± 1.308
4.697ArgGly: 4.697 ± 1.211
1.174ArgHis: 1.174 ± 0.673
5.871ArgIle: 5.871 ± 1.407
3.131ArgLys: 3.131 ± 0.962
6.262ArgLeu: 6.262 ± 1.056
4.305ArgMet: 4.305 ± 2.096
4.697ArgAsn: 4.697 ± 1.606
1.566ArgPro: 1.566 ± 0.722
4.305ArgGln: 4.305 ± 1.29
7.828ArgArg: 7.828 ± 2.188
5.871ArgSer: 5.871 ± 1.441
6.654ArgThr: 6.654 ± 0.805
2.74ArgVal: 2.74 ± 1.069
0.783ArgTrp: 0.783 ± 0.51
0.391ArgTyr: 0.391 ± 0.333
0.0ArgXaa: 0.0 ± 0.0
Ser
4.305SerAla: 4.305 ± 1.135
2.348SerCys: 2.348 ± 0.808
2.74SerAsp: 2.74 ± 0.972
3.914SerGlu: 3.914 ± 0.964
5.088SerPhe: 5.088 ± 1.394
6.262SerGly: 6.262 ± 1.212
1.566SerHis: 1.566 ± 0.554
4.697SerIle: 4.697 ± 0.942
3.523SerLys: 3.523 ± 0.957
5.871SerLeu: 5.871 ± 1.832
1.174SerMet: 1.174 ± 0.756
4.305SerAsn: 4.305 ± 1.592
1.174SerPro: 1.174 ± 0.496
3.523SerGln: 3.523 ± 0.847
1.957SerArg: 1.957 ± 1.032
7.828SerSer: 7.828 ± 1.259
2.74SerThr: 2.74 ± 0.827
2.74SerVal: 2.74 ± 0.909
0.783SerTrp: 0.783 ± 0.694
2.348SerTyr: 2.348 ± 1.001
0.0SerXaa: 0.0 ± 0.0
Thr
3.131ThrAla: 3.131 ± 0.781
1.566ThrCys: 1.566 ± 0.496
1.957ThrAsp: 1.957 ± 0.775
6.654ThrGlu: 6.654 ± 1.651
2.74ThrPhe: 2.74 ± 0.991
4.697ThrGly: 4.697 ± 1.083
3.523ThrHis: 3.523 ± 0.858
6.654ThrIle: 6.654 ± 1.742
3.523ThrLys: 3.523 ± 1.077
7.828ThrLeu: 7.828 ± 1.794
2.74ThrMet: 2.74 ± 1.178
3.914ThrAsn: 3.914 ± 1.342
1.566ThrPro: 1.566 ± 0.608
2.348ThrGln: 2.348 ± 0.993
4.697ThrArg: 4.697 ± 1.023
3.523ThrSer: 3.523 ± 1.789
6.654ThrThr: 6.654 ± 1.581
5.479ThrVal: 5.479 ± 0.873
1.174ThrTrp: 1.174 ± 0.539
1.957ThrTyr: 1.957 ± 0.532
0.0ThrXaa: 0.0 ± 0.0
Val
3.131ValAla: 3.131 ± 1.181
2.348ValCys: 2.348 ± 1.361
4.305ValAsp: 4.305 ± 1.333
5.088ValGlu: 5.088 ± 0.958
1.566ValPhe: 1.566 ± 0.821
3.523ValGly: 3.523 ± 1.115
1.174ValHis: 1.174 ± 0.782
2.74ValIle: 2.74 ± 0.94
2.348ValLys: 2.348 ± 0.673
2.74ValLeu: 2.74 ± 0.819
1.566ValMet: 1.566 ± 0.632
1.957ValAsn: 1.957 ± 0.714
2.348ValPro: 2.348 ± 0.936
1.957ValGln: 1.957 ± 1.261
4.305ValArg: 4.305 ± 1.269
3.914ValSer: 3.914 ± 1.16
3.914ValThr: 3.914 ± 1.36
3.914ValVal: 3.914 ± 1.111
0.0ValTrp: 0.0 ± 0.0
1.957ValTyr: 1.957 ± 0.692
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.391TrpAsp: 0.391 ± 0.369
0.783TrpGlu: 0.783 ± 0.548
0.391TrpPhe: 0.391 ± 0.381
1.174TrpGly: 1.174 ± 0.651
0.783TrpHis: 0.783 ± 0.484
0.391TrpIle: 0.391 ± 0.497
0.0TrpLys: 0.0 ± 0.0
1.566TrpLeu: 1.566 ± 0.677
0.391TrpMet: 0.391 ± 0.451
1.566TrpAsn: 1.566 ± 0.52
0.783TrpPro: 0.783 ± 0.465
0.0TrpGln: 0.0 ± 0.0
1.566TrpArg: 1.566 ± 0.854
1.957TrpSer: 1.957 ± 0.895
1.174TrpThr: 1.174 ± 0.756
0.783TrpVal: 0.783 ± 0.482
0.391TrpTrp: 0.391 ± 0.349
0.783TrpTyr: 0.783 ± 0.648
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.783TyrAla: 0.783 ± 0.443
0.0TyrCys: 0.0 ± 0.0
1.957TyrAsp: 1.957 ± 0.758
1.174TyrGlu: 1.174 ± 0.687
0.783TyrPhe: 0.783 ± 0.453
1.957TyrGly: 1.957 ± 1.037
0.0TyrHis: 0.0 ± 0.0
1.174TyrIle: 1.174 ± 0.476
1.566TyrLys: 1.566 ± 0.7
1.957TyrLeu: 1.957 ± 0.606
0.391TyrMet: 0.391 ± 0.349
0.391TyrAsn: 0.391 ± 0.421
1.566TyrPro: 1.566 ± 0.866
1.566TyrGln: 1.566 ± 0.632
2.74TyrArg: 2.74 ± 1.255
1.174TyrSer: 1.174 ± 0.59
2.348TyrThr: 2.348 ± 0.796
1.566TyrVal: 1.566 ± 0.567
0.391TyrTrp: 0.391 ± 0.324
0.783TyrTyr: 0.783 ± 0.435
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (2556 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski