Amino acid dipepetide frequency for Influenza A virus (A/American wigeon/Interior Alaska/10BM05537R0/2010(mixed))

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.148AlaAla: 3.148 ± 1.07
0.741AlaCys: 0.741 ± 0.467
2.037AlaAsp: 2.037 ± 0.419
4.259AlaGlu: 4.259 ± 1.007
1.852AlaPhe: 1.852 ± 0.62
3.333AlaGly: 3.333 ± 1.061
1.111AlaHis: 1.111 ± 0.476
3.148AlaIle: 3.148 ± 0.56
1.852AlaLys: 1.852 ± 0.512
5.185AlaLeu: 5.185 ± 0.676
2.593AlaMet: 2.593 ± 0.654
3.519AlaAsn: 3.519 ± 0.723
1.667AlaPro: 1.667 ± 0.565
2.037AlaGln: 2.037 ± 0.454
2.963AlaArg: 2.963 ± 0.572
3.519AlaSer: 3.519 ± 1.094
3.889AlaThr: 3.889 ± 0.968
2.778AlaVal: 2.778 ± 0.631
0.37AlaTrp: 0.37 ± 0.208
0.741AlaTyr: 0.741 ± 0.349
0.0AlaXaa: 0.0 ± 0.0
Cys
0.185CysAla: 0.185 ± 0.207
0.37CysCys: 0.37 ± 0.217
0.741CysAsp: 0.741 ± 0.545
0.741CysGlu: 0.741 ± 0.278
1.296CysPhe: 1.296 ± 0.556
0.37CysGly: 0.37 ± 0.441
1.111CysHis: 1.111 ± 0.372
1.111CysIle: 1.111 ± 0.586
0.741CysLys: 0.741 ± 0.222
1.852CysLeu: 1.852 ± 0.519
0.741CysMet: 0.741 ± 0.451
1.111CysAsn: 1.111 ± 0.445
0.37CysPro: 0.37 ± 0.274
0.37CysGln: 0.37 ± 0.336
0.741CysArg: 0.741 ± 0.487
1.296CysSer: 1.296 ± 0.417
0.37CysThr: 0.37 ± 0.229
0.926CysVal: 0.926 ± 0.402
0.185CysTrp: 0.185 ± 0.159
0.37CysTyr: 0.37 ± 0.274
0.0CysXaa: 0.0 ± 0.0
Asp
2.407AspAla: 2.407 ± 0.529
1.111AspCys: 1.111 ± 0.346
0.926AspAsp: 0.926 ± 0.389
2.593AspGlu: 2.593 ± 0.511
1.667AspPhe: 1.667 ± 0.555
2.778AspGly: 2.778 ± 0.872
0.926AspHis: 0.926 ± 0.455
1.481AspIle: 1.481 ± 0.677
1.667AspLys: 1.667 ± 0.399
3.333AspLeu: 3.333 ± 0.739
1.667AspMet: 1.667 ± 0.495
2.778AspAsn: 2.778 ± 0.546
2.778AspPro: 2.778 ± 0.648
1.667AspGln: 1.667 ± 0.598
2.222AspArg: 2.222 ± 0.56
2.037AspSer: 2.037 ± 0.655
2.778AspThr: 2.778 ± 0.632
2.778AspVal: 2.778 ± 0.679
0.556AspTrp: 0.556 ± 0.256
1.852AspTyr: 1.852 ± 0.449
0.0AspXaa: 0.0 ± 0.0
Glu
2.407GluAla: 2.407 ± 0.468
1.111GluCys: 1.111 ± 0.894
4.074GluAsp: 4.074 ± 0.562
6.481GluGlu: 6.481 ± 0.931
1.852GluPhe: 1.852 ± 0.506
2.778GluGly: 2.778 ± 0.698
0.556GluHis: 0.556 ± 0.363
4.444GluIle: 4.444 ± 0.649
4.444GluLys: 4.444 ± 0.904
5.0GluLeu: 5.0 ± 0.833
2.778GluMet: 2.778 ± 0.611
2.963GluAsn: 2.963 ± 0.713
1.111GluPro: 1.111 ± 0.279
4.444GluGln: 4.444 ± 1.156
4.074GluArg: 4.074 ± 1.089
6.481GluSer: 6.481 ± 1.171
3.889GluThr: 3.889 ± 0.777
4.63GluVal: 4.63 ± 1.143
1.111GluTrp: 1.111 ± 0.402
1.111GluTyr: 1.111 ± 0.292
0.0GluXaa: 0.0 ± 0.0
Phe
1.852PheAla: 1.852 ± 0.493
0.0PheCys: 0.0 ± 0.0
1.111PheAsp: 1.111 ± 0.401
5.0PheGlu: 5.0 ± 1.106
1.481PhePhe: 1.481 ± 0.451
1.111PheGly: 1.111 ± 0.357
0.556PheHis: 0.556 ± 0.269
1.667PheIle: 1.667 ± 0.484
1.111PheLys: 1.111 ± 0.756
3.519PheLeu: 3.519 ± 0.545
0.556PheMet: 0.556 ± 0.296
1.667PheAsn: 1.667 ± 0.55
0.926PhePro: 0.926 ± 0.385
2.222PheGln: 2.222 ± 0.707
1.481PheArg: 1.481 ± 0.224
2.593PheSer: 2.593 ± 0.493
2.407PheThr: 2.407 ± 0.347
2.593PheVal: 2.593 ± 0.727
0.185PheTrp: 0.185 ± 0.195
1.111PheTyr: 1.111 ± 0.4
0.0PheXaa: 0.0 ± 0.0
Gly
2.963GlyAla: 2.963 ± 0.959
0.37GlyCys: 0.37 ± 0.336
2.963GlyAsp: 2.963 ± 0.373
2.407GlyGlu: 2.407 ± 0.853
1.852GlyPhe: 1.852 ± 0.387
2.963GlyGly: 2.963 ± 0.757
1.111GlyHis: 1.111 ± 0.476
4.074GlyIle: 4.074 ± 0.884
3.519GlyLys: 3.519 ± 0.537
4.074GlyLeu: 4.074 ± 0.856
1.852GlyMet: 1.852 ± 0.387
3.148GlyAsn: 3.148 ± 0.908
3.704GlyPro: 3.704 ± 0.959
1.852GlyGln: 1.852 ± 0.624
5.0GlyArg: 5.0 ± 1.0
3.148GlySer: 3.148 ± 1.216
5.0GlyThr: 5.0 ± 0.584
3.889GlyVal: 3.889 ± 0.443
0.37GlyTrp: 0.37 ± 0.307
2.037GlyTyr: 2.037 ± 0.486
0.0GlyXaa: 0.0 ± 0.0
His
0.741HisAla: 0.741 ± 0.286
0.37HisCys: 0.37 ± 0.293
0.741HisAsp: 0.741 ± 0.678
0.556HisGlu: 0.556 ± 0.353
1.296HisPhe: 1.296 ± 0.318
1.111HisGly: 1.111 ± 0.366
0.37HisHis: 0.37 ± 0.336
1.852HisIle: 1.852 ± 0.742
1.111HisLys: 1.111 ± 0.506
1.481HisLeu: 1.481 ± 0.49
0.185HisMet: 0.185 ± 0.159
0.185HisAsn: 0.185 ± 0.22
0.926HisPro: 0.926 ± 0.306
1.296HisGln: 1.296 ± 0.491
1.111HisArg: 1.111 ± 0.448
0.926HisSer: 0.926 ± 0.429
0.556HisThr: 0.556 ± 0.298
0.185HisVal: 0.185 ± 0.164
0.0HisTrp: 0.0 ± 0.0
0.185HisTyr: 0.185 ± 0.159
0.0HisXaa: 0.0 ± 0.0
Ile
2.778IleAla: 2.778 ± 0.685
2.407IleCys: 2.407 ± 0.79
3.333IleAsp: 3.333 ± 0.854
5.37IleGlu: 5.37 ± 0.837
1.111IlePhe: 1.111 ± 0.269
3.333IleGly: 3.333 ± 0.625
0.37IleHis: 0.37 ± 0.263
2.778IleIle: 2.778 ± 1.067
2.963IleLys: 2.963 ± 0.546
6.296IleLeu: 6.296 ± 1.641
1.667IleMet: 1.667 ± 0.412
3.704IleAsn: 3.704 ± 0.541
1.111IlePro: 1.111 ± 0.386
2.407IleGln: 2.407 ± 0.614
5.37IleArg: 5.37 ± 1.155
2.407IleSer: 2.407 ± 0.755
3.519IleThr: 3.519 ± 0.667
2.963IleVal: 2.963 ± 0.658
0.556IleTrp: 0.556 ± 0.396
1.296IleTyr: 1.296 ± 0.538
0.185IleXaa: 0.185 ± 0.168
Lys
2.963LysAla: 2.963 ± 0.739
0.741LysCys: 0.741 ± 0.315
2.593LysAsp: 2.593 ± 0.345
4.074LysGlu: 4.074 ± 0.729
1.296LysPhe: 1.296 ± 0.373
2.037LysGly: 2.037 ± 0.458
0.37LysHis: 0.37 ± 0.206
3.333LysIle: 3.333 ± 1.065
3.889LysLys: 3.889 ± 1.675
4.815LysLeu: 4.815 ± 1.153
2.963LysMet: 2.963 ± 0.615
1.296LysAsn: 1.296 ± 0.394
0.37LysPro: 0.37 ± 0.263
2.222LysGln: 2.222 ± 0.987
5.0LysArg: 5.0 ± 1.313
2.963LysSer: 2.963 ± 0.681
2.778LysThr: 2.778 ± 0.807
1.852LysVal: 1.852 ± 0.484
1.667LysTrp: 1.667 ± 0.472
1.481LysTyr: 1.481 ± 0.458
0.0LysXaa: 0.0 ± 0.0
Leu
4.074LeuAla: 4.074 ± 0.497
1.296LeuCys: 1.296 ± 0.628
0.926LeuAsp: 0.926 ± 0.442
5.741LeuGlu: 5.741 ± 0.912
2.407LeuPhe: 2.407 ± 0.594
3.704LeuGly: 3.704 ± 0.867
1.111LeuHis: 1.111 ± 0.353
5.926LeuIle: 5.926 ± 1.089
7.037LeuLys: 7.037 ± 1.619
5.185LeuLeu: 5.185 ± 0.833
1.852LeuMet: 1.852 ± 0.584
4.444LeuAsn: 4.444 ± 1.041
3.333LeuPro: 3.333 ± 0.735
2.593LeuGln: 2.593 ± 0.553
4.444LeuArg: 4.444 ± 1.381
5.0LeuSer: 5.0 ± 1.344
5.37LeuThr: 5.37 ± 1.39
3.519LeuVal: 3.519 ± 0.838
1.852LeuTrp: 1.852 ± 0.46
2.037LeuTyr: 2.037 ± 0.437
0.185LeuXaa: 0.185 ± 0.166
Met
3.148MetAla: 3.148 ± 0.704
0.741MetCys: 0.741 ± 0.596
3.333MetAsp: 3.333 ± 1.133
4.444MetGlu: 4.444 ± 0.916
1.296MetPhe: 1.296 ± 0.674
1.667MetGly: 1.667 ± 0.854
0.556MetHis: 0.556 ± 0.344
2.222MetIle: 2.222 ± 0.485
1.481MetLys: 1.481 ± 0.49
1.111MetLeu: 1.111 ± 0.398
1.481MetMet: 1.481 ± 0.55
0.37MetAsn: 0.37 ± 0.274
0.556MetPro: 0.556 ± 0.256
1.296MetGln: 1.296 ± 0.481
1.852MetArg: 1.852 ± 0.667
2.778MetSer: 2.778 ± 0.582
2.407MetThr: 2.407 ± 0.688
3.333MetVal: 3.333 ± 0.937
0.185MetTrp: 0.185 ± 0.159
0.741MetTyr: 0.741 ± 0.223
0.0MetXaa: 0.0 ± 0.0
Asn
3.704AsnAla: 3.704 ± 0.785
0.37AsnCys: 0.37 ± 0.274
1.852AsnAsp: 1.852 ± 0.319
3.704AsnGlu: 3.704 ± 0.944
1.296AsnPhe: 1.296 ± 0.237
4.815AsnGly: 4.815 ± 1.712
0.0AsnHis: 0.0 ± 0.0
2.407AsnIle: 2.407 ± 0.671
2.407AsnLys: 2.407 ± 0.434
2.963AsnLeu: 2.963 ± 0.428
2.037AsnMet: 2.037 ± 0.728
3.333AsnAsn: 3.333 ± 1.218
4.63AsnPro: 4.63 ± 0.673
2.222AsnGln: 2.222 ± 0.565
3.148AsnArg: 3.148 ± 0.636
3.148AsnSer: 3.148 ± 0.754
3.519AsnThr: 3.519 ± 0.745
1.852AsnVal: 1.852 ± 0.618
0.926AsnTrp: 0.926 ± 0.525
0.556AsnTyr: 0.556 ± 0.29
0.0AsnXaa: 0.0 ± 0.0
Pro
3.148ProAla: 3.148 ± 0.979
0.185ProCys: 0.185 ± 0.22
0.926ProAsp: 0.926 ± 0.362
2.593ProGlu: 2.593 ± 0.619
1.667ProPhe: 1.667 ± 0.398
2.593ProGly: 2.593 ± 0.375
0.185ProHis: 0.185 ± 0.22
1.852ProIle: 1.852 ± 0.373
1.667ProLys: 1.667 ± 0.463
3.148ProLeu: 3.148 ± 0.812
0.37ProMet: 0.37 ± 0.206
2.593ProAsn: 2.593 ± 0.75
1.296ProPro: 1.296 ± 0.364
0.556ProGln: 0.556 ± 0.259
1.296ProArg: 1.296 ± 0.601
3.333ProSer: 3.333 ± 0.74
2.037ProThr: 2.037 ± 0.523
2.222ProVal: 2.222 ± 0.791
0.556ProTrp: 0.556 ± 0.326
1.111ProTyr: 1.111 ± 0.495
0.0ProXaa: 0.0 ± 0.0
Gln
2.407GlnAla: 2.407 ± 0.819
0.556GlnCys: 0.556 ± 0.219
1.667GlnAsp: 1.667 ± 0.524
2.407GlnGlu: 2.407 ± 0.869
0.37GlnPhe: 0.37 ± 0.25
2.593GlnGly: 2.593 ± 0.604
0.741GlnHis: 0.741 ± 0.459
3.333GlnIle: 3.333 ± 0.735
2.778GlnLys: 2.778 ± 0.718
2.963GlnLeu: 2.963 ± 0.815
2.222GlnMet: 2.222 ± 0.864
3.148GlnAsn: 3.148 ± 0.816
0.926GlnPro: 0.926 ± 0.484
1.481GlnGln: 1.481 ± 0.418
3.704GlnArg: 3.704 ± 1.042
3.333GlnSer: 3.333 ± 0.828
3.704GlnThr: 3.704 ± 0.989
1.481GlnVal: 1.481 ± 0.539
0.556GlnTrp: 0.556 ± 0.344
0.556GlnTyr: 0.556 ± 0.199
0.0GlnXaa: 0.0 ± 0.0
Arg
4.074ArgAla: 4.074 ± 0.789
0.926ArgCys: 0.926 ± 0.316
2.963ArgAsp: 2.963 ± 0.733
2.407ArgGlu: 2.407 ± 0.758
2.407ArgPhe: 2.407 ± 0.513
6.111ArgGly: 6.111 ± 1.286
0.926ArgHis: 0.926 ± 0.449
3.333ArgIle: 3.333 ± 0.698
2.222ArgLys: 2.222 ± 0.589
5.0ArgLeu: 5.0 ± 0.632
3.704ArgMet: 3.704 ± 1.403
3.704ArgAsn: 3.704 ± 0.996
2.222ArgPro: 2.222 ± 0.82
2.778ArgGln: 2.778 ± 0.507
5.37ArgArg: 5.37 ± 1.125
3.889ArgSer: 3.889 ± 1.206
5.0ArgThr: 5.0 ± 0.923
3.148ArgVal: 3.148 ± 0.821
0.926ArgTrp: 0.926 ± 0.422
1.481ArgTyr: 1.481 ± 0.512
0.0ArgXaa: 0.0 ± 0.0
Ser
2.963SerAla: 2.963 ± 1.004
1.481SerCys: 1.481 ± 0.621
2.037SerAsp: 2.037 ± 0.713
2.778SerGlu: 2.778 ± 0.546
3.889SerPhe: 3.889 ± 0.861
5.185SerGly: 5.185 ± 0.949
1.296SerHis: 1.296 ± 0.607
4.63SerIle: 4.63 ± 0.537
2.778SerLys: 2.778 ± 1.19
5.37SerLeu: 5.37 ± 1.424
2.407SerMet: 2.407 ± 0.914
2.222SerAsn: 2.222 ± 0.691
2.778SerPro: 2.778 ± 0.796
3.889SerGln: 3.889 ± 0.884
2.778SerArg: 2.778 ± 0.685
6.296SerSer: 6.296 ± 1.301
5.0SerThr: 5.0 ± 0.999
2.963SerVal: 2.963 ± 0.727
1.481SerTrp: 1.481 ± 1.276
1.852SerTyr: 1.852 ± 0.648
0.0SerXaa: 0.0 ± 0.0
Thr
3.333ThrAla: 3.333 ± 0.718
1.111ThrCys: 1.111 ± 0.373
2.593ThrAsp: 2.593 ± 0.873
3.889ThrGlu: 3.889 ± 1.138
2.222ThrPhe: 2.222 ± 0.53
5.185ThrGly: 5.185 ± 0.953
2.037ThrHis: 2.037 ± 0.578
4.444ThrIle: 4.444 ± 0.547
3.704ThrLys: 3.704 ± 0.552
4.259ThrLeu: 4.259 ± 0.65
2.778ThrMet: 2.778 ± 0.514
3.148ThrAsn: 3.148 ± 0.876
1.481ThrPro: 1.481 ± 0.453
4.074ThrGln: 4.074 ± 1.119
4.259ThrArg: 4.259 ± 0.91
3.333ThrSer: 3.333 ± 0.803
3.519ThrThr: 3.519 ± 0.99
4.074ThrVal: 4.074 ± 1.091
0.37ThrTrp: 0.37 ± 0.206
2.407ThrTyr: 2.407 ± 0.603
0.0ThrXaa: 0.0 ± 0.0
Val
2.963ValAla: 2.963 ± 0.788
1.111ValCys: 1.111 ± 0.355
3.148ValAsp: 3.148 ± 0.995
3.519ValGlu: 3.519 ± 0.584
2.037ValPhe: 2.037 ± 0.574
2.778ValGly: 2.778 ± 0.762
0.926ValHis: 0.926 ± 0.457
1.852ValIle: 1.852 ± 0.672
2.037ValLys: 2.037 ± 0.637
4.074ValLeu: 4.074 ± 1.532
2.037ValMet: 2.037 ± 0.446
3.148ValAsn: 3.148 ± 0.616
2.222ValPro: 2.222 ± 0.627
1.852ValGln: 1.852 ± 0.763
3.889ValArg: 3.889 ± 1.033
4.63ValSer: 4.63 ± 0.901
2.593ValThr: 2.593 ± 0.731
2.778ValVal: 2.778 ± 0.576
0.37ValTrp: 0.37 ± 0.263
0.926ValTyr: 0.926 ± 0.313
0.0ValXaa: 0.0 ± 0.0
Trp
0.741TrpAla: 0.741 ± 0.458
0.0TrpCys: 0.0 ± 0.0
0.556TrpAsp: 0.556 ± 0.241
1.667TrpGlu: 1.667 ± 0.459
0.556TrpPhe: 0.556 ± 0.249
0.556TrpGly: 0.556 ± 0.219
0.741TrpHis: 0.741 ± 0.47
0.741TrpIle: 0.741 ± 0.302
0.556TrpLys: 0.556 ± 0.348
1.111TrpLeu: 1.111 ± 0.504
0.556TrpMet: 0.556 ± 0.411
0.926TrpAsn: 0.926 ± 0.258
0.185TrpPro: 0.185 ± 0.166
0.0TrpGln: 0.0 ± 0.0
0.741TrpArg: 0.741 ± 0.456
0.926TrpSer: 0.926 ± 0.676
1.852TrpThr: 1.852 ± 0.64
0.185TrpVal: 0.185 ± 0.166
0.37TrpTrp: 0.37 ± 0.207
0.185TrpTyr: 0.185 ± 0.22
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.741TyrAla: 0.741 ± 0.273
0.0TyrCys: 0.0 ± 0.0
1.481TyrAsp: 1.481 ± 0.604
1.111TyrGlu: 1.111 ± 0.563
1.296TyrPhe: 1.296 ± 0.405
1.296TyrGly: 1.296 ± 0.256
0.185TyrHis: 0.185 ± 0.168
1.111TyrIle: 1.111 ± 0.318
0.741TyrLys: 0.741 ± 0.332
1.296TyrLeu: 1.296 ± 0.399
0.37TyrMet: 0.37 ± 0.207
1.667TyrAsn: 1.667 ± 0.65
0.741TyrPro: 0.741 ± 0.418
1.481TyrGln: 1.481 ± 0.365
2.963TyrArg: 2.963 ± 1.088
2.037TyrSer: 2.037 ± 0.343
2.037TyrThr: 2.037 ± 0.636
0.556TyrVal: 0.556 ± 0.355
0.556TyrTrp: 0.556 ± 0.272
0.185TyrTyr: 0.185 ± 0.166
0.37TyrXaa: 0.37 ± 0.217
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.185XaaIle: 0.185 ± 0.159
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.185XaaGln: 0.185 ± 0.179
0.185XaaArg: 0.185 ± 0.168
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.185XaaVal: 0.185 ± 0.166
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
115.185XaaXaa: 115.185 ± 58.614
Statistics based on 13 proteins (5401 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski