Amino acid dipeptide frequency for Influenza C virus (strain C/Ann Arbor/1/1950)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
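The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_per_mille`, the toy sequences, the use of overlapping adjacent pairs, and the normalization by the total number of dipeptides are all hypothetical choices, not the documented Proteome-pI procedure.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not the Influenza C proteome.
toy = ["MKKLA", "AKKB"]
freqs = dipeptide_per_mille(toy)

# Expected frequency if every dipeptide were equally likely.
uniform_400 = 1000.0 / 400  # 2.5 per mille (20 x 20 dipeptides)
uniform_441 = 1000.0 / 441  # about 2.27 per mille when Xaa is included
```

In this toy input the non-standard letter `B` is mapped to `X`, so the pair `KX` is counted under Xaa.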

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
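As a small worked example of the unit conversion, the sketch below turns per mille values into fractions and compares them with the 2.5‰ expected for 400 equally likely dipeptides. The helper names are hypothetical, and the simple above/below threshold is an assumption; the page does not state the exact rule behind its red and blue marking.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Two values taken from the table: GluLys (9.556) and CysPro (0.222).
glu_lys_fraction = per_mille_to_fraction(9.556)
glu_lys_label = classify(9.556)
cys_pro_label = classify(0.222)
```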

For more information, see the sequence space article on Wikipedia.

Rows give the first residue and columns the second residue of each dipeptide; each cell is frequency ± error (‰).

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 3.778 ± 0.586 | 1.111 ± 0.569 | 3.333 ± 0.673 | 3.778 ± 1.128 | 3.111 ± 0.726 | 4.222 ± 1.008 | 1.333 ± 0.471 | 7.111 ± 1.057 | 4.889 ± 0.531 | 7.111 ± 0.677 | 2.889 ± 0.974 | 2.222 ± 1.077 | 3.333 ± 0.93 | 2.222 ± 0.687 | 3.333 ± 0.827 | 5.111 ± 1.026 | 4.0 ± 0.961 | 4.222 ± 0.487 | 0.667 ± 0.403 | 1.333 ± 0.676 | 0.0 ± 0.0 |
| Cys | 0.444 ± 0.267 | 0.444 ± 0.267 | 1.556 ± 0.597 | 0.889 ± 0.372 | 2.0 ± 0.609 | 1.111 ± 0.436 | 0.444 ± 0.266 | 1.333 ± 0.528 | 2.444 ± 0.619 | 2.667 ± 0.738 | 1.111 ± 0.366 | 2.222 ± 0.643 | 0.222 ± 0.197 | 0.667 ± 0.284 | 1.333 ± 0.444 | 0.667 ± 0.307 | 0.444 ± 0.348 | 0.667 ± 0.236 | 0.222 ± 0.197 | 0.667 ± 0.41 | 0.0 ± 0.0 |
| Asp | 2.667 ± 0.757 | 0.222 ± 0.221 | 2.444 ± 0.602 | 4.222 ± 0.889 | 1.778 ± 0.542 | 2.889 ± 0.603 | 1.556 ± 0.507 | 3.556 ± 1.106 | 3.333 ± 0.796 | 4.222 ± 0.925 | 2.0 ± 0.425 | 2.444 ± 0.732 | 1.111 ± 0.278 | 2.0 ± 0.556 | 1.556 ± 0.584 | 3.778 ± 0.448 | 2.444 ± 0.921 | 2.444 ± 0.872 | 0.667 ± 0.384 | 2.222 ± 0.312 | 0.0 ± 0.0 |
| Glu | 8.0 ± 0.719 | 0.889 ± 0.414 | 3.778 ± 1.072 | 6.0 ± 0.997 | 2.667 ± 0.985 | 4.889 ± 1.331 | 1.111 ± 0.463 | 5.778 ± 1.021 | 9.556 ± 1.349 | 6.0 ± 0.674 | 3.556 ± 0.819 | 3.111 ± 0.689 | 1.778 ± 0.785 | 2.222 ± 0.679 | 4.222 ± 0.541 | 4.889 ± 0.828 | 4.222 ± 0.683 | 4.222 ± 0.675 | 0.444 ± 0.244 | 2.444 ± 0.751 | 0.0 ± 0.0 |
| Phe | 2.889 ± 0.941 | 1.778 ± 0.54 | 1.556 ± 0.329 | 3.111 ± 0.519 | 2.0 ± 0.393 | 3.556 ± 1.088 | 0.0 ± 0.0 | 1.111 ± 0.355 | 2.222 ± 0.642 | 4.444 ± 1.34 | 0.222 ± 0.221 | 3.111 ± 0.633 | 1.333 ± 0.397 | 2.0 ± 0.581 | 2.444 ± 0.737 | 3.333 ± 0.536 | 2.444 ± 0.657 | 3.111 ± 0.727 | 0.222 ± 0.2 | 0.889 ± 0.397 | 0.0 ± 0.0 |
| Gly | 2.444 ± 0.611 | 1.111 ± 0.78 | 3.111 ± 0.656 | 5.111 ± 0.851 | 2.444 ± 0.415 | 3.778 ± 1.12 | 0.444 ± 0.244 | 6.444 ± 0.627 | 5.778 ± 0.803 | 4.667 ± 1.074 | 1.333 ± 0.515 | 2.889 ± 0.895 | 2.667 ± 0.493 | 1.111 ± 0.437 | 5.556 ± 1.281 | 3.556 ± 0.695 | 2.889 ± 0.393 | 3.778 ± 0.688 | 0.444 ± 0.273 | 0.889 ± 0.322 | 0.0 ± 0.0 |
| His | 0.889 ± 0.394 | 0.667 ± 0.301 | 0.444 ± 0.256 | 1.556 ± 0.708 | 0.444 ± 0.241 | 0.889 ± 0.417 | 0.444 ± 0.292 | 0.889 ± 0.402 | 0.444 ± 0.273 | 1.778 ± 0.716 | 0.444 ± 0.418 | 0.444 ± 0.259 | 1.556 ± 0.503 | 0.444 ± 0.331 | 0.222 ± 0.228 | 0.444 ± 0.241 | 0.667 ± 0.403 | 0.222 ± 0.209 | 0.444 ± 0.249 | 0.667 ± 0.416 | 0.0 ± 0.0 |
| Ile | 5.778 ± 1.063 | 2.889 ± 0.665 | 2.889 ± 0.549 | 4.222 ± 1.186 | 2.0 ± 0.756 | 6.222 ± 1.072 | 1.556 ± 0.445 | 4.444 ± 1.039 | 8.0 ± 1.108 | 4.444 ± 1.107 | 2.444 ± 1.105 | 3.111 ± 0.98 | 3.333 ± 0.78 | 2.0 ± 0.745 | 4.667 ± 0.565 | 3.778 ± 0.736 | 4.889 ± 1.199 | 1.778 ± 0.463 | 0.444 ± 0.292 | 2.0 ± 0.633 | 0.0 ± 0.0 |
| Lys | 6.889 ± 1.733 | 0.667 ± 0.334 | 4.444 ± 1.264 | 5.556 ± 1.696 | 3.556 ± 0.872 | 3.778 ± 0.668 | 1.556 ± 0.458 | 6.667 ± 1.202 | 6.0 ± 1.398 | 6.0 ± 0.729 | 3.556 ± 1.062 | 5.111 ± 0.861 | 2.889 ± 0.674 | 2.667 ± 0.594 | 5.333 ± 1.482 | 7.111 ± 1.368 | 5.778 ± 1.418 | 4.0 ± 0.812 | 1.333 ± 0.302 | 1.333 ± 0.593 | 0.0 ± 0.0 |
| Leu | 5.556 ± 1.082 | 2.0 ± 0.447 | 3.333 ± 0.924 | 7.556 ± 1.479 | 3.556 ± 0.571 | 5.778 ± 1.249 | 0.889 ± 0.41 | 6.889 ± 0.918 | 5.778 ± 1.412 | 6.0 ± 1.667 | 4.222 ± 0.863 | 4.222 ± 0.799 | 4.444 ± 1.071 | 4.222 ± 0.828 | 4.889 ± 1.066 | 5.333 ± 0.66 | 3.778 ± 0.718 | 4.222 ± 0.988 | 0.889 ± 0.345 | 2.889 ± 0.634 | 0.0 ± 0.0 |
| Met | 3.778 ± 0.816 | 0.444 ± 0.244 | 1.333 ± 0.401 | 2.889 ± 0.734 | 2.444 ± 0.788 | 2.0 ± 0.708 | 0.667 ± 0.374 | 1.111 ± 0.591 | 4.444 ± 1.254 | 4.222 ± 0.908 | 1.556 ± 0.526 | 1.778 ± 0.533 | 1.111 ± 0.275 | 1.333 ± 0.557 | 3.111 ± 1.101 | 3.111 ± 0.595 | 0.889 ± 0.297 | 1.556 ± 0.375 | 1.111 ± 0.471 | 0.667 ± 0.342 | 0.0 ± 0.0 |
| Asn | 2.222 ± 0.483 | 1.556 ± 0.636 | 3.111 ± 0.624 | 5.333 ± 1.403 | 2.0 ± 0.703 | 2.0 ± 0.791 | 0.444 ± 0.363 | 3.778 ± 1.036 | 4.667 ± 0.732 | 3.778 ± 0.589 | 1.556 ± 0.685 | 2.444 ± 0.746 | 3.556 ± 0.617 | 0.889 ± 0.41 | 2.0 ± 0.575 | 3.333 ± 0.719 | 1.556 ± 0.453 | 2.889 ± 0.932 | 0.889 ± 0.41 | 1.111 ± 0.6 | 0.0 ± 0.0 |
| Pro | 1.556 ± 0.323 | 0.444 ± 0.266 | 2.444 ± 0.867 | 4.444 ± 0.977 | 2.222 ± 0.422 | 2.444 ± 0.651 | 0.667 ± 0.378 | 3.111 ± 0.881 | 2.444 ± 0.721 | 4.889 ± 1.03 | 2.0 ± 0.794 | 0.889 ± 0.446 | 1.556 ± 0.71 | 0.222 ± 0.18 | 1.778 ± 0.476 | 2.0 ± 0.601 | 1.333 ± 0.802 | 1.778 ± 0.576 | 0.889 ± 0.402 | 1.556 ± 0.435 | 0.0 ± 0.0 |
| Gln | 1.333 ± 0.709 | 0.222 ± 0.197 | 0.889 ± 0.586 | 2.889 ± 0.8 | 1.333 ± 0.397 | 2.444 ± 0.509 | 0.222 ± 0.209 | 2.667 ± 0.847 | 2.444 ± 0.753 | 1.333 ± 0.438 | 1.778 ± 0.572 | 1.556 ± 0.612 | 0.889 ± 0.266 | 0.444 ± 0.241 | 3.111 ± 0.959 | 2.667 ± 1.128 | 1.333 ± 0.442 | 1.778 ± 0.575 | 0.0 ± 0.0 | 0.444 ± 0.266 | 0.0 ± 0.0 |
| Arg | 5.778 ± 1.353 | 1.778 ± 0.761 | 2.667 ± 0.652 | 4.444 ± 1.419 | 1.778 ± 0.499 | 1.778 ± 0.376 | 0.0 ± 0.0 | 2.667 ± 0.235 | 6.0 ± 1.167 | 5.111 ± 1.284 | 2.889 ± 0.612 | 1.778 ± 0.97 | 1.556 ± 0.386 | 2.0 ± 0.417 | 2.889 ± 0.892 | 3.778 ± 0.923 | 4.222 ± 0.768 | 3.333 ± 0.759 | 0.889 ± 0.49 | 1.111 ± 0.379 | 0.0 ± 0.0 |
| Ser | 4.667 ± 1.125 | 0.889 ± 0.397 | 3.778 ± 0.403 | 4.222 ± 0.786 | 2.889 ± 0.645 | 4.889 ± 1.384 | 1.333 ± 0.43 | 3.111 ± 0.572 | 5.333 ± 1.955 | 8.889 ± 1.554 | 2.0 ± 0.44 | 4.889 ± 0.978 | 2.667 ± 0.935 | 1.333 ± 0.508 | 4.667 ± 0.955 | 5.111 ± 0.949 | 4.889 ± 1.077 | 2.667 ± 0.597 | 0.222 ± 0.209 | 1.333 ± 0.627 | 0.0 ± 0.0 |
| Thr | 5.333 ± 1.173 | 0.667 ± 0.397 | 2.0 ± 0.602 | 4.444 ± 0.999 | 1.778 ± 0.727 | 3.333 ± 0.728 | 0.444 ± 0.241 | 5.111 ± 0.704 | 3.778 ± 1.214 | 3.556 ± 0.705 | 3.333 ± 0.765 | 2.444 ± 0.432 | 1.556 ± 0.496 | 1.333 ± 0.433 | 1.778 ± 0.436 | 4.667 ± 1.051 | 3.778 ± 0.577 | 3.556 ± 0.598 | 0.889 ± 0.383 | 1.778 ± 0.656 | 0.0 ± 0.0 |
| Val | 3.333 ± 0.86 | 1.778 ± 0.688 | 2.222 ± 0.709 | 5.778 ± 0.634 | 2.222 ± 0.46 | 2.667 ± 0.613 | 0.444 ± 0.273 | 2.444 ± 0.767 | 4.222 ± 0.992 | 3.556 ± 0.535 | 1.333 ± 0.497 | 2.444 ± 1.0 | 1.778 ± 0.956 | 1.556 ± 0.558 | 2.444 ± 0.373 | 4.889 ± 0.585 | 2.444 ± 1.031 | 1.778 ± 0.315 | 0.222 ± 0.228 | 1.333 ± 0.637 | 0.0 ± 0.0 |
| Trp | 0.667 ± 0.26 | 0.667 ± 0.349 | 0.222 ± 0.2 | 1.333 ± 0.532 | 0.222 ± 0.221 | 0.889 ± 0.448 | 0.0 ± 0.0 | 1.333 ± 0.535 | 0.889 ± 0.377 | 1.333 ± 0.632 | 0.444 ± 0.442 | 0.667 ± 0.236 | 0.222 ± 0.233 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.889 ± 0.428 | 1.333 ± 0.651 | 0.667 ± 0.422 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Tyr | 1.111 ± 0.484 | 1.333 ± 0.357 | 2.0 ± 0.465 | 2.0 ± 0.679 | 1.333 ± 0.527 | 0.889 ± 0.375 | 0.444 ± 0.296 | 1.333 ± 0.479 | 1.778 ± 0.587 | 2.444 ± 0.6 | 0.667 ± 0.34 | 1.333 ± 0.41 | 1.111 ± 0.488 | 1.111 ± 0.359 | 1.111 ± 0.461 | 1.333 ± 0.619 | 2.222 ± 0.504 | 0.222 ± 0.209 | 0.889 ± 0.424 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 9 proteins (4501 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
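One way to read the note above is that whole proteins are resampled with replacement 100 times and the spread of the recomputed frequency is reported as the error. The sketch below follows that reading; the function names, the toy sequences, the fixed seed, and the choice of the sample standard deviation as the error measure are all assumptions, since the page does not spell out the exact procedure.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) pooled over a list of proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rep=100, seed=0):
    """Resample whole proteins with replacement and return the spread."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_rep):
        # Protein-level bootstrap: draw as many proteins as the proteome has.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_per_mille(sample, pair))
    # Assumption: the reported error is the standard deviation of replicates.
    return statistics.stdev(replicates)


# Hypothetical toy proteome, not the 9 Influenza C proteins.
toy_proteome = ["MKKLA", "AKKLM", "GGSGG"]
kk_error = bootstrap_error(toy_proteome, "KK")
ww_error = bootstrap_error(toy_proteome, "WW")  # pair absent from every protein
```

A dipeptide that never occurs has zero frequency in every replicate, so its estimated error is also zero, which matches the 0.0 ± 0.0 entries in the table.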

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski