Amino acid dipepetide frequency for Influenza B virus (B/Utah/45/2015)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.339AlaAla: 4.339 ± 0.741
1.509AlaCys: 1.509 ± 0.709
3.962AlaAsp: 3.962 ± 1.046
3.396AlaGlu: 3.396 ± 1.003
0.943AlaPhe: 0.943 ± 0.384
3.962AlaGly: 3.962 ± 0.821
1.132AlaHis: 1.132 ± 0.495
4.527AlaIle: 4.527 ± 0.89
5.848AlaLys: 5.848 ± 0.814
6.98AlaLeu: 6.98 ± 1.154
2.452AlaMet: 2.452 ± 0.71
2.641AlaAsn: 2.641 ± 0.828
1.698AlaPro: 1.698 ± 0.525
1.698AlaGln: 1.698 ± 0.416
1.886AlaArg: 1.886 ± 0.428
4.905AlaSer: 4.905 ± 0.877
4.905AlaThr: 4.905 ± 0.879
4.15AlaVal: 4.15 ± 0.879
1.132AlaTrp: 1.132 ± 0.386
1.886AlaTyr: 1.886 ± 0.525
0.0AlaXaa: 0.0 ± 0.0
Cys
0.943CysAla: 0.943 ± 0.424
0.189CysCys: 0.189 ± 0.148
0.377CysAsp: 0.377 ± 0.232
0.755CysGlu: 0.755 ± 0.345
1.886CysPhe: 1.886 ± 0.407
0.755CysGly: 0.755 ± 0.465
0.377CysHis: 0.377 ± 0.224
0.566CysIle: 0.566 ± 0.397
0.943CysLys: 0.943 ± 0.317
3.584CysLeu: 3.584 ± 0.763
0.566CysMet: 0.566 ± 0.233
1.132CysAsn: 1.132 ± 0.413
1.886CysPro: 1.886 ± 0.671
0.377CysGln: 0.377 ± 0.207
1.321CysArg: 1.321 ± 0.488
1.698CysSer: 1.698 ± 0.525
1.886CysThr: 1.886 ± 0.703
1.321CysVal: 1.321 ± 0.378
0.189CysTrp: 0.189 ± 0.206
0.755CysTyr: 0.755 ± 0.334
0.0CysXaa: 0.0 ± 0.0
Asp
1.698AspAla: 1.698 ± 0.513
1.509AspCys: 1.509 ± 0.47
3.584AspAsp: 3.584 ± 0.959
3.396AspGlu: 3.396 ± 0.934
1.698AspPhe: 1.698 ± 0.503
4.527AspGly: 4.527 ± 1.186
0.377AspHis: 0.377 ± 0.316
2.641AspIle: 2.641 ± 0.485
1.698AspLys: 1.698 ± 0.958
3.773AspLeu: 3.773 ± 0.525
2.264AspMet: 2.264 ± 0.666
3.584AspAsn: 3.584 ± 0.769
1.698AspPro: 1.698 ± 0.666
2.452AspGln: 2.452 ± 0.582
2.452AspArg: 2.452 ± 0.569
2.264AspSer: 2.264 ± 0.549
3.018AspThr: 3.018 ± 0.694
3.396AspVal: 3.396 ± 0.562
0.189AspTrp: 0.189 ± 0.148
2.641AspTyr: 2.641 ± 0.552
0.0AspXaa: 0.0 ± 0.0
Glu
4.339GluAla: 4.339 ± 0.443
1.698GluCys: 1.698 ± 0.791
4.339GluAsp: 4.339 ± 0.561
6.603GluGlu: 6.603 ± 1.219
2.075GluPhe: 2.075 ± 0.413
6.791GluGly: 6.791 ± 1.399
1.132GluHis: 1.132 ± 0.348
4.339GluIle: 4.339 ± 0.685
4.905GluLys: 4.905 ± 1.431
5.659GluLeu: 5.659 ± 1.945
1.886GluMet: 1.886 ± 0.642
1.321GluAsn: 1.321 ± 0.404
2.452GluPro: 2.452 ± 0.853
1.132GluGln: 1.132 ± 0.426
3.962GluArg: 3.962 ± 0.65
4.15GluSer: 4.15 ± 1.277
2.83GluThr: 2.83 ± 0.602
5.093GluVal: 5.093 ± 1.124
0.943GluTrp: 0.943 ± 0.466
1.509GluTyr: 1.509 ± 0.577
0.0GluXaa: 0.0 ± 0.0
Phe
2.075PheAla: 2.075 ± 0.505
0.755PheCys: 0.755 ± 0.36
1.509PheAsp: 1.509 ± 0.456
3.396PheGlu: 3.396 ± 0.574
1.886PhePhe: 1.886 ± 0.609
2.83PheGly: 2.83 ± 0.41
1.509PheHis: 1.509 ± 0.291
3.018PheIle: 3.018 ± 0.549
0.755PheLys: 0.755 ± 0.361
3.773PheLeu: 3.773 ± 0.91
0.755PheMet: 0.755 ± 0.385
1.886PheAsn: 1.886 ± 0.426
1.509PhePro: 1.509 ± 0.468
1.698PheGln: 1.698 ± 0.516
0.943PheArg: 0.943 ± 0.443
3.396PheSer: 3.396 ± 0.9
0.943PheThr: 0.943 ± 0.301
1.509PheVal: 1.509 ± 0.673
0.377PheTrp: 0.377 ± 0.225
0.755PheTyr: 0.755 ± 0.463
0.0PheXaa: 0.0 ± 0.0
Gly
3.018GlyAla: 3.018 ± 0.632
1.132GlyCys: 1.132 ± 0.506
3.773GlyAsp: 3.773 ± 0.327
4.905GlyGlu: 4.905 ± 1.354
4.905GlyPhe: 4.905 ± 0.506
5.471GlyGly: 5.471 ± 1.157
0.943GlyHis: 0.943 ± 0.454
5.471GlyIle: 5.471 ± 0.595
5.659GlyLys: 5.659 ± 0.905
4.716GlyLeu: 4.716 ± 0.863
2.83GlyMet: 2.83 ± 0.791
3.396GlyAsn: 3.396 ± 0.364
3.773GlyPro: 3.773 ± 0.68
2.075GlyGln: 2.075 ± 0.669
4.905GlyArg: 4.905 ± 0.637
3.773GlySer: 3.773 ± 0.897
6.037GlyThr: 6.037 ± 1.078
4.905GlyVal: 4.905 ± 1.07
1.132GlyTrp: 1.132 ± 0.44
2.452GlyTyr: 2.452 ± 0.409
0.0GlyXaa: 0.0 ± 0.0
His
0.377HisAla: 0.377 ± 0.226
0.377HisCys: 0.377 ± 0.271
0.755HisAsp: 0.755 ± 0.384
1.509HisGlu: 1.509 ± 0.544
0.377HisPhe: 0.377 ± 0.292
1.886HisGly: 1.886 ± 0.621
0.0HisHis: 0.0 ± 0.0
0.943HisIle: 0.943 ± 0.416
0.755HisLys: 0.755 ± 0.333
1.509HisLeu: 1.509 ± 0.386
0.377HisMet: 0.377 ± 0.285
0.755HisAsn: 0.755 ± 0.276
0.755HisPro: 0.755 ± 0.31
0.377HisGln: 0.377 ± 0.262
1.321HisArg: 1.321 ± 0.492
2.264HisSer: 2.264 ± 0.694
0.755HisThr: 0.755 ± 0.259
0.943HisVal: 0.943 ± 0.229
0.0HisTrp: 0.0 ± 0.0
0.755HisTyr: 0.755 ± 0.259
0.0HisXaa: 0.0 ± 0.0
Ile
3.962IleAla: 3.962 ± 0.816
2.264IleCys: 2.264 ± 0.68
3.396IleAsp: 3.396 ± 0.616
4.527IleGlu: 4.527 ± 1.009
1.698IlePhe: 1.698 ± 0.521
6.791IleGly: 6.791 ± 0.503
0.566IleHis: 0.566 ± 0.412
3.584IleIle: 3.584 ± 0.701
5.471IleLys: 5.471 ± 1.486
5.093IleLeu: 5.093 ± 1.081
1.698IleMet: 1.698 ± 0.491
2.83IleAsn: 2.83 ± 0.505
3.018IlePro: 3.018 ± 0.604
2.641IleGln: 2.641 ± 0.789
3.773IleArg: 3.773 ± 0.698
3.773IleSer: 3.773 ± 0.78
4.905IleThr: 4.905 ± 0.568
3.207IleVal: 3.207 ± 0.773
0.755IleTrp: 0.755 ± 0.259
1.132IleTyr: 1.132 ± 0.313
0.0IleXaa: 0.0 ± 0.0
Lys
5.471LysAla: 5.471 ± 0.724
2.075LysCys: 2.075 ± 0.421
2.641LysAsp: 2.641 ± 0.542
5.471LysGlu: 5.471 ± 1.161
0.943LysPhe: 0.943 ± 0.286
6.98LysGly: 6.98 ± 1.163
1.321LysHis: 1.321 ± 0.522
5.093LysIle: 5.093 ± 0.626
4.716LysLys: 4.716 ± 1.05
6.603LysLeu: 6.603 ± 0.533
2.83LysMet: 2.83 ± 0.7
4.905LysAsn: 4.905 ± 0.75
2.452LysPro: 2.452 ± 0.382
1.698LysGln: 1.698 ± 0.668
5.659LysArg: 5.659 ± 1.242
3.962LysSer: 3.962 ± 0.744
6.414LysThr: 6.414 ± 1.144
2.83LysVal: 2.83 ± 0.51
1.132LysTrp: 1.132 ± 0.704
2.641LysTyr: 2.641 ± 0.968
0.0LysXaa: 0.0 ± 0.0
Leu
5.848LeuAla: 5.848 ± 1.035
1.132LeuCys: 1.132 ± 0.533
4.339LeuAsp: 4.339 ± 0.666
5.848LeuGlu: 5.848 ± 0.878
3.773LeuPhe: 3.773 ± 0.897
4.339LeuGly: 4.339 ± 0.53
2.452LeuHis: 2.452 ± 0.667
3.396LeuIle: 3.396 ± 0.609
8.678LeuLys: 8.678 ± 1.84
9.244LeuLeu: 9.244 ± 1.044
3.962LeuMet: 3.962 ± 0.685
5.659LeuAsn: 5.659 ± 1.102
3.962LeuPro: 3.962 ± 0.863
2.075LeuGln: 2.075 ± 0.588
5.093LeuArg: 5.093 ± 0.726
8.489LeuSer: 8.489 ± 1.237
3.962LeuThr: 3.962 ± 0.774
3.773LeuVal: 3.773 ± 0.979
1.132LeuTrp: 1.132 ± 0.57
2.641LeuTyr: 2.641 ± 0.625
0.0LeuXaa: 0.0 ± 0.0
Met
3.396MetAla: 3.396 ± 0.628
1.132MetCys: 1.132 ± 0.362
1.698MetAsp: 1.698 ± 0.611
2.264MetGlu: 2.264 ± 0.501
1.321MetPhe: 1.321 ± 0.5
3.207MetGly: 3.207 ± 0.864
0.566MetHis: 0.566 ± 0.239
2.264MetIle: 2.264 ± 0.467
3.773MetLys: 3.773 ± 0.783
2.641MetLeu: 2.641 ± 0.361
1.321MetMet: 1.321 ± 0.415
2.264MetAsn: 2.264 ± 0.693
0.566MetPro: 0.566 ± 0.273
0.566MetGln: 0.566 ± 0.417
1.132MetArg: 1.132 ± 0.573
2.264MetSer: 2.264 ± 0.61
2.452MetThr: 2.452 ± 0.625
3.584MetVal: 3.584 ± 0.798
0.189MetTrp: 0.189 ± 0.184
0.755MetTyr: 0.755 ± 0.277
0.0MetXaa: 0.0 ± 0.0
Asn
5.093AsnAla: 5.093 ± 0.507
1.509AsnCys: 1.509 ± 0.478
2.264AsnAsp: 2.264 ± 0.608
4.527AsnGlu: 4.527 ± 1.261
1.886AsnPhe: 1.886 ± 0.759
3.207AsnGly: 3.207 ± 0.588
0.755AsnHis: 0.755 ± 0.246
2.83AsnIle: 2.83 ± 0.495
4.527AsnLys: 4.527 ± 0.574
4.905AsnLeu: 4.905 ± 0.976
2.83AsnMet: 2.83 ± 0.993
1.132AsnAsn: 1.132 ± 0.538
3.207AsnPro: 3.207 ± 0.547
1.132AsnGln: 1.132 ± 0.333
1.886AsnArg: 1.886 ± 0.539
3.018AsnSer: 3.018 ± 0.647
2.452AsnThr: 2.452 ± 0.619
2.452AsnVal: 2.452 ± 0.623
0.189AsnTrp: 0.189 ± 0.184
1.132AsnTyr: 1.132 ± 0.401
0.0AsnXaa: 0.0 ± 0.0
Pro
2.075ProAla: 2.075 ± 0.639
0.566ProCys: 0.566 ± 0.381
1.886ProAsp: 1.886 ± 0.352
2.452ProGlu: 2.452 ± 0.901
1.509ProPhe: 1.509 ± 0.567
3.584ProGly: 3.584 ± 0.8
0.755ProHis: 0.755 ± 0.259
3.584ProIle: 3.584 ± 0.779
3.018ProLys: 3.018 ± 1.257
3.584ProLeu: 3.584 ± 0.958
0.943ProMet: 0.943 ± 0.257
2.264ProAsn: 2.264 ± 0.492
1.321ProPro: 1.321 ± 0.369
2.641ProGln: 2.641 ± 0.78
1.698ProArg: 1.698 ± 0.454
2.83ProSer: 2.83 ± 0.744
1.698ProThr: 1.698 ± 0.52
2.264ProVal: 2.264 ± 0.545
0.566ProTrp: 0.566 ± 0.401
2.264ProTyr: 2.264 ± 0.612
0.0ProXaa: 0.0 ± 0.0
Gln
1.698GlnAla: 1.698 ± 0.4
0.377GlnCys: 0.377 ± 0.227
0.755GlnAsp: 0.755 ± 0.252
1.886GlnGlu: 1.886 ± 0.654
1.132GlnPhe: 1.132 ± 0.546
1.698GlnGly: 1.698 ± 0.672
0.189GlnHis: 0.189 ± 0.148
2.83GlnIle: 2.83 ± 0.781
3.962GlnLys: 3.962 ± 0.811
2.641GlnLeu: 2.641 ± 0.65
1.698GlnMet: 1.698 ± 0.404
0.943GlnAsn: 0.943 ± 0.29
0.566GlnPro: 0.566 ± 0.257
0.377GlnGln: 0.377 ± 0.225
2.452GlnArg: 2.452 ± 0.529
2.264GlnSer: 2.264 ± 0.519
3.207GlnThr: 3.207 ± 0.562
0.755GlnVal: 0.755 ± 0.372
0.566GlnTrp: 0.566 ± 0.552
0.755GlnTyr: 0.755 ± 0.342
0.0GlnXaa: 0.0 ± 0.0
Arg
2.83ArgAla: 2.83 ± 0.523
0.755ArgCys: 0.755 ± 0.324
3.584ArgAsp: 3.584 ± 0.83
2.264ArgGlu: 2.264 ± 0.959
1.321ArgPhe: 1.321 ± 0.44
3.584ArgGly: 3.584 ± 0.924
0.377ArgHis: 0.377 ± 0.292
4.527ArgIle: 4.527 ± 0.476
3.584ArgLys: 3.584 ± 0.907
5.848ArgLeu: 5.848 ± 1.417
3.018ArgMet: 3.018 ± 0.733
2.264ArgAsn: 2.264 ± 0.708
2.641ArgPro: 2.641 ± 0.743
1.698ArgGln: 1.698 ± 0.678
2.075ArgArg: 2.075 ± 0.643
3.584ArgSer: 3.584 ± 0.805
2.641ArgThr: 2.641 ± 0.543
2.452ArgVal: 2.452 ± 1.028
0.377ArgTrp: 0.377 ± 0.262
0.755ArgTyr: 0.755 ± 0.368
0.0ArgXaa: 0.0 ± 0.0
Ser
5.659SerAla: 5.659 ± 1.102
1.509SerCys: 1.509 ± 0.439
2.452SerAsp: 2.452 ± 0.534
2.641SerGlu: 2.641 ± 0.471
1.886SerPhe: 1.886 ± 0.587
6.603SerGly: 6.603 ± 1.006
1.509SerHis: 1.509 ± 0.572
4.527SerIle: 4.527 ± 0.594
4.339SerLys: 4.339 ± 0.728
7.168SerLeu: 7.168 ± 1.162
3.207SerMet: 3.207 ± 0.686
6.225SerAsn: 6.225 ± 0.588
3.018SerPro: 3.018 ± 0.493
2.641SerGln: 2.641 ± 0.666
2.83SerArg: 2.83 ± 0.486
4.527SerSer: 4.527 ± 1.17
4.15SerThr: 4.15 ± 0.476
1.886SerVal: 1.886 ± 0.564
0.566SerTrp: 0.566 ± 0.395
2.264SerTyr: 2.264 ± 0.509
0.0SerXaa: 0.0 ± 0.0
Thr
4.15ThrAla: 4.15 ± 0.872
0.943ThrCys: 0.943 ± 0.368
2.83ThrAsp: 2.83 ± 0.72
4.339ThrGlu: 4.339 ± 0.743
3.207ThrPhe: 3.207 ± 0.514
4.527ThrGly: 4.527 ± 1.109
1.132ThrHis: 1.132 ± 0.391
5.471ThrIle: 5.471 ± 0.578
6.037ThrLys: 6.037 ± 1.089
3.207ThrLeu: 3.207 ± 0.617
2.264ThrMet: 2.264 ± 0.356
2.264ThrAsn: 2.264 ± 0.676
3.396ThrPro: 3.396 ± 0.426
2.264ThrGln: 2.264 ± 0.56
2.264ThrArg: 2.264 ± 0.476
5.093ThrSer: 5.093 ± 1.313
4.339ThrThr: 4.339 ± 0.74
4.15ThrVal: 4.15 ± 0.425
0.755ThrTrp: 0.755 ± 0.546
1.698ThrTyr: 1.698 ± 0.767
0.0ThrXaa: 0.0 ± 0.0
Val
4.527ValAla: 4.527 ± 0.61
0.566ValCys: 0.566 ± 0.254
2.641ValAsp: 2.641 ± 0.45
3.773ValGlu: 3.773 ± 0.512
0.943ValPhe: 0.943 ± 0.542
2.264ValGly: 2.264 ± 0.718
0.566ValHis: 0.566 ± 0.397
3.962ValIle: 3.962 ± 0.938
4.905ValLys: 4.905 ± 0.645
6.98ValLeu: 6.98 ± 1.038
0.943ValMet: 0.943 ± 0.447
2.83ValAsn: 2.83 ± 0.871
2.641ValPro: 2.641 ± 0.535
1.321ValGln: 1.321 ± 0.361
2.075ValArg: 2.075 ± 0.676
4.15ValSer: 4.15 ± 0.583
4.15ValThr: 4.15 ± 0.936
3.396ValVal: 3.396 ± 0.666
0.755ValTrp: 0.755 ± 0.452
1.509ValTyr: 1.509 ± 0.414
0.0ValXaa: 0.0 ± 0.0
Trp
1.132TrpAla: 1.132 ± 0.335
0.377TrpCys: 0.377 ± 0.226
0.755TrpAsp: 0.755 ± 0.339
1.132TrpGlu: 1.132 ± 0.28
0.566TrpPhe: 0.566 ± 0.243
0.943TrpGly: 0.943 ± 0.384
0.566TrpHis: 0.566 ± 0.257
0.566TrpIle: 0.566 ± 0.312
0.377TrpLys: 0.377 ± 0.371
0.189TrpLeu: 0.189 ± 0.186
0.189TrpMet: 0.189 ± 0.186
0.377TrpAsn: 0.377 ± 0.184
0.189TrpPro: 0.189 ± 0.186
0.377TrpGln: 0.377 ± 0.257
0.755TrpArg: 0.755 ± 0.26
0.189TrpSer: 0.189 ± 0.183
0.943TrpThr: 0.943 ± 0.58
1.132TrpVal: 1.132 ± 0.52
0.189TrpTrp: 0.189 ± 0.184
0.377TrpTyr: 0.377 ± 0.366
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.321TyrAla: 1.321 ± 0.523
1.132TyrCys: 1.132 ± 0.438
1.321TyrAsp: 1.321 ± 0.341
2.075TyrGlu: 2.075 ± 0.389
1.321TyrPhe: 1.321 ± 0.287
1.321TyrGly: 1.321 ± 0.397
0.566TyrHis: 0.566 ± 0.299
0.943TyrIle: 0.943 ± 0.257
1.886TyrLys: 1.886 ± 0.365
1.886TyrLeu: 1.886 ± 0.812
1.132TyrMet: 1.132 ± 0.437
2.264TyrAsn: 2.264 ± 0.607
0.755TyrPro: 0.755 ± 0.493
1.509TyrGln: 1.509 ± 0.729
1.698TyrArg: 1.698 ± 0.702
3.018TyrSer: 3.018 ± 0.497
2.641TyrThr: 2.641 ± 0.558
1.698TyrVal: 1.698 ± 0.54
0.189TyrTrp: 0.189 ± 0.186
0.943TyrTyr: 0.943 ± 0.448
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (5302 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski