Amino acid dipepetide frequency for Phormidium virus WMP3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.587AlaAla: 5.587 ± 0.626
0.302AlaCys: 0.302 ± 0.15
4.908AlaAsp: 4.908 ± 0.619
4.304AlaGlu: 4.304 ± 0.568
3.247AlaPhe: 3.247 ± 0.539
4.53AlaGly: 4.53 ± 0.76
1.359AlaHis: 1.359 ± 0.364
3.02AlaIle: 3.02 ± 0.442
4.681AlaLys: 4.681 ± 0.768
8.985AlaLeu: 8.985 ± 0.748
1.586AlaMet: 1.586 ± 0.451
3.7AlaAsn: 3.7 ± 0.602
2.341AlaPro: 2.341 ± 0.455
4.153AlaGln: 4.153 ± 1.035
3.926AlaArg: 3.926 ± 0.475
5.285AlaSer: 5.285 ± 0.612
5.889AlaThr: 5.889 ± 0.647
6.116AlaVal: 6.116 ± 0.828
0.529AlaTrp: 0.529 ± 0.185
3.171AlaTyr: 3.171 ± 0.493
0.0AlaXaa: 0.0 ± 0.0
Cys
0.378CysAla: 0.378 ± 0.166
0.076CysCys: 0.076 ± 0.072
0.302CysAsp: 0.302 ± 0.161
0.604CysGlu: 0.604 ± 0.216
0.227CysPhe: 0.227 ± 0.118
0.604CysGly: 0.604 ± 0.217
0.227CysHis: 0.227 ± 0.116
0.076CysIle: 0.076 ± 0.067
0.302CysLys: 0.302 ± 0.155
0.755CysLeu: 0.755 ± 0.213
0.227CysMet: 0.227 ± 0.115
0.378CysAsn: 0.378 ± 0.172
0.227CysPro: 0.227 ± 0.121
0.378CysGln: 0.378 ± 0.18
0.453CysArg: 0.453 ± 0.23
0.453CysSer: 0.453 ± 0.239
0.68CysThr: 0.68 ± 0.247
0.68CysVal: 0.68 ± 0.235
0.0CysTrp: 0.0 ± 0.0
0.227CysTyr: 0.227 ± 0.118
0.0CysXaa: 0.0 ± 0.0
Asp
4.757AspAla: 4.757 ± 0.72
0.151AspCys: 0.151 ± 0.099
4.228AspAsp: 4.228 ± 0.536
5.361AspGlu: 5.361 ± 0.73
1.812AspPhe: 1.812 ± 0.304
4.757AspGly: 4.757 ± 0.679
0.982AspHis: 0.982 ± 0.256
3.473AspIle: 3.473 ± 0.499
4.379AspLys: 4.379 ± 0.816
6.342AspLeu: 6.342 ± 0.575
1.057AspMet: 1.057 ± 0.222
3.322AspAsn: 3.322 ± 0.502
2.492AspPro: 2.492 ± 0.515
1.963AspGln: 1.963 ± 0.462
3.096AspArg: 3.096 ± 0.474
2.718AspSer: 2.718 ± 0.487
5.134AspThr: 5.134 ± 0.64
6.342AspVal: 6.342 ± 0.674
0.831AspTrp: 0.831 ± 0.245
2.643AspTyr: 2.643 ± 0.407
0.0AspXaa: 0.0 ± 0.0
Glu
6.04GluAla: 6.04 ± 0.728
0.227GluCys: 0.227 ± 0.13
4.153GluAsp: 4.153 ± 0.411
2.945GluGlu: 2.945 ± 0.502
1.586GluPhe: 1.586 ± 0.364
2.718GluGly: 2.718 ± 0.449
1.057GluHis: 1.057 ± 0.231
1.359GluIle: 1.359 ± 0.24
1.359GluLys: 1.359 ± 0.303
7.626GluLeu: 7.626 ± 0.984
1.057GluMet: 1.057 ± 0.33
1.661GluAsn: 1.661 ± 0.43
2.718GluPro: 2.718 ± 0.692
3.7GluGln: 3.7 ± 0.653
4.681GluArg: 4.681 ± 0.662
2.416GluSer: 2.416 ± 0.429
3.775GluThr: 3.775 ± 0.474
6.418GluVal: 6.418 ± 0.744
0.529GluTrp: 0.529 ± 0.221
2.19GluTyr: 2.19 ± 0.36
0.0GluXaa: 0.0 ± 0.0
Phe
3.096PheAla: 3.096 ± 0.55
0.227PheCys: 0.227 ± 0.119
2.794PheAsp: 2.794 ± 0.501
1.963PheGlu: 1.963 ± 0.515
1.057PhePhe: 1.057 ± 0.288
3.096PheGly: 3.096 ± 0.511
0.68PheHis: 0.68 ± 0.229
1.133PheIle: 1.133 ± 0.334
2.039PheLys: 2.039 ± 0.392
1.661PheLeu: 1.661 ± 0.321
0.302PheMet: 0.302 ± 0.143
2.567PheAsn: 2.567 ± 0.49
0.755PhePro: 0.755 ± 0.216
1.586PheGln: 1.586 ± 0.251
1.812PheArg: 1.812 ± 0.284
2.945PheSer: 2.945 ± 0.434
3.322PheThr: 3.322 ± 0.622
2.643PheVal: 2.643 ± 0.518
0.227PheTrp: 0.227 ± 0.119
0.906PheTyr: 0.906 ± 0.245
0.0PheXaa: 0.0 ± 0.0
Gly
5.965GlyAla: 5.965 ± 0.806
0.68GlyCys: 0.68 ± 0.282
4.455GlyAsp: 4.455 ± 0.55
2.869GlyGlu: 2.869 ± 0.57
2.416GlyPhe: 2.416 ± 0.434
5.134GlyGly: 5.134 ± 0.645
1.133GlyHis: 1.133 ± 0.267
2.341GlyIle: 2.341 ± 0.377
4.077GlyLys: 4.077 ± 0.521
5.814GlyLeu: 5.814 ± 0.738
1.284GlyMet: 1.284 ± 0.324
2.794GlyAsn: 2.794 ± 0.446
0.0GlyPro: 0.0 ± 0.0
2.643GlyGln: 2.643 ± 0.395
4.077GlyArg: 4.077 ± 0.681
4.832GlySer: 4.832 ± 0.628
5.512GlyThr: 5.512 ± 0.795
4.681GlyVal: 4.681 ± 0.446
0.529GlyTrp: 0.529 ± 0.243
4.304GlyTyr: 4.304 ± 0.522
0.0GlyXaa: 0.0 ± 0.0
His
1.133HisAla: 1.133 ± 0.235
0.227HisCys: 0.227 ± 0.126
1.057HisAsp: 1.057 ± 0.322
1.284HisGlu: 1.284 ± 0.291
0.378HisPhe: 0.378 ± 0.193
0.68HisGly: 0.68 ± 0.199
0.302HisHis: 0.302 ± 0.225
1.133HisIle: 1.133 ± 0.257
0.831HisLys: 0.831 ± 0.325
1.661HisLeu: 1.661 ± 0.313
0.151HisMet: 0.151 ± 0.097
0.68HisAsn: 0.68 ± 0.223
0.982HisPro: 0.982 ± 0.286
0.529HisGln: 0.529 ± 0.185
0.831HisArg: 0.831 ± 0.231
0.831HisSer: 0.831 ± 0.331
0.906HisThr: 0.906 ± 0.368
1.208HisVal: 1.208 ± 0.383
0.529HisTrp: 0.529 ± 0.135
0.982HisTyr: 0.982 ± 0.352
0.0HisXaa: 0.0 ± 0.0
Ile
2.341IleAla: 2.341 ± 0.559
0.378IleCys: 0.378 ± 0.161
2.718IleAsp: 2.718 ± 0.538
1.51IleGlu: 1.51 ± 0.283
1.435IlePhe: 1.435 ± 0.412
3.02IleGly: 3.02 ± 0.407
0.68IleHis: 0.68 ± 0.261
0.982IleIle: 0.982 ± 0.322
2.492IleLys: 2.492 ± 0.322
2.114IleLeu: 2.114 ± 0.335
0.831IleMet: 0.831 ± 0.227
1.51IleAsn: 1.51 ± 0.407
2.265IlePro: 2.265 ± 0.393
1.51IleGln: 1.51 ± 0.292
2.492IleArg: 2.492 ± 0.447
2.039IleSer: 2.039 ± 0.566
1.963IleThr: 1.963 ± 0.341
1.51IleVal: 1.51 ± 0.381
0.529IleTrp: 0.529 ± 0.235
1.661IleTyr: 1.661 ± 0.396
0.0IleXaa: 0.0 ± 0.0
Lys
5.059LysAla: 5.059 ± 0.597
0.453LysCys: 0.453 ± 0.2
3.322LysAsp: 3.322 ± 0.624
4.153LysGlu: 4.153 ± 0.746
1.661LysPhe: 1.661 ± 0.321
2.567LysGly: 2.567 ± 0.529
0.982LysHis: 0.982 ± 0.236
0.906LysIle: 0.906 ± 0.209
2.869LysLys: 2.869 ± 0.467
5.361LysLeu: 5.361 ± 0.814
0.68LysMet: 0.68 ± 0.278
2.643LysAsn: 2.643 ± 0.51
3.322LysPro: 3.322 ± 0.966
3.775LysGln: 3.775 ± 0.587
4.228LysArg: 4.228 ± 0.441
2.794LysSer: 2.794 ± 0.406
3.398LysThr: 3.398 ± 0.519
4.304LysVal: 4.304 ± 0.581
0.529LysTrp: 0.529 ± 0.205
2.114LysTyr: 2.114 ± 0.424
0.0LysXaa: 0.0 ± 0.0
Leu
6.267LeuAla: 6.267 ± 0.803
1.359LeuCys: 1.359 ± 0.323
6.342LeuAsp: 6.342 ± 0.769
4.606LeuGlu: 4.606 ± 0.704
2.567LeuPhe: 2.567 ± 0.498
5.814LeuGly: 5.814 ± 0.536
1.133LeuHis: 1.133 ± 0.277
2.718LeuIle: 2.718 ± 0.474
4.304LeuLys: 4.304 ± 0.592
8.004LeuLeu: 8.004 ± 0.746
1.888LeuMet: 1.888 ± 0.436
3.171LeuAsn: 3.171 ± 0.541
4.757LeuPro: 4.757 ± 1.053
4.757LeuGln: 4.757 ± 0.778
6.342LeuArg: 6.342 ± 0.793
8.532LeuSer: 8.532 ± 0.691
6.947LeuThr: 6.947 ± 0.751
6.947LeuVal: 6.947 ± 0.717
0.755LeuTrp: 0.755 ± 0.21
3.02LeuTyr: 3.02 ± 0.661
0.0LeuXaa: 0.0 ± 0.0
Met
1.661MetAla: 1.661 ± 0.324
0.227MetCys: 0.227 ± 0.121
0.378MetAsp: 0.378 ± 0.165
0.604MetGlu: 0.604 ± 0.188
0.831MetPhe: 0.831 ± 0.196
0.68MetGly: 0.68 ± 0.283
0.378MetHis: 0.378 ± 0.165
0.378MetIle: 0.378 ± 0.175
1.359MetLys: 1.359 ± 0.241
2.039MetLeu: 2.039 ± 0.391
0.302MetMet: 0.302 ± 0.111
0.755MetAsn: 0.755 ± 0.19
0.831MetPro: 0.831 ± 0.308
1.057MetGln: 1.057 ± 0.257
1.057MetArg: 1.057 ± 0.273
2.114MetSer: 2.114 ± 0.508
1.51MetThr: 1.51 ± 0.304
0.906MetVal: 0.906 ± 0.232
0.151MetTrp: 0.151 ± 0.122
0.906MetTyr: 0.906 ± 0.285
0.0MetXaa: 0.0 ± 0.0
Asn
3.02AsnAla: 3.02 ± 0.466
0.227AsnCys: 0.227 ± 0.099
2.19AsnAsp: 2.19 ± 0.411
2.718AsnGlu: 2.718 ± 0.628
1.435AsnPhe: 1.435 ± 0.3
2.643AsnGly: 2.643 ± 0.399
0.831AsnHis: 0.831 ± 0.222
2.19AsnIle: 2.19 ± 0.293
3.322AsnLys: 3.322 ± 0.439
3.926AsnLeu: 3.926 ± 0.614
0.604AsnMet: 0.604 ± 0.22
3.02AsnAsn: 3.02 ± 0.662
3.096AsnPro: 3.096 ± 0.477
1.51AsnGln: 1.51 ± 0.456
2.19AsnArg: 2.19 ± 0.368
2.341AsnSer: 2.341 ± 0.38
3.322AsnThr: 3.322 ± 0.589
3.473AsnVal: 3.473 ± 0.539
0.604AsnTrp: 0.604 ± 0.191
2.114AsnTyr: 2.114 ± 0.458
0.0AsnXaa: 0.0 ± 0.0
Pro
3.398ProAla: 3.398 ± 0.437
0.151ProCys: 0.151 ± 0.117
3.247ProAsp: 3.247 ± 0.625
2.945ProGlu: 2.945 ± 0.877
1.359ProPhe: 1.359 ± 0.288
2.718ProGly: 2.718 ± 0.571
0.604ProHis: 0.604 ± 0.236
1.057ProIle: 1.057 ± 0.258
1.963ProLys: 1.963 ± 0.423
3.322ProLeu: 3.322 ± 0.617
0.982ProMet: 0.982 ± 0.283
1.812ProAsn: 1.812 ± 0.428
1.359ProPro: 1.359 ± 0.412
1.812ProGln: 1.812 ± 0.345
1.888ProArg: 1.888 ± 0.376
4.002ProSer: 4.002 ± 0.731
3.851ProThr: 3.851 ± 0.758
3.7ProVal: 3.7 ± 0.8
0.453ProTrp: 0.453 ± 0.177
2.114ProTyr: 2.114 ± 0.392
0.0ProXaa: 0.0 ± 0.0
Gln
6.72GlnAla: 6.72 ± 0.946
0.076GlnCys: 0.076 ± 0.068
2.643GlnAsp: 2.643 ± 0.56
2.718GlnGlu: 2.718 ± 0.496
2.039GlnPhe: 2.039 ± 0.428
3.171GlnGly: 3.171 ± 0.662
0.68GlnHis: 0.68 ± 0.251
1.133GlnIle: 1.133 ± 0.254
2.643GlnLys: 2.643 ± 0.497
4.455GlnLeu: 4.455 ± 0.611
1.586GlnMet: 1.586 ± 0.369
1.359GlnAsn: 1.359 ± 0.281
2.039GlnPro: 2.039 ± 0.497
5.889GlnGln: 5.889 ± 1.681
2.945GlnArg: 2.945 ± 0.472
3.096GlnSer: 3.096 ± 0.603
3.624GlnThr: 3.624 ± 0.583
3.171GlnVal: 3.171 ± 0.393
0.529GlnTrp: 0.529 ± 0.211
2.19GlnTyr: 2.19 ± 0.45
0.0GlnXaa: 0.0 ± 0.0
Arg
4.228ArgAla: 4.228 ± 0.486
0.453ArgCys: 0.453 ± 0.176
3.851ArgAsp: 3.851 ± 0.398
3.851ArgGlu: 3.851 ± 0.786
2.19ArgPhe: 2.19 ± 0.443
3.096ArgGly: 3.096 ± 0.565
0.68ArgHis: 0.68 ± 0.208
2.341ArgIle: 2.341 ± 0.378
3.549ArgLys: 3.549 ± 0.524
5.436ArgLeu: 5.436 ± 0.672
1.435ArgMet: 1.435 ± 0.315
2.718ArgAsn: 2.718 ± 0.494
2.718ArgPro: 2.718 ± 0.539
3.7ArgGln: 3.7 ± 0.549
4.455ArgArg: 4.455 ± 0.864
2.794ArgSer: 2.794 ± 0.417
3.926ArgThr: 3.926 ± 0.408
4.077ArgVal: 4.077 ± 0.429
0.453ArgTrp: 0.453 ± 0.17
2.869ArgTyr: 2.869 ± 0.623
0.0ArgXaa: 0.0 ± 0.0
Ser
4.832SerAla: 4.832 ± 0.625
0.453SerCys: 0.453 ± 0.228
4.908SerAsp: 4.908 ± 0.512
4.53SerGlu: 4.53 ± 0.758
2.567SerPhe: 2.567 ± 0.531
5.436SerGly: 5.436 ± 0.656
0.982SerHis: 0.982 ± 0.339
1.963SerIle: 1.963 ± 0.288
3.926SerLys: 3.926 ± 0.646
4.832SerLeu: 4.832 ± 0.779
1.133SerMet: 1.133 ± 0.302
2.794SerAsn: 2.794 ± 0.533
3.096SerPro: 3.096 ± 0.792
3.398SerGln: 3.398 ± 0.586
3.549SerArg: 3.549 ± 0.545
5.738SerSer: 5.738 ± 0.91
4.455SerThr: 4.455 ± 0.524
5.059SerVal: 5.059 ± 0.623
1.208SerTrp: 1.208 ± 0.343
3.096SerTyr: 3.096 ± 0.397
0.0SerXaa: 0.0 ± 0.0
Thr
4.908ThrAla: 4.908 ± 0.621
0.378ThrCys: 0.378 ± 0.185
3.851ThrAsp: 3.851 ± 0.445
3.624ThrGlu: 3.624 ± 0.795
3.096ThrPhe: 3.096 ± 0.586
5.663ThrGly: 5.663 ± 0.749
0.906ThrHis: 0.906 ± 0.214
2.718ThrIle: 2.718 ± 0.459
3.926ThrLys: 3.926 ± 0.544
6.267ThrLeu: 6.267 ± 0.701
1.133ThrMet: 1.133 ± 0.326
2.945ThrAsn: 2.945 ± 0.513
4.455ThrPro: 4.455 ± 0.65
3.624ThrGln: 3.624 ± 0.551
4.077ThrArg: 4.077 ± 0.617
5.285ThrSer: 5.285 ± 0.711
7.702ThrThr: 7.702 ± 0.951
5.814ThrVal: 5.814 ± 0.975
1.208ThrTrp: 1.208 ± 0.25
4.077ThrTyr: 4.077 ± 0.483
0.0ThrXaa: 0.0 ± 0.0
Val
4.228ValAla: 4.228 ± 0.489
0.755ValCys: 0.755 ± 0.237
6.116ValAsp: 6.116 ± 0.491
4.908ValGlu: 4.908 ± 0.487
2.869ValPhe: 2.869 ± 0.44
4.53ValGly: 4.53 ± 0.465
1.737ValHis: 1.737 ± 0.401
3.096ValIle: 3.096 ± 0.369
4.153ValLys: 4.153 ± 0.593
5.738ValLeu: 5.738 ± 0.681
0.982ValMet: 0.982 ± 0.255
3.398ValAsn: 3.398 ± 0.656
4.228ValPro: 4.228 ± 0.686
4.077ValGln: 4.077 ± 0.659
4.455ValArg: 4.455 ± 0.735
6.947ValSer: 6.947 ± 0.64
5.436ValThr: 5.436 ± 0.642
6.72ValVal: 6.72 ± 0.747
0.831ValTrp: 0.831 ± 0.313
3.171ValTyr: 3.171 ± 0.488
0.0ValXaa: 0.0 ± 0.0
Trp
0.982TrpAla: 0.982 ± 0.233
0.227TrpCys: 0.227 ± 0.117
0.529TrpAsp: 0.529 ± 0.173
0.604TrpGlu: 0.604 ± 0.172
0.453TrpPhe: 0.453 ± 0.157
0.604TrpGly: 0.604 ± 0.241
0.227TrpHis: 0.227 ± 0.129
0.151TrpIle: 0.151 ± 0.102
0.378TrpLys: 0.378 ± 0.163
1.51TrpLeu: 1.51 ± 0.388
0.151TrpMet: 0.151 ± 0.089
0.831TrpAsn: 0.831 ± 0.27
0.0TrpPro: 0.0 ± 0.0
0.529TrpGln: 0.529 ± 0.166
0.302TrpArg: 0.302 ± 0.187
0.982TrpSer: 0.982 ± 0.296
0.68TrpThr: 0.68 ± 0.249
1.133TrpVal: 1.133 ± 0.278
0.378TrpTrp: 0.378 ± 0.132
0.68TrpTyr: 0.68 ± 0.218
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.02TyrAla: 3.02 ± 0.431
0.227TyrCys: 0.227 ± 0.122
3.851TyrAsp: 3.851 ± 0.502
2.114TyrGlu: 2.114 ± 0.357
1.661TyrPhe: 1.661 ± 0.397
4.077TyrGly: 4.077 ± 0.601
0.755TyrHis: 0.755 ± 0.281
1.812TyrIle: 1.812 ± 0.419
2.567TyrLys: 2.567 ± 0.308
4.379TyrLeu: 4.379 ± 0.644
0.68TyrMet: 0.68 ± 0.182
2.718TyrAsn: 2.718 ± 0.502
1.057TyrPro: 1.057 ± 0.243
2.114TyrGln: 2.114 ± 0.432
1.963TyrArg: 1.963 ± 0.412
1.812TyrSer: 1.812 ± 0.461
3.624TyrThr: 3.624 ± 0.595
3.473TyrVal: 3.473 ± 0.593
0.529TyrTrp: 0.529 ± 0.209
2.19TyrTyr: 2.19 ± 0.376
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 41 proteins (13245 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski