Amino acid dipeptide frequency for Paenibacillus phage Wanderer

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
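
As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping pairs of adjacent residues in a list of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter sequence encoding, and the choice to normalize by the total number of dipeptides are assumptions made for this example; the database may normalize differently (for example, by the total residue count).

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: sequences use one-letter amino acid codes and the
    frequencies are normalized by the total number of dipeptides.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the phage proteome).
freqs = dipeptide_frequencies(["MKKLA", "MKA"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

With this normalization the reported values always sum to 1000‰, which is a convenient sanity check.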

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins at random with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
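
The expectation under a uniform model and the red/blue marking can be sketched as follows. This is only an illustration: the function name `classify` and the strict comparison against the expected value are assumptions, and the page may apply a different threshold when colouring cells. The two example values are taken from the table (GluLeu and HisHis).

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Expected frequency of each dipeptide if all 400 were equally likely.
EXPECTED_PERMILLE = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille = 0.25 %


def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Return the colour a cell would get under a simple threshold rule.

    Assumption: anything above the expectation is 'red' (overrepresented),
    anything below is 'blue' (underrepresented).
    """
    if observed_permille > expected:
        return "red"
    if observed_permille < expected:
        return "blue"
    return "none"


print(EXPECTED_PERMILLE)
print("GluLeu", classify(8.087))
print("HisHis", classify(0.157))
```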

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
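
For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are hypothetical, and the example value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 4.397  # AlaAla frequency in per mille, from the table
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
```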

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.397 ± 0.926
AlaCys: 0.55 ± 0.214
AlaAsp: 4.24 ± 0.609
AlaGlu: 6.045 ± 0.708
AlaPhe: 3.062 ± 0.663
AlaGly: 4.475 ± 0.853
AlaHis: 0.864 ± 0.233
AlaIle: 4.318 ± 0.687
AlaLys: 5.496 ± 0.572
AlaLeu: 6.281 ± 1.005
AlaMet: 2.277 ± 0.402
AlaAsn: 1.884 ± 0.37
AlaPro: 1.806 ± 0.408
AlaGln: 2.826 ± 0.531
AlaArg: 2.983 ± 0.396
AlaSer: 3.612 ± 0.763
AlaThr: 2.669 ± 0.52
AlaVal: 4.475 ± 0.99
AlaTrp: 0.942 ± 0.275
AlaTyr: 2.355 ± 0.542
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.471 ± 0.194
CysCys: 0.314 ± 0.195
CysAsp: 0.628 ± 0.222
CysGlu: 0.628 ± 0.239
CysPhe: 0.55 ± 0.217
CysGly: 0.942 ± 0.306
CysHis: 0.236 ± 0.128
CysIle: 0.785 ± 0.22
CysLys: 1.021 ± 0.352
CysLeu: 0.707 ± 0.281
CysMet: 0.236 ± 0.145
CysAsn: 0.471 ± 0.251
CysPro: 0.55 ± 0.217
CysGln: 0.236 ± 0.165
CysArg: 0.785 ± 0.293
CysSer: 0.55 ± 0.182
CysThr: 0.0 ± 0.0
CysVal: 0.785 ± 0.247
CysTrp: 0.314 ± 0.166
CysTyr: 0.55 ± 0.207
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.318 ± 0.517
AspCys: 0.314 ± 0.174
AspAsp: 3.847 ± 0.617
AspGlu: 4.789 ± 0.627
AspPhe: 2.826 ± 0.454
AspGly: 4.711 ± 0.523
AspHis: 0.707 ± 0.265
AspIle: 4.318 ± 0.533
AspLys: 4.004 ± 0.682
AspLeu: 4.789 ± 0.498
AspMet: 2.434 ± 0.495
AspAsn: 2.826 ± 0.467
AspPro: 2.12 ± 0.677
AspGln: 2.277 ± 0.507
AspArg: 3.376 ± 0.585
AspSer: 1.806 ± 0.423
AspThr: 3.376 ± 0.638
AspVal: 5.103 ± 0.712
AspTrp: 1.178 ± 0.297
AspTyr: 1.884 ± 0.454
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.574 ± 0.835
GluCys: 0.393 ± 0.19
GluAsp: 4.946 ± 0.66
GluGlu: 6.909 ± 1.063
GluPhe: 3.612 ± 0.528
GluGly: 3.847 ± 0.515
GluHis: 1.727 ± 0.434
GluIle: 5.653 ± 0.601
GluLys: 6.83 ± 0.912
GluLeu: 8.087 ± 0.994
GluMet: 1.884 ± 0.387
GluAsn: 3.69 ± 0.573
GluPro: 1.57 ± 0.414
GluGln: 5.653 ± 0.787
GluArg: 4.397 ± 0.567
GluSer: 4.24 ± 0.679
GluThr: 4.318 ± 0.598
GluVal: 5.967 ± 0.817
GluTrp: 1.021 ± 0.414
GluTyr: 2.748 ± 0.469
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.277 ± 0.744
PheCys: 0.471 ± 0.184
PheAsp: 3.14 ± 0.53
PheGlu: 3.533 ± 0.55
PhePhe: 1.256 ± 0.377
PheGly: 2.983 ± 0.392
PheHis: 0.628 ± 0.252
PheIle: 2.983 ± 0.531
PheLys: 3.612 ± 0.504
PheLeu: 3.297 ± 0.549
PheMet: 0.707 ± 0.253
PheAsn: 1.963 ± 0.411
PhePro: 1.413 ± 0.337
PheGln: 1.963 ± 0.531
PheArg: 1.57 ± 0.423
PheSer: 2.512 ± 0.472
PheThr: 1.492 ± 0.486
PheVal: 1.963 ± 0.409
PheTrp: 0.471 ± 0.202
PheTyr: 1.413 ± 0.32
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.533 ± 0.811
GlyCys: 0.393 ± 0.175
GlyAsp: 4.711 ± 0.531
GlyGlu: 5.653 ± 0.749
GlyPhe: 2.355 ± 0.325
GlyGly: 4.946 ± 0.755
GlyHis: 1.413 ± 0.371
GlyIle: 4.632 ± 0.773
GlyLys: 5.81 ± 0.638
GlyLeu: 5.417 ± 0.742
GlyMet: 1.57 ± 0.307
GlyAsn: 2.983 ± 0.652
GlyPro: 0.471 ± 0.176
GlyGln: 1.727 ± 0.337
GlyArg: 2.512 ± 0.529
GlySer: 2.434 ± 0.377
GlyThr: 2.826 ± 0.446
GlyVal: 5.025 ± 0.675
GlyTrp: 1.021 ± 0.352
GlyTyr: 2.512 ± 0.479
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.256 ± 0.315
HisCys: 0.236 ± 0.143
HisAsp: 1.178 ± 0.298
HisGlu: 1.806 ± 0.379
HisPhe: 0.942 ± 0.304
HisGly: 0.942 ± 0.284
HisHis: 0.157 ± 0.097
HisIle: 1.099 ± 0.287
HisLys: 0.942 ± 0.284
HisLeu: 2.355 ± 0.465
HisMet: 0.314 ± 0.17
HisAsn: 0.707 ± 0.209
HisPro: 0.628 ± 0.228
HisGln: 1.099 ± 0.275
HisArg: 0.942 ± 0.265
HisSer: 0.707 ± 0.22
HisThr: 0.628 ± 0.26
HisVal: 0.864 ± 0.222
HisTrp: 0.157 ± 0.112
HisTyr: 0.471 ± 0.196
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.789 ± 0.995
IleCys: 1.099 ± 0.308
IleAsp: 4.004 ± 0.53
IleGlu: 6.202 ± 0.712
IlePhe: 2.12 ± 0.379
IleGly: 4.946 ± 0.725
IleHis: 0.942 ± 0.251
IleIle: 4.083 ± 0.758
IleLys: 6.595 ± 0.835
IleLeu: 5.339 ± 0.671
IleMet: 1.178 ± 0.446
IleAsn: 2.669 ± 0.486
IlePro: 3.062 ± 0.535
IleGln: 2.983 ± 0.513
IleArg: 3.14 ± 0.443
IleSer: 3.69 ± 0.515
IleThr: 3.455 ± 0.481
IleVal: 4.868 ± 0.584
IleTrp: 0.785 ± 0.245
IleTyr: 1.413 ± 0.375
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.731 ± 0.779
LysCys: 0.55 ± 0.209
LysAsp: 4.318 ± 0.609
LysGlu: 6.909 ± 0.708
LysPhe: 3.376 ± 0.506
LysGly: 4.946 ± 0.546
LysHis: 1.884 ± 0.487
LysIle: 5.103 ± 0.559
LysLys: 6.516 ± 0.819
LysLeu: 8.087 ± 0.684
LysMet: 2.512 ± 0.426
LysAsn: 4.004 ± 0.487
LysPro: 2.748 ± 0.447
LysGln: 3.062 ± 0.434
LysArg: 5.496 ± 0.694
LysSer: 4.711 ± 0.579
LysThr: 5.26 ± 0.717
LysVal: 4.632 ± 0.606
LysTrp: 1.57 ± 0.364
LysTyr: 3.376 ± 0.575
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.752 ± 0.785
LeuCys: 1.178 ± 0.323
LeuAsp: 5.103 ± 0.675
LeuGlu: 8.008 ± 0.855
LeuPhe: 3.376 ± 0.536
LeuGly: 5.339 ± 0.939
LeuHis: 1.963 ± 0.385
LeuIle: 4.868 ± 0.579
LeuLys: 6.83 ± 0.667
LeuLeu: 6.281 ± 0.732
LeuMet: 2.12 ± 0.455
LeuAsn: 5.496 ± 0.719
LeuPro: 3.69 ± 0.521
LeuGln: 3.847 ± 0.599
LeuArg: 5.103 ± 0.722
LeuSer: 6.438 ± 0.552
LeuThr: 6.045 ± 0.591
LeuVal: 4.004 ± 0.572
LeuTrp: 1.178 ± 0.445
LeuTyr: 2.434 ± 0.441
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.256 ± 0.374
MetCys: 0.314 ± 0.154
MetAsp: 0.785 ± 0.223
MetGlu: 2.355 ± 0.394
MetPhe: 0.864 ± 0.305
MetGly: 2.041 ± 0.405
MetHis: 0.55 ± 0.222
MetIle: 1.963 ± 0.326
MetLys: 2.748 ± 0.486
MetLeu: 2.748 ± 0.467
MetMet: 0.471 ± 0.197
MetAsn: 2.041 ± 0.557
MetPro: 1.178 ± 0.288
MetGln: 0.942 ± 0.344
MetArg: 1.492 ± 0.349
MetSer: 2.198 ± 0.433
MetThr: 1.256 ± 0.307
MetVal: 1.649 ± 0.422
MetTrp: 0.079 ± 0.071
MetTyr: 0.785 ± 0.23
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.24 ± 0.684
AsnCys: 0.628 ± 0.261
AsnAsp: 2.669 ± 0.456
AsnGlu: 2.748 ± 0.515
AsnPhe: 1.492 ± 0.291
AsnGly: 3.769 ± 0.663
AsnHis: 0.471 ± 0.175
AsnIle: 2.905 ± 0.504
AsnLys: 3.533 ± 0.426
AsnLeu: 4.397 ± 0.611
AsnMet: 1.178 ± 0.374
AsnAsn: 2.12 ± 0.491
AsnPro: 1.963 ± 0.377
AsnGln: 1.57 ± 0.349
AsnArg: 2.983 ± 0.6
AsnSer: 2.826 ± 0.469
AsnThr: 2.277 ± 0.477
AsnVal: 3.062 ± 0.463
AsnTrp: 0.393 ± 0.176
AsnTyr: 1.727 ± 0.417
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.57 ± 0.318
ProCys: 0.471 ± 0.181
ProAsp: 1.649 ± 0.46
ProGlu: 2.905 ± 0.455
ProPhe: 1.492 ± 0.315
ProGly: 1.492 ± 0.396
ProHis: 0.942 ± 0.278
ProIle: 2.591 ± 0.464
ProLys: 3.14 ± 0.652
ProLeu: 3.14 ± 0.415
ProMet: 0.942 ± 0.312
ProAsn: 1.57 ± 0.283
ProPro: 1.178 ± 0.352
ProGln: 0.942 ± 0.244
ProArg: 1.335 ± 0.352
ProSer: 2.041 ± 0.584
ProThr: 1.492 ± 0.325
ProVal: 2.434 ± 0.661
ProTrp: 0.157 ± 0.116
ProTyr: 0.864 ± 0.257
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.14 ± 0.615
GlnCys: 0.157 ± 0.13
GlnAsp: 2.041 ± 0.47
GlnGlu: 3.297 ± 0.532
GlnPhe: 1.806 ± 0.439
GlnGly: 1.727 ± 0.398
GlnHis: 1.021 ± 0.296
GlnIle: 2.512 ± 0.453
GlnLys: 2.748 ± 0.404
GlnLeu: 4.789 ± 0.606
GlnMet: 1.963 ± 0.356
GlnAsn: 1.57 ± 0.376
GlnPro: 1.021 ± 0.26
GlnGln: 1.335 ± 0.362
GlnArg: 2.041 ± 0.476
GlnSer: 1.806 ± 0.42
GlnThr: 1.884 ± 0.279
GlnVal: 2.826 ± 0.39
GlnTrp: 0.393 ± 0.197
GlnTyr: 1.806 ± 0.465
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.826 ± 0.344
ArgCys: 0.314 ± 0.131
ArgAsp: 2.591 ± 0.523
ArgGlu: 4.161 ± 0.776
ArgPhe: 1.806 ± 0.444
ArgGly: 2.355 ± 0.516
ArgHis: 1.335 ± 0.363
ArgIle: 4.161 ± 0.711
ArgLys: 5.103 ± 0.631
ArgLeu: 4.868 ± 0.728
ArgMet: 1.492 ± 0.293
ArgAsn: 2.748 ± 0.43
ArgPro: 1.492 ± 0.379
ArgGln: 1.963 ± 0.464
ArgArg: 3.297 ± 0.659
ArgSer: 2.198 ± 0.473
ArgThr: 3.062 ± 0.548
ArgVal: 3.297 ± 0.555
ArgTrp: 1.099 ± 0.332
ArgTyr: 2.355 ± 0.489
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.004 ± 0.701
SerCys: 0.707 ± 0.213
SerAsp: 2.12 ± 0.374
SerGlu: 4.083 ± 0.667
SerPhe: 2.591 ± 0.521
SerGly: 3.69 ± 0.576
SerHis: 0.942 ± 0.348
SerIle: 4.397 ± 0.693
SerLys: 4.789 ± 0.591
SerLeu: 4.083 ± 0.62
SerMet: 1.963 ± 0.385
SerAsn: 2.591 ± 0.528
SerPro: 1.806 ± 0.329
SerGln: 1.413 ± 0.365
SerArg: 3.69 ± 0.611
SerSer: 2.669 ± 0.505
SerThr: 2.748 ± 0.403
SerVal: 2.512 ± 0.49
SerTrp: 0.393 ± 0.183
SerTyr: 2.041 ± 0.383
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.826 ± 0.513
ThrCys: 0.628 ± 0.204
ThrAsp: 2.983 ± 0.608
ThrGlu: 3.612 ± 0.527
ThrPhe: 1.727 ± 0.294
ThrGly: 3.219 ± 0.468
ThrHis: 0.314 ± 0.142
ThrIle: 3.455 ± 0.448
ThrLys: 5.025 ± 0.699
ThrLeu: 5.025 ± 0.866
ThrMet: 1.256 ± 0.326
ThrAsn: 2.277 ± 0.467
ThrPro: 2.041 ± 0.483
ThrGln: 1.492 ± 0.413
ThrArg: 2.277 ± 0.47
ThrSer: 2.512 ± 0.394
ThrThr: 3.219 ± 0.593
ThrVal: 3.769 ± 0.478
ThrTrp: 0.707 ± 0.216
ThrTyr: 2.512 ± 0.553
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.161 ± 0.478
ValCys: 0.707 ± 0.3
ValAsp: 5.967 ± 0.616
ValGlu: 5.26 ± 0.742
ValPhe: 2.983 ± 0.524
ValGly: 2.826 ± 0.391
ValHis: 0.471 ± 0.186
ValIle: 4.632 ± 1.235
ValLys: 5.574 ± 0.719
ValLeu: 5.731 ± 0.596
ValMet: 1.649 ± 0.414
ValAsn: 3.297 ± 0.502
ValPro: 2.277 ± 0.399
ValGln: 2.198 ± 0.454
ValArg: 2.434 ± 0.387
ValSer: 4.161 ± 0.64
ValThr: 3.376 ± 0.616
ValVal: 4.711 ± 0.594
ValTrp: 0.864 ± 0.308
ValTyr: 2.277 ± 0.486
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.785 ± 0.248
TrpCys: 0.471 ± 0.189
TrpAsp: 1.099 ± 0.291
TrpGlu: 0.707 ± 0.212
TrpPhe: 0.314 ± 0.16
TrpGly: 0.864 ± 0.309
TrpHis: 0.236 ± 0.141
TrpIle: 0.55 ± 0.195
TrpLys: 1.178 ± 0.307
TrpLeu: 1.57 ± 0.262
TrpMet: 0.236 ± 0.132
TrpAsn: 0.55 ± 0.251
TrpPro: 0.079 ± 0.085
TrpGln: 0.393 ± 0.183
TrpArg: 1.021 ± 0.291
TrpSer: 0.785 ± 0.344
TrpThr: 0.236 ± 0.154
TrpVal: 0.942 ± 0.278
TrpTrp: 0.314 ± 0.153
TrpTyr: 0.942 ± 0.252
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.649 ± 0.392
TyrCys: 0.864 ± 0.263
TyrAsp: 2.983 ± 0.665
TyrGlu: 3.455 ± 0.578
TyrPhe: 1.099 ± 0.3
TyrGly: 1.884 ± 0.414
TyrHis: 0.628 ± 0.199
TyrIle: 2.355 ± 0.512
TyrLys: 3.297 ± 0.507
TyrLeu: 3.062 ± 0.548
TyrMet: 1.256 ± 0.288
TyrAsn: 1.492 ± 0.404
TyrPro: 1.256 ± 0.376
TyrGln: 1.963 ± 0.395
TyrArg: 1.649 ± 0.349
TyrSer: 1.335 ± 0.258
TyrThr: 1.256 ± 0.333
TyrVal: 2.748 ± 0.591
TyrTrp: 0.236 ± 0.147
TyrTyr: 1.57 ± 0.432
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (12738 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
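
One way to picture a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The function names, the use of the sample standard deviation as the error measure, and the fixed random seed are assumptions for this illustration; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
from statistics import stdev


def dipeptide_permille(proteins, pair):
    """Frequency of one dipeptide in per mille over all given sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation of the
    frequency across resampled proteomes.
    """
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return stdev(estimates)


# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = ["MKKLA", "MKA", "AAKLW"]
print(bootstrap_error(toy, "KL"))
print(bootstrap_error(toy, "WW"))  # a pair that never occurs
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins. A dipeptide that is absent from every protein gets an error of zero, which matches the `0.0 ± 0.0` entries in the Xaa rows and columns.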

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski