Amino acid dipeptide frequency for Escherichia phage megetsur

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
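As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping adjacent pairs in each protein and reports them in per mille. It is a minimal sketch, not the database's own procedure: the function name dipeptide_per_mille is hypothetical, sequences are assumed to use one-letter codes (the table uses three-letter codes), and pairs are assumed never to span two proteins.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each protein,
    and no pair is formed across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        seq = seq.upper()
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real phage proteins.
freqs = dipeptide_per_mille(["MAAK", "AAL"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The two toy sequences contain five dipeptides in total, two of which are AA, so AA comes out at 400 per mille.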

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
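The uniform expectation and the red/blue marking can be sketched as follows. This assumes the comparison is made against the flat 1/400 baseline described above; the exact threshold used for colouring the table is not stated here, and the function names are hypothetical.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Return 'red' for overrepresented, 'blue' for underrepresented.

    Assumption: the baseline is the flat uniform expectation.
    """
    expected = uniform_expectation_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "red"
    if observed_per_mille < expected:
        return "blue"
    return "none"


print(uniform_expectation_per_mille())    # 20 amino acids
print(uniform_expectation_per_mille(21))  # including Xaa
print(classify(13.223))  # AlaAla value from the table
print(classify(0.076))   # CysCys value from the table
```

With 20 amino acids the baseline is 2.5‰, so AlaAla (13.223‰) falls on the overrepresented side and CysCys (0.076‰) on the underrepresented side.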

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
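For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper names below are hypothetical and serve only to illustrate the arithmetic.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


ala_ala = 13.223  # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
print(f"{fraction:.6f} {percent:.4f}%")
```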

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.223 ± 1.452
AlaCys: 0.912 ± 0.276
AlaAsp: 5.168 ± 0.556
AlaGlu: 7.143 ± 0.937
AlaPhe: 3.496 ± 0.535
AlaGly: 7.675 ± 0.813
AlaHis: 1.976 ± 0.379
AlaIle: 4.56 ± 0.481
AlaLys: 4.94 ± 0.709
AlaLeu: 9.119 ± 1.018
AlaMet: 2.736 ± 0.594
AlaAsn: 4.028 ± 0.735
AlaPro: 4.028 ± 0.711
AlaGln: 5.472 ± 0.912
AlaArg: 5.32 ± 0.56
AlaSer: 6.383 ± 0.86
AlaThr: 6.079 ± 0.907
AlaVal: 6.839 ± 0.657
AlaTrp: 1.216 ± 0.271
AlaTyr: 3.648 ± 0.558
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.988 ± 0.32
CysCys: 0.076 ± 0.065
CysAsp: 0.304 ± 0.185
CysGlu: 0.456 ± 0.189
CysPhe: 0.304 ± 0.125
CysGly: 0.836 ± 0.315
CysHis: 0.152 ± 0.162
CysIle: 0.532 ± 0.2
CysLys: 0.532 ± 0.225
CysLeu: 0.76 ± 0.219
CysMet: 0.456 ± 0.184
CysAsn: 0.38 ± 0.204
CysPro: 0.608 ± 0.242
CysGln: 0.152 ± 0.106
CysArg: 0.608 ± 0.243
CysSer: 0.608 ± 0.202
CysThr: 0.304 ± 0.141
CysVal: 0.76 ± 0.219
CysTrp: 0.152 ± 0.102
CysTyr: 0.228 ± 0.115
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.155 ± 0.689
AspCys: 0.456 ± 0.221
AspAsp: 4.94 ± 0.741
AspGlu: 4.332 ± 0.541
AspPhe: 1.824 ± 0.397
AspGly: 6.155 ± 0.794
AspHis: 0.836 ± 0.252
AspIle: 3.724 ± 0.573
AspLys: 2.508 ± 0.435
AspLeu: 4.028 ± 0.499
AspMet: 1.748 ± 0.575
AspAsn: 2.508 ± 0.355
AspPro: 2.812 ± 0.48
AspGln: 1.14 ± 0.324
AspArg: 2.964 ± 0.453
AspSer: 3.952 ± 0.575
AspThr: 3.572 ± 0.435
AspVal: 3.04 ± 0.519
AspTrp: 1.216 ± 0.357
AspTyr: 2.28 ± 0.362
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.003 ± 0.788
GluCys: 0.532 ± 0.201
GluAsp: 3.648 ± 0.573
GluGlu: 3.724 ± 0.595
GluPhe: 3.04 ± 0.541
GluGly: 3.8 ± 0.479
GluHis: 1.368 ± 0.327
GluIle: 2.052 ± 0.29
GluLys: 2.66 ± 0.5
GluLeu: 5.32 ± 0.758
GluMet: 1.976 ± 0.339
GluAsn: 2.432 ± 0.471
GluPro: 2.128 ± 0.423
GluGln: 3.724 ± 0.495
GluArg: 3.648 ± 0.584
GluSer: 2.964 ± 0.505
GluThr: 4.028 ± 0.48
GluVal: 5.168 ± 0.766
GluTrp: 1.14 ± 0.314
GluTyr: 2.432 ± 0.51
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.584 ± 0.429
PheCys: 0.608 ± 0.195
PheAsp: 3.116 ± 0.504
PheGlu: 2.052 ± 0.365
PhePhe: 1.064 ± 0.226
PheGly: 2.888 ± 0.429
PheHis: 0.38 ± 0.146
PheIle: 2.66 ± 0.392
PheLys: 1.672 ± 0.444
PheLeu: 2.052 ± 0.364
PheMet: 1.216 ± 0.273
PheAsn: 2.052 ± 0.444
PhePro: 1.748 ± 0.32
PheGln: 1.216 ± 0.378
PheArg: 1.672 ± 0.389
PheSer: 1.9 ± 0.497
PheThr: 1.976 ± 0.305
PheVal: 2.052 ± 0.244
PheTrp: 0.532 ± 0.216
PheTyr: 0.684 ± 0.235
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.599 ± 0.836
GlyCys: 0.836 ± 0.245
GlyAsp: 3.952 ± 0.593
GlyGlu: 4.788 ± 0.571
GlyPhe: 1.976 ± 0.431
GlyGly: 6.459 ± 0.914
GlyHis: 1.52 ± 0.432
GlyIle: 4.484 ± 0.553
GlyLys: 4.408 ± 0.665
GlyLeu: 6.155 ± 0.672
GlyMet: 2.28 ± 0.384
GlyAsn: 3.344 ± 0.573
GlyPro: 1.52 ± 0.313
GlyGln: 4.18 ± 0.726
GlyArg: 5.168 ± 0.511
GlySer: 5.092 ± 0.712
GlyThr: 5.852 ± 1.006
GlyVal: 4.864 ± 0.587
GlyTrp: 1.216 ± 0.257
GlyTyr: 2.964 ± 0.393
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.596 ± 0.377
HisCys: 0.304 ± 0.155
HisAsp: 1.292 ± 0.394
HisGlu: 1.216 ± 0.286
HisPhe: 0.76 ± 0.209
HisGly: 1.444 ± 0.349
HisHis: 0.76 ± 0.239
HisIle: 0.456 ± 0.169
HisLys: 0.988 ± 0.25
HisLeu: 1.824 ± 0.365
HisMet: 0.684 ± 0.177
HisAsn: 0.76 ± 0.24
HisPro: 0.608 ± 0.24
HisGln: 0.456 ± 0.167
HisArg: 0.76 ± 0.253
HisSer: 1.064 ± 0.285
HisThr: 1.292 ± 0.299
HisVal: 1.368 ± 0.38
HisTrp: 0.304 ± 0.177
HisTyr: 0.76 ± 0.184
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.952 ± 0.548
IleCys: 0.532 ± 0.273
IleAsp: 3.42 ± 0.493
IleGlu: 3.04 ± 0.436
IlePhe: 1.368 ± 0.298
IleGly: 3.496 ± 0.485
IleHis: 0.912 ± 0.222
IleIle: 2.964 ± 0.53
IleLys: 1.9 ± 0.394
IleLeu: 2.736 ± 0.38
IleMet: 1.292 ± 0.282
IleAsn: 2.052 ± 0.386
IlePro: 2.964 ± 0.552
IleGln: 2.128 ± 0.301
IleArg: 3.268 ± 0.492
IleSer: 2.204 ± 0.409
IleThr: 2.812 ± 0.57
IleVal: 3.8 ± 0.5
IleTrp: 0.304 ± 0.155
IleTyr: 1.368 ± 0.348
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.852 ± 0.813
LysCys: 0.456 ± 0.187
LysAsp: 2.812 ± 0.673
LysGlu: 2.66 ± 0.457
LysPhe: 1.672 ± 0.361
LysGly: 2.508 ± 0.44
LysHis: 0.988 ± 0.322
LysIle: 0.532 ± 0.182
LysLys: 1.52 ± 0.365
LysLeu: 4.788 ± 0.841
LysMet: 1.216 ± 0.284
LysAsn: 0.988 ± 0.261
LysPro: 2.052 ± 0.338
LysGln: 3.04 ± 0.581
LysArg: 2.356 ± 0.418
LysSer: 3.04 ± 0.469
LysThr: 2.356 ± 0.357
LysVal: 3.192 ± 0.58
LysTrp: 0.836 ± 0.277
LysTyr: 0.836 ± 0.214
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.271 ± 0.775
LeuCys: 0.456 ± 0.204
LeuAsp: 4.788 ± 0.499
LeuGlu: 5.244 ± 0.474
LeuPhe: 3.572 ± 0.55
LeuGly: 5.852 ± 0.78
LeuHis: 1.9 ± 0.398
LeuIle: 3.04 ± 0.511
LeuLys: 2.964 ± 0.405
LeuLeu: 6.687 ± 0.751
LeuMet: 2.128 ± 0.336
LeuAsn: 3.952 ± 0.536
LeuPro: 4.256 ± 0.585
LeuGln: 4.028 ± 0.514
LeuArg: 3.876 ± 0.589
LeuSer: 5.168 ± 0.675
LeuThr: 5.016 ± 0.645
LeuVal: 5.244 ± 0.635
LeuTrp: 1.368 ± 0.349
LeuTyr: 2.508 ± 0.413
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.42 ± 0.439
MetCys: 0.304 ± 0.131
MetAsp: 1.14 ± 0.319
MetGlu: 0.76 ± 0.209
MetPhe: 0.684 ± 0.225
MetGly: 2.204 ± 0.461
MetHis: 0.684 ± 0.204
MetIle: 0.836 ± 0.292
MetLys: 1.444 ± 0.338
MetLeu: 2.736 ± 0.527
MetMet: 1.444 ± 0.397
MetAsn: 1.444 ± 0.267
MetPro: 1.216 ± 0.281
MetGln: 2.204 ± 0.411
MetArg: 1.596 ± 0.364
MetSer: 1.824 ± 0.364
MetThr: 2.128 ± 0.392
MetVal: 1.976 ± 0.335
MetTrp: 0.532 ± 0.269
MetTyr: 1.064 ± 0.233
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.028 ± 0.571
AsnCys: 0.38 ± 0.2
AsnAsp: 1.748 ± 0.321
AsnGlu: 2.66 ± 0.385
AsnPhe: 1.444 ± 0.338
AsnGly: 3.192 ± 0.554
AsnHis: 0.836 ± 0.26
AsnIle: 2.736 ± 0.556
AsnLys: 2.736 ± 0.354
AsnLeu: 3.116 ± 0.479
AsnMet: 1.444 ± 0.355
AsnAsn: 1.9 ± 0.392
AsnPro: 2.508 ± 0.579
AsnGln: 1.824 ± 0.421
AsnArg: 1.824 ± 0.313
AsnSer: 2.584 ± 0.566
AsnThr: 2.356 ± 0.419
AsnVal: 4.256 ± 0.675
AsnTrp: 0.608 ± 0.177
AsnTyr: 0.532 ± 0.214
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.104 ± 0.491
ProCys: 0.456 ± 0.181
ProAsp: 3.724 ± 0.445
ProGlu: 3.344 ± 0.581
ProPhe: 1.216 ± 0.298
ProGly: 3.04 ± 0.485
ProHis: 0.684 ± 0.252
ProIle: 1.444 ± 0.322
ProLys: 1.292 ± 0.382
ProLeu: 2.508 ± 0.521
ProMet: 0.836 ± 0.218
ProAsn: 2.584 ± 0.432
ProPro: 1.52 ± 0.421
ProGln: 2.432 ± 1.148
ProArg: 1.976 ± 0.327
ProSer: 2.888 ± 0.466
ProThr: 2.66 ± 0.509
ProVal: 3.496 ± 0.586
ProTrp: 0.608 ± 0.193
ProTyr: 1.9 ± 0.48
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.7 ± 0.959
GlnCys: 0.228 ± 0.121
GlnAsp: 2.584 ± 0.401
GlnGlu: 2.508 ± 0.386
GlnPhe: 2.356 ± 0.467
GlnGly: 3.572 ± 0.745
GlnHis: 0.988 ± 0.189
GlnIle: 1.824 ± 0.433
GlnLys: 1.672 ± 0.399
GlnLeu: 3.724 ± 0.703
GlnMet: 1.824 ± 0.363
GlnAsn: 1.748 ± 0.628
GlnPro: 2.28 ± 1.144
GlnGln: 3.724 ± 1.065
GlnArg: 3.952 ± 0.639
GlnSer: 1.824 ± 0.361
GlnThr: 2.964 ± 0.61
GlnVal: 3.268 ± 0.571
GlnTrp: 0.684 ± 0.175
GlnTyr: 2.432 ± 0.41
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.168 ± 0.737
ArgCys: 0.608 ± 0.205
ArgAsp: 3.42 ± 0.535
ArgGlu: 3.876 ± 0.533
ArgPhe: 2.28 ± 0.452
ArgGly: 4.712 ± 0.439
ArgHis: 0.988 ± 0.292
ArgIle: 2.964 ± 0.475
ArgLys: 2.736 ± 0.385
ArgLeu: 4.636 ± 0.535
ArgMet: 1.824 ± 0.35
ArgAsn: 1.596 ± 0.415
ArgPro: 1.748 ± 0.48
ArgGln: 2.128 ± 0.323
ArgArg: 3.8 ± 0.606
ArgSer: 2.432 ± 0.418
ArgThr: 3.04 ± 0.542
ArgVal: 4.484 ± 0.657
ArgTrp: 0.836 ± 0.263
ArgTyr: 2.584 ± 0.345
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.155 ± 0.562
SerCys: 0.152 ± 0.115
SerAsp: 2.964 ± 0.461
SerGlu: 2.964 ± 0.449
SerPhe: 2.052 ± 0.382
SerGly: 5.548 ± 0.439
SerHis: 0.76 ± 0.242
SerIle: 3.42 ± 0.463
SerLys: 2.66 ± 0.384
SerLeu: 4.408 ± 0.573
SerMet: 1.52 ± 0.271
SerAsn: 2.66 ± 0.442
SerPro: 3.42 ± 0.495
SerGln: 2.66 ± 0.537
SerArg: 2.888 ± 0.537
SerSer: 2.812 ± 0.754
SerThr: 3.952 ± 0.741
SerVal: 3.876 ± 0.573
SerTrp: 0.76 ± 0.262
SerTyr: 2.28 ± 0.508
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.371 ± 0.865
ThrCys: 0.38 ± 0.163
ThrAsp: 3.04 ± 0.536
ThrGlu: 2.812 ± 0.478
ThrPhe: 1.748 ± 0.393
ThrGly: 5.776 ± 0.585
ThrHis: 0.684 ± 0.192
ThrIle: 2.508 ± 0.395
ThrLys: 2.964 ± 0.483
ThrLeu: 6.079 ± 0.572
ThrMet: 1.444 ± 0.33
ThrAsn: 3.116 ± 0.725
ThrPro: 2.66 ± 0.558
ThrGln: 2.66 ± 0.429
ThrArg: 2.736 ± 0.311
ThrSer: 4.104 ± 0.593
ThrThr: 3.268 ± 0.765
ThrVal: 5.016 ± 0.826
ThrTrp: 0.684 ± 0.18
ThrTyr: 2.508 ± 0.369
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.459 ± 0.828
ValCys: 0.684 ± 0.249
ValAsp: 4.864 ± 0.587
ValGlu: 5.168 ± 0.646
ValPhe: 1.9 ± 0.521
ValGly: 5.548 ± 0.488
ValHis: 1.292 ± 0.297
ValIle: 3.496 ± 0.576
ValLys: 2.812 ± 0.516
ValLeu: 5.548 ± 0.529
ValMet: 1.672 ± 0.338
ValAsn: 3.116 ± 0.525
ValPro: 2.812 ± 0.527
ValGln: 3.876 ± 0.626
ValArg: 4.408 ± 0.842
ValSer: 3.876 ± 0.55
ValThr: 5.168 ± 0.837
ValVal: 5.928 ± 0.614
ValTrp: 0.988 ± 0.294
ValTyr: 2.28 ± 0.375
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.836 ± 0.253
TrpCys: 0.456 ± 0.185
TrpAsp: 1.064 ± 0.276
TrpGlu: 0.836 ± 0.221
TrpPhe: 0.76 ± 0.211
TrpGly: 0.912 ± 0.282
TrpHis: 0.304 ± 0.124
TrpIle: 0.532 ± 0.176
TrpLys: 0.076 ± 0.085
TrpLeu: 2.204 ± 0.466
TrpMet: 0.38 ± 0.168
TrpAsn: 0.38 ± 0.164
TrpPro: 0.532 ± 0.2
TrpGln: 0.912 ± 0.216
TrpArg: 0.608 ± 0.188
TrpSer: 1.14 ± 0.263
TrpThr: 0.608 ± 0.189
TrpVal: 0.988 ± 0.337
TrpTrp: 0.456 ± 0.172
TrpTyr: 0.532 ± 0.22
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.496 ± 0.583
TyrCys: 0.38 ± 0.17
TyrAsp: 2.204 ± 0.447
TyrGlu: 2.28 ± 0.525
TyrPhe: 0.608 ± 0.196
TyrGly: 3.04 ± 0.48
TyrHis: 0.684 ± 0.228
TyrIle: 1.748 ± 0.457
TyrLys: 1.216 ± 0.334
TyrLeu: 2.888 ± 0.565
TyrMet: 1.444 ± 0.277
TyrAsn: 1.672 ± 0.296
TyrPro: 1.292 ± 0.297
TyrGln: 1.9 ± 0.357
TyrArg: 2.432 ± 0.496
TyrSer: 2.052 ± 0.398
TyrThr: 2.128 ± 0.635
TyrVal: 2.28 ± 0.431
TyrTrp: 0.076 ± 0.071
TyrTyr: 1.444 ± 0.3
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (13160 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
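A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. This is a minimal sketch under stated assumptions; in particular, taking the ± value to be the sample standard deviation across replicates is an assumption, and the function names and toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates.

    Assumption: the error is the sample standard deviation of the replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real phage proteins.
toy = ["MAAK", "AAL", "MKKL", "GAAG"]
err = bootstrap_error(toy, "AA")
print(round(pair_per_mille(toy, "AA"), 3), "±", round(err, 3))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.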

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski