Amino acid dipeptide frequency for Paenibacillus phage vB_PlaP_API480

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
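The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline used by the database: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count pairs within each protein (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: pairs are counted inside each protein only, and every
    non-standard letter is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in AMINO_ACIDS else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
UNIFORM_EXPECTED = 1000.0 / 400

toy_proteins = ["MKLAEEL", "MAEELKK"]  # made-up sequences, not from this proteome
freqs = dipeptide_per_mille(toy_proteins)

# Dipeptides above the uniform expectation ("red" in the table's colouring).
over = sorted(dp for dp, f in freqs.items() if f > UNIFORM_EXPECTED)
```

With only two short toy sequences every observed pair lands above 2.5‰; on a real proteome the split between over- and underrepresented dipeptides is far more informative.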

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
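As a small worked example of the unit conversion, the sketch below turns one table value into a fraction and compares it with the uniform expectation of 1/400. The helper names are hypothetical; the GluGlu value is taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def ratio_to_uniform(value_per_mille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation (2.5 per mille)."""
    expected = 1000.0 / n_dipeptides
    return value_per_mille / expected


glu_glu = 9.208  # GluGlu frequency from the table, in per mille
fraction = per_mille_to_fraction(glu_glu)  # about 0.0092, i.e. about 0.92%
ratio = ratio_to_uniform(glu_glu)          # roughly 3.7 times the uniform expectation
```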

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.423 ± 0.824
AlaCys: 0.653 ± 0.203
AlaAsp: 3.988 ± 0.518
AlaGlu: 4.495 ± 0.596
AlaPhe: 2.9 ± 0.456
AlaGly: 4.495 ± 0.952
AlaHis: 0.725 ± 0.21
AlaIle: 3.843 ± 0.556
AlaLys: 5.8 ± 0.946
AlaLeu: 7.178 ± 0.955
AlaMet: 1.885 ± 0.351
AlaAsn: 3.263 ± 0.548
AlaPro: 2.175 ± 0.4
AlaGln: 2.103 ± 0.477
AlaArg: 2.755 ± 0.529
AlaSer: 3.045 ± 0.455
AlaThr: 2.32 ± 0.424
AlaVal: 3.263 ± 0.593
AlaTrp: 0.58 ± 0.265
AlaTyr: 2.755 ± 0.452
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.87 ± 0.278
CysCys: 0.145 ± 0.101
CysAsp: 0.653 ± 0.226
CysGlu: 0.87 ± 0.299
CysPhe: 0.29 ± 0.129
CysGly: 1.233 ± 0.361
CysHis: 0.073 ± 0.065
CysIle: 0.653 ± 0.253
CysLys: 1.233 ± 0.338
CysLeu: 0.725 ± 0.244
CysMet: 0.145 ± 0.084
CysAsn: 0.29 ± 0.175
CysPro: 0.943 ± 0.269
CysGln: 0.145 ± 0.107
CysArg: 0.87 ± 0.258
CysSer: 0.58 ± 0.207
CysThr: 0.218 ± 0.131
CysVal: 0.29 ± 0.151
CysTrp: 0.218 ± 0.124
CysTyr: 0.87 ± 0.24
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.263 ± 0.368
AspCys: 0.87 ± 0.249
AspAsp: 4.785 ± 0.679
AspGlu: 5.003 ± 0.631
AspPhe: 4.06 ± 0.539
AspGly: 3.263 ± 0.52
AspHis: 1.595 ± 0.234
AspIle: 5.365 ± 0.583
AspLys: 5.22 ± 0.749
AspLeu: 5.293 ± 0.633
AspMet: 2.03 ± 0.333
AspAsn: 2.248 ± 0.382
AspPro: 2.61 ± 0.55
AspGln: 1.668 ± 0.395
AspArg: 2.973 ± 0.511
AspSer: 3.19 ± 0.438
AspThr: 2.248 ± 0.385
AspVal: 4.858 ± 0.526
AspTrp: 1.088 ± 0.302
AspTyr: 2.755 ± 0.456
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.713 ± 0.614
GluCys: 0.943 ± 0.297
GluAsp: 5.51 ± 0.624
GluGlu: 9.208 ± 1.077
GluPhe: 2.683 ± 0.503
GluGly: 4.35 ± 0.593
GluHis: 1.595 ± 0.361
GluIle: 6.09 ± 0.787
GluLys: 7.613 ± 0.81
GluLeu: 8.555 ± 0.655
GluMet: 2.61 ± 0.384
GluAsn: 3.19 ± 0.408
GluPro: 2.393 ± 0.546
GluGln: 3.335 ± 0.504
GluArg: 4.785 ± 0.511
GluSer: 4.568 ± 0.583
GluThr: 3.48 ± 0.328
GluVal: 5.438 ± 0.546
GluTrp: 1.305 ± 0.325
GluTyr: 3.553 ± 0.46
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.958 ± 0.431
PheCys: 0.58 ± 0.216
PheAsp: 2.175 ± 0.369
PheGlu: 3.118 ± 0.459
PhePhe: 1.233 ± 0.368
PheGly: 2.755 ± 0.516
PheHis: 1.015 ± 0.266
PheIle: 3.19 ± 0.54
PheLys: 2.103 ± 0.389
PheLeu: 3.045 ± 0.414
PheMet: 1.305 ± 0.352
PheAsn: 2.32 ± 0.446
PhePro: 2.03 ± 0.343
PheGln: 1.74 ± 0.345
PheArg: 1.595 ± 0.319
PheSer: 2.465 ± 0.354
PheThr: 1.813 ± 0.302
PheVal: 2.465 ± 0.392
PheTrp: 0.435 ± 0.217
PheTyr: 1.595 ± 0.341
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.973 ± 0.477
GlyCys: 0.508 ± 0.227
GlyAsp: 3.843 ± 0.665
GlyGlu: 3.915 ± 0.521
GlyPhe: 2.175 ± 0.421
GlyGly: 4.35 ± 0.627
GlyHis: 1.595 ± 0.319
GlyIle: 5.003 ± 0.672
GlyLys: 6.453 ± 1.089
GlyLeu: 5.438 ± 0.608
GlyMet: 2.175 ± 0.371
GlyAsn: 3.045 ± 0.459
GlyPro: 0.58 ± 0.166
GlyGln: 1.958 ± 0.374
GlyArg: 3.335 ± 0.45
GlySer: 3.625 ± 0.666
GlyThr: 3.77 ± 0.444
GlyVal: 3.915 ± 0.573
GlyTrp: 0.29 ± 0.143
GlyTyr: 3.698 ± 0.508
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.508 ± 0.201
HisCys: 0.508 ± 0.21
HisAsp: 1.305 ± 0.309
HisGlu: 1.45 ± 0.328
HisPhe: 0.725 ± 0.224
HisGly: 1.305 ± 0.242
HisHis: 0.29 ± 0.138
HisIle: 1.233 ± 0.299
HisLys: 1.015 ± 0.221
HisLeu: 2.03 ± 0.341
HisMet: 1.015 ± 0.272
HisAsn: 0.653 ± 0.266
HisPro: 0.435 ± 0.14
HisGln: 1.233 ± 0.321
HisArg: 1.015 ± 0.283
HisSer: 0.87 ± 0.253
HisThr: 1.088 ± 0.261
HisVal: 0.798 ± 0.23
HisTrp: 0.29 ± 0.156
HisTyr: 0.798 ± 0.233
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.48 ± 0.564
IleCys: 1.015 ± 0.351
IleAsp: 5.51 ± 0.647
IleGlu: 5.365 ± 0.553
IlePhe: 2.683 ± 0.561
IleGly: 4.278 ± 0.519
IleHis: 1.378 ± 0.332
IleIle: 4.64 ± 0.715
IleLys: 4.35 ± 0.519
IleLeu: 4.785 ± 0.674
IleMet: 1.885 ± 0.329
IleAsn: 3.263 ± 0.394
IlePro: 3.553 ± 0.501
IleGln: 2.755 ± 0.448
IleArg: 3.408 ± 0.442
IleSer: 3.915 ± 0.536
IleThr: 2.828 ± 0.418
IleVal: 4.93 ± 0.65
IleTrp: 0.58 ± 0.265
IleTyr: 2.538 ± 0.381
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.945 ± 0.885
LysCys: 0.87 ± 0.254
LysAsp: 5.075 ± 0.631
LysGlu: 8.845 ± 0.91
LysPhe: 2.755 ± 0.489
LysGly: 5.22 ± 0.717
LysHis: 1.305 ± 0.362
LysIle: 4.495 ± 0.508
LysLys: 6.38 ± 0.798
LysLeu: 8.773 ± 0.748
LysMet: 2.61 ± 0.558
LysAsn: 4.568 ± 0.643
LysPro: 2.61 ± 0.535
LysGln: 3.408 ± 0.64
LysArg: 3.698 ± 0.529
LysSer: 4.133 ± 0.427
LysThr: 5.293 ± 0.522
LysVal: 5.148 ± 0.534
LysTrp: 1.088 ± 0.292
LysTyr: 4.278 ± 0.487
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.178 ± 0.831
LeuCys: 1.015 ± 0.317
LeuAsp: 6.09 ± 0.646
LeuGlu: 8.338 ± 0.677
LeuPhe: 2.9 ± 0.375
LeuGly: 5.22 ± 0.657
LeuHis: 1.595 ± 0.329
LeuIle: 5.583 ± 0.744
LeuLys: 7.25 ± 0.716
LeuLeu: 5.583 ± 0.637
LeuMet: 1.74 ± 0.383
LeuAsn: 3.843 ± 0.416
LeuPro: 3.045 ± 0.549
LeuGln: 2.973 ± 0.581
LeuArg: 4.205 ± 0.531
LeuSer: 5.873 ± 0.701
LeuThr: 3.698 ± 0.41
LeuVal: 5.8 ± 0.792
LeuTrp: 1.088 ± 0.265
LeuTyr: 3.408 ± 0.691
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.813 ± 0.328
MetCys: 0.218 ± 0.115
MetAsp: 1.523 ± 0.276
MetGlu: 2.828 ± 0.503
MetPhe: 1.305 ± 0.288
MetGly: 1.378 ± 0.357
MetHis: 0.798 ± 0.236
MetIle: 1.668 ± 0.329
MetLys: 2.973 ± 0.488
MetLeu: 2.538 ± 0.392
MetMet: 0.508 ± 0.271
MetAsn: 1.668 ± 0.373
MetPro: 0.508 ± 0.193
MetGln: 1.015 ± 0.317
MetArg: 1.958 ± 0.374
MetSer: 1.45 ± 0.315
MetThr: 1.305 ± 0.278
MetVal: 1.813 ± 0.393
MetTrp: 0.363 ± 0.161
MetTyr: 1.088 ± 0.218
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.205 ± 0.656
AsnCys: 0.508 ± 0.2
AsnAsp: 1.958 ± 0.414
AsnGlu: 3.553 ± 0.512
AsnPhe: 1.523 ± 0.322
AsnGly: 2.755 ± 0.473
AsnHis: 1.16 ± 0.289
AsnIle: 2.9 ± 0.362
AsnLys: 4.93 ± 0.751
AsnLeu: 3.843 ± 0.497
AsnMet: 1.16 ± 0.264
AsnAsn: 2.393 ± 0.466
AsnPro: 2.103 ± 0.309
AsnGln: 1.74 ± 0.369
AsnArg: 2.03 ± 0.364
AsnSer: 1.885 ± 0.271
AsnThr: 2.538 ± 0.464
AsnVal: 2.973 ± 0.434
AsnTrp: 0.58 ± 0.171
AsnTyr: 2.828 ± 0.494
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.885 ± 0.448
ProCys: 0.218 ± 0.152
ProAsp: 2.393 ± 0.402
ProGlu: 2.755 ± 0.332
ProPhe: 2.175 ± 0.364
ProGly: 1.16 ± 0.237
ProHis: 0.725 ± 0.239
ProIle: 1.74 ± 0.398
ProLys: 3.335 ± 0.536
ProLeu: 2.32 ± 0.384
ProMet: 0.58 ± 0.186
ProAsn: 2.538 ± 0.509
ProPro: 0.87 ± 0.258
ProGln: 1.523 ± 0.336
ProArg: 1.088 ± 0.228
ProSer: 2.248 ± 0.366
ProThr: 1.45 ± 0.308
ProVal: 2.393 ± 0.421
ProTrp: 0.29 ± 0.15
ProTyr: 1.305 ± 0.237
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.553 ± 0.499
GlnCys: 0.29 ± 0.141
GlnAsp: 2.175 ± 0.337
GlnGlu: 3.698 ± 0.591
GlnPhe: 1.16 ± 0.289
GlnGly: 2.175 ± 0.357
GlnHis: 0.508 ± 0.201
GlnIle: 2.248 ± 0.398
GlnLys: 3.698 ± 0.665
GlnLeu: 2.755 ± 0.509
GlnMet: 1.16 ± 0.374
GlnAsn: 1.74 ± 0.363
GlnPro: 1.088 ± 0.349
GlnGln: 1.523 ± 0.359
GlnArg: 1.813 ± 0.349
GlnSer: 1.885 ± 0.431
GlnThr: 1.595 ± 0.266
GlnVal: 1.813 ± 0.396
GlnTrp: 0.653 ± 0.253
GlnTyr: 1.885 ± 0.274
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.393 ± 0.553
ArgCys: 0.145 ± 0.095
ArgAsp: 3.263 ± 0.414
ArgGlu: 4.06 ± 0.547
ArgPhe: 2.03 ± 0.297
ArgGly: 2.32 ± 0.284
ArgHis: 0.943 ± 0.295
ArgIle: 3.19 ± 0.488
ArgLys: 5.148 ± 0.593
ArgLeu: 4.35 ± 0.585
ArgMet: 1.885 ± 0.279
ArgAsn: 2.683 ± 0.378
ArgPro: 1.378 ± 0.26
ArgGln: 1.813 ± 0.401
ArgArg: 1.668 ± 0.37
ArgSer: 1.74 ± 0.343
ArgThr: 2.61 ± 0.424
ArgVal: 2.828 ± 0.438
ArgTrp: 0.435 ± 0.178
ArgTyr: 1.958 ± 0.342
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.335 ± 0.366
SerCys: 0.218 ± 0.115
SerAsp: 3.48 ± 0.531
SerGlu: 4.568 ± 0.545
SerPhe: 2.465 ± 0.292
SerGly: 5.438 ± 0.616
SerHis: 0.725 ± 0.213
SerIle: 3.915 ± 0.555
SerLys: 4.35 ± 0.581
SerLeu: 4.64 ± 0.486
SerMet: 1.74 ± 0.298
SerAsn: 2.32 ± 0.419
SerPro: 1.45 ± 0.238
SerGln: 2.175 ± 0.425
SerArg: 2.465 ± 0.477
SerSer: 3.625 ± 0.454
SerThr: 2.465 ± 0.41
SerVal: 2.393 ± 0.46
SerTrp: 0.653 ± 0.239
SerTyr: 2.61 ± 0.412
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.538 ± 0.569
ThrCys: 0.725 ± 0.208
ThrAsp: 3.045 ± 0.468
ThrGlu: 2.9 ± 0.437
ThrPhe: 2.03 ± 0.301
ThrGly: 3.625 ± 0.425
ThrHis: 0.58 ± 0.207
ThrIle: 3.843 ± 0.685
ThrLys: 3.988 ± 0.571
ThrLeu: 4.713 ± 0.509
ThrMet: 0.943 ± 0.232
ThrAsn: 1.305 ± 0.321
ThrPro: 2.103 ± 0.39
ThrGln: 1.16 ± 0.262
ThrArg: 1.813 ± 0.395
ThrSer: 3.263 ± 0.422
ThrThr: 3.118 ± 0.48
ThrVal: 2.828 ± 0.422
ThrTrp: 0.653 ± 0.26
ThrTyr: 2.683 ± 0.367
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.06 ± 0.526
ValCys: 1.233 ± 0.333
ValAsp: 3.335 ± 0.498
ValGlu: 5.51 ± 0.773
ValPhe: 1.523 ± 0.338
ValGly: 3.118 ± 0.561
ValHis: 1.088 ± 0.328
ValIle: 3.988 ± 0.631
ValLys: 5.148 ± 0.681
ValLeu: 5.293 ± 0.578
ValMet: 1.378 ± 0.262
ValAsn: 3.698 ± 0.518
ValPro: 1.813 ± 0.412
ValGln: 3.045 ± 0.51
ValArg: 2.248 ± 0.457
ValSer: 3.625 ± 0.497
ValThr: 3.19 ± 0.49
ValVal: 3.843 ± 0.612
ValTrp: 0.653 ± 0.23
ValTyr: 3.118 ± 0.505
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.653 ± 0.229
TrpCys: 0.145 ± 0.121
TrpAsp: 1.015 ± 0.283
TrpGlu: 1.015 ± 0.228
TrpPhe: 0.218 ± 0.151
TrpGly: 0.653 ± 0.194
TrpHis: 0.145 ± 0.119
TrpIle: 0.653 ± 0.232
TrpLys: 1.233 ± 0.44
TrpLeu: 0.943 ± 0.251
TrpMet: 0.363 ± 0.19
TrpAsn: 0.508 ± 0.222
TrpPro: 0.0 ± 0.0
TrpGln: 0.508 ± 0.205
TrpArg: 0.943 ± 0.263
TrpSer: 0.653 ± 0.23
TrpThr: 0.508 ± 0.169
TrpVal: 0.725 ± 0.211
TrpTrp: 0.145 ± 0.105
TrpTyr: 0.653 ± 0.21
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.9 ± 0.457
TyrCys: 0.508 ± 0.246
TyrAsp: 3.263 ± 0.57
TyrGlu: 4.423 ± 0.62
TyrPhe: 2.103 ± 0.338
TyrGly: 3.553 ± 0.469
TyrHis: 0.725 ± 0.291
TyrIle: 2.9 ± 0.467
TyrLys: 4.35 ± 0.516
TyrLeu: 3.625 ± 0.496
TyrMet: 1.523 ± 0.244
TyrAsn: 2.03 ± 0.503
TyrPro: 1.305 ± 0.305
TyrGln: 1.595 ± 0.376
TyrArg: 2.175 ± 0.439
TyrSer: 2.393 ± 0.367
TyrThr: 2.32 ± 0.43
TyrVal: 2.393 ± 0.587
TyrTrp: 0.29 ± 0.141
TyrTyr: 2.248 ± 0.457
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (13794 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
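To illustrate what protein-level bootstrapping means, here is a minimal sketch: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the fixed seed, the toy sequences, and the use of the standard deviation as the error measure are assumptions for the example; the page does not specify which statistic the ± values represent.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over all adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over n_rounds protein-level resamples.

    Assumption: the error is taken as the sample standard deviation
    of the resampled estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


toy_proteins = ["MKLAEEL", "MAEELKK", "GGSGGS"]  # made-up sequences
ae_freq = dipeptide_freq(toy_proteins, "AE")
ae_error = bootstrap_error(toy_proteins, "AE")
```

Resampling whole proteins rather than individual residues keeps the pairs within each sequence intact, so the error reflects variation between proteins.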

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski