Amino acid dipepetide frequency for EBPR podovirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
21.749AlaAla: 21.749 ± 2.631
1.601AlaCys: 1.601 ± 0.337
8.756AlaAsp: 8.756 ± 1.003
11.581AlaGlu: 11.581 ± 2.066
3.201AlaPhe: 3.201 ± 0.569
10.828AlaGly: 10.828 ± 1.568
1.789AlaHis: 1.789 ± 0.383
6.685AlaIle: 6.685 ± 0.753
6.214AlaLys: 6.214 ± 1.026
10.733AlaLeu: 10.733 ± 1.164
3.86AlaMet: 3.86 ± 0.612
3.484AlaAsn: 3.484 ± 0.568
4.425AlaPro: 4.425 ± 0.807
5.367AlaGln: 5.367 ± 0.628
9.039AlaArg: 9.039 ± 1.051
7.626AlaSer: 7.626 ± 0.732
6.873AlaThr: 6.873 ± 1.071
8.756AlaVal: 8.756 ± 1.095
1.318AlaTrp: 1.318 ± 0.385
2.636AlaTyr: 2.636 ± 0.559
0.0AlaXaa: 0.0 ± 0.0
Cys
1.601CysAla: 1.601 ± 0.353
0.094CysCys: 0.094 ± 0.081
0.942CysAsp: 0.942 ± 0.306
1.318CysGlu: 1.318 ± 0.366
0.188CysPhe: 0.188 ± 0.135
1.506CysGly: 1.506 ± 0.375
0.282CysHis: 0.282 ± 0.207
0.565CysIle: 0.565 ± 0.206
0.565CysLys: 0.565 ± 0.236
0.847CysLeu: 0.847 ± 0.242
0.188CysMet: 0.188 ± 0.139
0.471CysAsn: 0.471 ± 0.232
0.847CysPro: 0.847 ± 0.222
0.282CysGln: 0.282 ± 0.159
1.13CysArg: 1.13 ± 0.361
0.659CysSer: 0.659 ± 0.213
0.565CysThr: 0.565 ± 0.238
0.377CysVal: 0.377 ± 0.195
0.188CysTrp: 0.188 ± 0.143
0.282CysTyr: 0.282 ± 0.158
0.0CysXaa: 0.0 ± 0.0
Asp
7.626AspAla: 7.626 ± 0.776
0.565AspCys: 0.565 ± 0.208
4.708AspAsp: 4.708 ± 0.755
4.708AspGlu: 4.708 ± 0.512
2.071AspPhe: 2.071 ± 0.44
5.084AspGly: 5.084 ± 0.523
0.753AspHis: 0.753 ± 0.256
2.73AspIle: 2.73 ± 0.498
2.354AspLys: 2.354 ± 0.559
3.578AspLeu: 3.578 ± 0.433
1.506AspMet: 1.506 ± 0.356
1.412AspAsn: 1.412 ± 0.361
2.071AspPro: 2.071 ± 0.379
2.26AspGln: 2.26 ± 0.444
3.672AspArg: 3.672 ± 0.547
2.919AspSer: 2.919 ± 0.42
2.354AspThr: 2.354 ± 0.501
3.201AspVal: 3.201 ± 0.586
1.13AspTrp: 1.13 ± 0.373
1.883AspTyr: 1.883 ± 0.405
0.0AspXaa: 0.0 ± 0.0
Glu
9.792GluAla: 9.792 ± 1.817
0.565GluCys: 0.565 ± 0.212
1.977GluAsp: 1.977 ± 0.401
3.39GluGlu: 3.39 ± 0.826
1.789GluPhe: 1.789 ± 0.513
3.39GluGly: 3.39 ± 0.604
1.036GluHis: 1.036 ± 0.28
4.237GluIle: 4.237 ± 0.694
3.672GluLys: 3.672 ± 0.693
6.497GluLeu: 6.497 ± 0.943
2.448GluMet: 2.448 ± 0.532
1.883GluAsn: 1.883 ± 0.42
2.354GluPro: 2.354 ± 0.336
4.237GluGln: 4.237 ± 0.77
5.932GluArg: 5.932 ± 1.036
4.519GluSer: 4.519 ± 0.635
3.39GluThr: 3.39 ± 0.446
3.013GluVal: 3.013 ± 0.492
1.036GluTrp: 1.036 ± 0.326
2.73GluTyr: 2.73 ± 0.444
0.0GluXaa: 0.0 ± 0.0
Phe
3.484PheAla: 3.484 ± 0.706
0.471PheCys: 0.471 ± 0.172
2.636PheAsp: 2.636 ± 0.419
2.448PheGlu: 2.448 ± 0.443
1.412PhePhe: 1.412 ± 0.432
2.919PheGly: 2.919 ± 0.452
0.753PheHis: 0.753 ± 0.253
1.695PheIle: 1.695 ± 0.4
1.036PheLys: 1.036 ± 0.226
1.224PheLeu: 1.224 ± 0.349
0.753PheMet: 0.753 ± 0.252
0.847PheAsn: 0.847 ± 0.297
1.036PhePro: 1.036 ± 0.342
0.847PheGln: 0.847 ± 0.259
1.601PheArg: 1.601 ± 0.308
1.506PheSer: 1.506 ± 0.408
1.318PheThr: 1.318 ± 0.395
2.542PheVal: 2.542 ± 0.491
0.659PheTrp: 0.659 ± 0.244
0.565PheTyr: 0.565 ± 0.271
0.0PheXaa: 0.0 ± 0.0
Gly
10.263GlyAla: 10.263 ± 1.29
1.695GlyCys: 1.695 ± 0.37
4.708GlyAsp: 4.708 ± 0.693
4.802GlyGlu: 4.802 ± 0.509
3.013GlyPhe: 3.013 ± 0.562
7.721GlyGly: 7.721 ± 1.342
0.942GlyHis: 0.942 ± 0.212
3.201GlyIle: 3.201 ± 0.458
3.484GlyLys: 3.484 ± 0.587
6.967GlyLeu: 6.967 ± 0.892
1.789GlyMet: 1.789 ± 0.44
2.542GlyAsn: 2.542 ± 0.582
2.542GlyPro: 2.542 ± 0.5
3.578GlyGln: 3.578 ± 0.676
5.837GlyArg: 5.837 ± 0.873
5.555GlySer: 5.555 ± 0.845
4.237GlyThr: 4.237 ± 0.665
4.99GlyVal: 4.99 ± 0.584
1.506GlyTrp: 1.506 ± 0.399
2.354GlyTyr: 2.354 ± 0.616
0.0GlyXaa: 0.0 ± 0.0
His
2.542HisAla: 2.542 ± 0.544
0.0HisCys: 0.0 ± 0.0
1.13HisAsp: 1.13 ± 0.325
1.318HisGlu: 1.318 ± 0.295
0.188HisPhe: 0.188 ± 0.112
1.318HisGly: 1.318 ± 0.349
0.094HisHis: 0.094 ± 0.081
0.942HisIle: 0.942 ± 0.272
0.659HisLys: 0.659 ± 0.203
1.318HisLeu: 1.318 ± 0.417
0.188HisMet: 0.188 ± 0.162
0.471HisAsn: 0.471 ± 0.211
0.565HisPro: 0.565 ± 0.293
0.565HisGln: 0.565 ± 0.259
1.036HisArg: 1.036 ± 0.331
0.377HisSer: 0.377 ± 0.15
0.942HisThr: 0.942 ± 0.264
1.224HisVal: 1.224 ± 0.316
0.377HisTrp: 0.377 ± 0.178
0.188HisTyr: 0.188 ± 0.128
0.0HisXaa: 0.0 ± 0.0
Ile
6.967IleAla: 6.967 ± 0.775
0.659IleCys: 0.659 ± 0.247
2.448IleAsp: 2.448 ± 0.312
3.766IleGlu: 3.766 ± 0.593
1.601IlePhe: 1.601 ± 0.365
4.237IleGly: 4.237 ± 0.681
0.565IleHis: 0.565 ± 0.245
1.036IleIle: 1.036 ± 0.291
2.166IleLys: 2.166 ± 0.388
2.26IleLeu: 2.26 ± 0.361
0.471IleMet: 0.471 ± 0.206
2.26IleAsn: 2.26 ± 0.435
2.542IlePro: 2.542 ± 0.428
1.13IleGln: 1.13 ± 0.329
2.919IleArg: 2.919 ± 0.441
2.825IleSer: 2.825 ± 0.506
3.201IleThr: 3.201 ± 0.615
2.825IleVal: 2.825 ± 0.558
0.565IleTrp: 0.565 ± 0.221
1.224IleTyr: 1.224 ± 0.389
0.0IleXaa: 0.0 ± 0.0
Lys
6.214LysAla: 6.214 ± 1.165
0.471LysCys: 0.471 ± 0.22
1.506LysAsp: 1.506 ± 0.461
3.578LysGlu: 3.578 ± 0.774
1.412LysPhe: 1.412 ± 0.335
2.071LysGly: 2.071 ± 0.401
0.942LysHis: 0.942 ± 0.351
1.412LysIle: 1.412 ± 0.336
2.354LysLys: 2.354 ± 0.601
4.143LysLeu: 4.143 ± 0.558
1.224LysMet: 1.224 ± 0.368
1.318LysAsn: 1.318 ± 0.311
1.883LysPro: 1.883 ± 0.496
1.883LysGln: 1.883 ± 0.402
3.86LysArg: 3.86 ± 0.697
3.39LysSer: 3.39 ± 0.587
3.107LysThr: 3.107 ± 0.598
3.013LysVal: 3.013 ± 0.553
0.847LysTrp: 0.847 ± 0.275
0.942LysTyr: 0.942 ± 0.24
0.0LysXaa: 0.0 ± 0.0
Leu
11.016LeuAla: 11.016 ± 1.199
1.036LeuCys: 1.036 ± 0.371
3.86LeuAsp: 3.86 ± 0.576
4.237LeuGlu: 4.237 ± 0.676
2.26LeuPhe: 2.26 ± 0.507
5.461LeuGly: 5.461 ± 0.667
0.942LeuHis: 0.942 ± 0.302
3.201LeuIle: 3.201 ± 0.494
3.484LeuLys: 3.484 ± 0.476
5.273LeuLeu: 5.273 ± 0.9
1.318LeuMet: 1.318 ± 0.32
2.919LeuAsn: 2.919 ± 0.491
3.578LeuPro: 3.578 ± 0.495
4.237LeuGln: 4.237 ± 0.658
6.402LeuArg: 6.402 ± 0.814
5.555LeuSer: 5.555 ± 0.781
3.954LeuThr: 3.954 ± 0.7
4.425LeuVal: 4.425 ± 0.66
0.942LeuTrp: 0.942 ± 0.316
1.412LeuTyr: 1.412 ± 0.391
0.0LeuXaa: 0.0 ± 0.0
Met
2.919MetAla: 2.919 ± 0.524
0.471MetCys: 0.471 ± 0.204
1.695MetAsp: 1.695 ± 0.34
1.036MetGlu: 1.036 ± 0.34
0.659MetPhe: 0.659 ± 0.244
1.224MetGly: 1.224 ± 0.428
0.377MetHis: 0.377 ± 0.155
0.753MetIle: 0.753 ± 0.22
1.036MetLys: 1.036 ± 0.316
1.695MetLeu: 1.695 ± 0.358
1.036MetMet: 1.036 ± 0.351
1.224MetAsn: 1.224 ± 0.324
1.036MetPro: 1.036 ± 0.304
1.506MetGln: 1.506 ± 0.359
2.542MetArg: 2.542 ± 0.471
1.789MetSer: 1.789 ± 0.347
1.789MetThr: 1.789 ± 0.325
1.13MetVal: 1.13 ± 0.352
0.188MetTrp: 0.188 ± 0.117
0.377MetTyr: 0.377 ± 0.238
0.0MetXaa: 0.0 ± 0.0
Asn
4.614AsnAla: 4.614 ± 0.747
0.659AsnCys: 0.659 ± 0.27
2.166AsnAsp: 2.166 ± 0.444
2.166AsnGlu: 2.166 ± 0.481
0.847AsnPhe: 0.847 ± 0.212
3.39AsnGly: 3.39 ± 0.605
0.377AsnHis: 0.377 ± 0.198
1.318AsnIle: 1.318 ± 0.316
0.753AsnLys: 0.753 ± 0.239
1.977AsnLeu: 1.977 ± 0.497
0.942AsnMet: 0.942 ± 0.268
0.942AsnAsn: 0.942 ± 0.259
1.412AsnPro: 1.412 ± 0.461
1.601AsnGln: 1.601 ± 0.428
2.448AsnArg: 2.448 ± 0.518
2.448AsnSer: 2.448 ± 0.602
1.318AsnThr: 1.318 ± 0.31
1.601AsnVal: 1.601 ± 0.441
0.659AsnTrp: 0.659 ± 0.238
0.659AsnTyr: 0.659 ± 0.201
0.0AsnXaa: 0.0 ± 0.0
Pro
5.932ProAla: 5.932 ± 0.923
0.847ProCys: 0.847 ± 0.291
2.825ProAsp: 2.825 ± 0.377
2.919ProGlu: 2.919 ± 0.477
1.318ProPhe: 1.318 ± 0.319
3.954ProGly: 3.954 ± 0.674
0.377ProHis: 0.377 ± 0.187
1.977ProIle: 1.977 ± 0.372
1.601ProLys: 1.601 ± 0.326
3.86ProLeu: 3.86 ± 0.585
0.847ProMet: 0.847 ± 0.258
1.506ProAsn: 1.506 ± 0.385
2.542ProPro: 2.542 ± 0.733
1.977ProGln: 1.977 ± 0.395
1.789ProArg: 1.789 ± 0.382
3.201ProSer: 3.201 ± 0.438
2.071ProThr: 2.071 ± 0.526
2.636ProVal: 2.636 ± 0.541
0.282ProTrp: 0.282 ± 0.154
0.847ProTyr: 0.847 ± 0.306
0.0ProXaa: 0.0 ± 0.0
Gln
6.214GlnAla: 6.214 ± 0.686
0.188GlnCys: 0.188 ± 0.145
1.412GlnAsp: 1.412 ± 0.55
2.071GlnGlu: 2.071 ± 0.479
1.318GlnPhe: 1.318 ± 0.33
3.484GlnGly: 3.484 ± 0.518
1.13GlnHis: 1.13 ± 0.282
2.354GlnIle: 2.354 ± 0.319
2.354GlnLys: 2.354 ± 0.461
2.636GlnLeu: 2.636 ± 0.515
1.318GlnMet: 1.318 ± 0.418
1.789GlnAsn: 1.789 ± 0.313
2.071GlnPro: 2.071 ± 0.418
3.484GlnGln: 3.484 ± 0.635
3.295GlnArg: 3.295 ± 0.483
2.919GlnSer: 2.919 ± 0.483
2.354GlnThr: 2.354 ± 0.424
1.506GlnVal: 1.506 ± 0.371
1.13GlnTrp: 1.13 ± 0.247
0.942GlnTyr: 0.942 ± 0.325
0.0GlnXaa: 0.0 ± 0.0
Arg
8.945ArgAla: 8.945 ± 1.267
1.318ArgCys: 1.318 ± 0.387
4.143ArgAsp: 4.143 ± 0.69
5.273ArgGlu: 5.273 ± 0.992
2.636ArgPhe: 2.636 ± 0.472
4.237ArgGly: 4.237 ± 0.711
0.942ArgHis: 0.942 ± 0.296
4.049ArgIle: 4.049 ± 0.485
4.614ArgLys: 4.614 ± 0.736
5.743ArgLeu: 5.743 ± 0.65
2.071ArgMet: 2.071 ± 0.51
1.224ArgAsn: 1.224 ± 0.345
1.977ArgPro: 1.977 ± 0.458
3.013ArgGln: 3.013 ± 0.582
6.12ArgArg: 6.12 ± 0.785
4.049ArgSer: 4.049 ± 0.524
3.295ArgThr: 3.295 ± 0.597
4.425ArgVal: 4.425 ± 0.494
1.13ArgTrp: 1.13 ± 0.3
1.412ArgTyr: 1.412 ± 0.329
0.0ArgXaa: 0.0 ± 0.0
Ser
7.815SerAla: 7.815 ± 0.763
0.471SerCys: 0.471 ± 0.191
3.672SerAsp: 3.672 ± 0.606
3.672SerGlu: 3.672 ± 0.681
2.354SerPhe: 2.354 ± 0.385
6.779SerGly: 6.779 ± 1.002
0.847SerHis: 0.847 ± 0.227
2.26SerIle: 2.26 ± 0.497
2.166SerLys: 2.166 ± 0.453
4.802SerLeu: 4.802 ± 0.838
0.847SerMet: 0.847 ± 0.283
1.601SerAsn: 1.601 ± 0.423
3.484SerPro: 3.484 ± 0.771
2.354SerGln: 2.354 ± 0.446
4.425SerArg: 4.425 ± 0.612
4.425SerSer: 4.425 ± 0.735
4.237SerThr: 4.237 ± 0.698
4.708SerVal: 4.708 ± 0.649
0.942SerTrp: 0.942 ± 0.313
1.224SerTyr: 1.224 ± 0.298
0.0SerXaa: 0.0 ± 0.0
Thr
6.685ThrAla: 6.685 ± 1.045
0.282ThrCys: 0.282 ± 0.158
2.26ThrAsp: 2.26 ± 0.438
3.766ThrGlu: 3.766 ± 0.538
1.318ThrPhe: 1.318 ± 0.294
6.591ThrGly: 6.591 ± 0.99
1.13ThrHis: 1.13 ± 0.314
3.013ThrIle: 3.013 ± 0.615
2.071ThrLys: 2.071 ± 0.537
4.237ThrLeu: 4.237 ± 0.868
0.565ThrMet: 0.565 ± 0.257
1.977ThrAsn: 1.977 ± 0.434
3.013ThrPro: 3.013 ± 0.557
1.977ThrGln: 1.977 ± 0.423
2.919ThrArg: 2.919 ± 0.618
3.672ThrSer: 3.672 ± 0.7
3.39ThrThr: 3.39 ± 0.569
4.049ThrVal: 4.049 ± 0.698
1.13ThrTrp: 1.13 ± 0.358
1.224ThrTyr: 1.224 ± 0.313
0.0ThrXaa: 0.0 ± 0.0
Val
7.909ValAla: 7.909 ± 0.761
0.565ValCys: 0.565 ± 0.211
3.201ValAsp: 3.201 ± 0.524
3.766ValGlu: 3.766 ± 0.693
1.412ValPhe: 1.412 ± 0.402
4.425ValGly: 4.425 ± 0.54
1.789ValHis: 1.789 ± 0.439
3.107ValIle: 3.107 ± 0.506
2.354ValLys: 2.354 ± 0.542
4.049ValLeu: 4.049 ± 0.67
1.883ValMet: 1.883 ± 0.39
2.166ValAsn: 2.166 ± 0.47
3.672ValPro: 3.672 ± 0.667
1.883ValGln: 1.883 ± 0.332
3.295ValArg: 3.295 ± 0.717
3.672ValSer: 3.672 ± 0.456
4.99ValThr: 4.99 ± 0.992
3.766ValVal: 3.766 ± 0.639
0.659ValTrp: 0.659 ± 0.224
1.318ValTyr: 1.318 ± 0.337
0.0ValXaa: 0.0 ± 0.0
Trp
2.354TrpAla: 2.354 ± 0.442
0.565TrpCys: 0.565 ± 0.236
0.753TrpAsp: 0.753 ± 0.267
0.282TrpGlu: 0.282 ± 0.162
0.094TrpPhe: 0.094 ± 0.073
0.753TrpGly: 0.753 ± 0.208
0.094TrpHis: 0.094 ± 0.086
0.847TrpIle: 0.847 ± 0.31
1.412TrpLys: 1.412 ± 0.386
1.036TrpLeu: 1.036 ± 0.328
0.471TrpMet: 0.471 ± 0.175
1.318TrpAsn: 1.318 ± 0.436
0.753TrpPro: 0.753 ± 0.318
0.753TrpGln: 0.753 ± 0.268
0.753TrpArg: 0.753 ± 0.287
1.13TrpSer: 1.13 ± 0.422
0.847TrpThr: 0.847 ± 0.278
0.659TrpVal: 0.659 ± 0.235
0.282TrpTrp: 0.282 ± 0.154
0.565TrpTyr: 0.565 ± 0.214
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.883TyrAla: 1.883 ± 0.407
0.377TyrCys: 0.377 ± 0.175
1.789TyrAsp: 1.789 ± 0.57
1.789TyrGlu: 1.789 ± 0.366
0.659TyrPhe: 0.659 ± 0.23
2.354TyrGly: 2.354 ± 0.491
0.282TyrHis: 0.282 ± 0.159
0.282TyrIle: 0.282 ± 0.147
1.318TyrLys: 1.318 ± 0.278
2.636TyrLeu: 2.636 ± 0.632
0.659TyrMet: 0.659 ± 0.274
0.942TyrAsn: 0.942 ± 0.336
1.506TyrPro: 1.506 ± 0.487
1.036TyrGln: 1.036 ± 0.309
1.601TyrArg: 1.601 ± 0.366
0.847TyrSer: 0.847 ± 0.294
1.036TyrThr: 1.036 ± 0.431
1.036TyrVal: 1.036 ± 0.323
0.753TyrTrp: 0.753 ± 0.225
1.036TyrTyr: 1.036 ± 0.263
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (10622 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski