Amino acid dipepetide frequency for Podoviridae sp. ctdb7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.739AlaAla: 16.739 ± 1.572
1.428AlaCys: 1.428 ± 0.437
7.299AlaAsp: 7.299 ± 0.844
8.727AlaGlu: 8.727 ± 1.104
3.332AlaPhe: 3.332 ± 0.442
8.171AlaGly: 8.171 ± 0.721
2.142AlaHis: 2.142 ± 0.454
4.998AlaIle: 4.998 ± 0.648
6.109AlaLys: 6.109 ± 0.818
10.71AlaLeu: 10.71 ± 1.326
3.173AlaMet: 3.173 ± 0.573
4.205AlaAsn: 4.205 ± 0.668
4.205AlaPro: 4.205 ± 0.644
6.823AlaGln: 6.823 ± 1.018
7.219AlaArg: 7.219 ± 1.116
6.505AlaSer: 6.505 ± 1.02
6.505AlaThr: 6.505 ± 0.952
5.474AlaVal: 5.474 ± 0.677
1.825AlaTrp: 1.825 ± 0.45
2.142AlaTyr: 2.142 ± 0.445
0.0AlaXaa: 0.0 ± 0.0
Cys
1.349CysAla: 1.349 ± 0.319
0.079CysCys: 0.079 ± 0.096
0.555CysAsp: 0.555 ± 0.205
0.397CysGlu: 0.397 ± 0.167
0.397CysPhe: 0.397 ± 0.164
1.349CysGly: 1.349 ± 0.425
0.317CysHis: 0.317 ± 0.153
0.238CysIle: 0.238 ± 0.129
0.317CysLys: 0.317 ± 0.199
0.635CysLeu: 0.635 ± 0.277
0.317CysMet: 0.317 ± 0.156
0.317CysAsn: 0.317 ± 0.151
0.714CysPro: 0.714 ± 0.23
0.635CysGln: 0.635 ± 0.257
1.031CysArg: 1.031 ± 0.313
1.031CysSer: 1.031 ± 0.335
0.476CysThr: 0.476 ± 0.184
0.714CysVal: 0.714 ± 0.242
0.476CysTrp: 0.476 ± 0.193
0.238CysTyr: 0.238 ± 0.127
0.0CysXaa: 0.0 ± 0.0
Asp
5.95AspAla: 5.95 ± 0.614
0.873AspCys: 0.873 ± 0.258
3.808AspAsp: 3.808 ± 0.648
4.681AspGlu: 4.681 ± 0.649
1.507AspPhe: 1.507 ± 0.389
5.236AspGly: 5.236 ± 0.694
1.349AspHis: 1.349 ± 0.368
2.539AspIle: 2.539 ± 0.406
1.983AspLys: 1.983 ± 0.4
6.743AspLeu: 6.743 ± 0.761
1.904AspMet: 1.904 ± 0.365
2.777AspAsn: 2.777 ± 0.462
2.697AspPro: 2.697 ± 0.385
2.618AspGln: 2.618 ± 0.686
3.411AspArg: 3.411 ± 0.496
2.777AspSer: 2.777 ± 0.517
2.697AspThr: 2.697 ± 0.494
3.253AspVal: 3.253 ± 0.541
1.269AspTrp: 1.269 ± 0.3
1.507AspTyr: 1.507 ± 0.359
0.0AspXaa: 0.0 ± 0.0
Glu
7.616GluAla: 7.616 ± 1.093
0.635GluCys: 0.635 ± 0.216
3.253GluAsp: 3.253 ± 0.477
2.856GluGlu: 2.856 ± 0.476
1.349GluPhe: 1.349 ± 0.275
4.205GluGly: 4.205 ± 0.567
1.745GluHis: 1.745 ± 0.318
3.808GluIle: 3.808 ± 0.587
3.332GluLys: 3.332 ± 0.554
5.474GluLeu: 5.474 ± 0.755
1.983GluMet: 1.983 ± 0.343
1.904GluAsn: 1.904 ± 0.334
2.221GluPro: 2.221 ± 0.432
4.125GluGln: 4.125 ± 0.657
6.029GluArg: 6.029 ± 1.043
3.332GluSer: 3.332 ± 0.45
2.221GluThr: 2.221 ± 0.388
3.411GluVal: 3.411 ± 0.531
1.031GluTrp: 1.031 ± 0.299
2.221GluTyr: 2.221 ± 0.37
0.0GluXaa: 0.0 ± 0.0
Phe
3.729PheAla: 3.729 ± 0.474
0.159PheCys: 0.159 ± 0.101
1.825PheAsp: 1.825 ± 0.391
2.142PheGlu: 2.142 ± 0.358
1.269PhePhe: 1.269 ± 0.335
3.332PheGly: 3.332 ± 0.5
0.714PheHis: 0.714 ± 0.2
1.19PheIle: 1.19 ± 0.288
1.111PheLys: 1.111 ± 0.309
1.428PheLeu: 1.428 ± 0.469
0.873PheMet: 0.873 ± 0.241
1.19PheAsn: 1.19 ± 0.366
1.269PhePro: 1.269 ± 0.404
1.349PheGln: 1.349 ± 0.363
1.745PheArg: 1.745 ± 0.435
2.142PheSer: 2.142 ± 0.356
1.587PheThr: 1.587 ± 0.446
1.904PheVal: 1.904 ± 0.339
0.397PheTrp: 0.397 ± 0.198
0.873PheTyr: 0.873 ± 0.269
0.0PheXaa: 0.0 ± 0.0
Gly
6.902GlyAla: 6.902 ± 0.699
0.793GlyCys: 0.793 ± 0.257
5.157GlyAsp: 5.157 ± 0.857
6.029GlyGlu: 6.029 ± 0.712
3.411GlyPhe: 3.411 ± 0.601
7.378GlyGly: 7.378 ± 0.914
1.587GlyHis: 1.587 ± 0.37
3.57GlyIle: 3.57 ± 0.528
3.808GlyLys: 3.808 ± 0.617
6.981GlyLeu: 6.981 ± 0.702
1.983GlyMet: 1.983 ± 0.44
2.856GlyAsn: 2.856 ± 0.524
1.587GlyPro: 1.587 ± 0.333
4.046GlyGln: 4.046 ± 0.503
6.664GlyArg: 6.664 ± 0.788
4.125GlySer: 4.125 ± 0.583
3.808GlyThr: 3.808 ± 0.5
5.712GlyVal: 5.712 ± 0.642
1.587GlyTrp: 1.587 ± 0.35
2.301GlyTyr: 2.301 ± 0.424
0.0GlyXaa: 0.0 ± 0.0
His
1.825HisAla: 1.825 ± 0.522
0.476HisCys: 0.476 ± 0.192
0.952HisAsp: 0.952 ± 0.26
1.19HisGlu: 1.19 ± 0.327
0.555HisPhe: 0.555 ± 0.17
1.428HisGly: 1.428 ± 0.314
0.397HisHis: 0.397 ± 0.176
0.873HisIle: 0.873 ± 0.277
0.793HisLys: 0.793 ± 0.254
2.142HisLeu: 2.142 ± 0.526
0.317HisMet: 0.317 ± 0.15
1.111HisAsn: 1.111 ± 0.286
1.349HisPro: 1.349 ± 0.455
0.397HisGln: 0.397 ± 0.222
1.666HisArg: 1.666 ± 0.372
1.111HisSer: 1.111 ± 0.312
0.793HisThr: 0.793 ± 0.264
0.714HisVal: 0.714 ± 0.268
0.159HisTrp: 0.159 ± 0.112
0.714HisTyr: 0.714 ± 0.222
0.0HisXaa: 0.0 ± 0.0
Ile
5.236IleAla: 5.236 ± 0.704
0.476IleCys: 0.476 ± 0.205
3.411IleAsp: 3.411 ± 0.468
3.173IleGlu: 3.173 ± 0.482
1.031IlePhe: 1.031 ± 0.372
3.808IleGly: 3.808 ± 0.519
0.397IleHis: 0.397 ± 0.183
1.19IleIle: 1.19 ± 0.24
2.459IleLys: 2.459 ± 0.367
2.618IleLeu: 2.618 ± 0.454
1.269IleMet: 1.269 ± 0.256
2.221IleAsn: 2.221 ± 0.396
2.221IlePro: 2.221 ± 0.392
1.745IleGln: 1.745 ± 0.416
2.935IleArg: 2.935 ± 0.442
2.618IleSer: 2.618 ± 0.359
3.411IleThr: 3.411 ± 0.601
2.142IleVal: 2.142 ± 0.486
0.793IleTrp: 0.793 ± 0.248
1.031IleTyr: 1.031 ± 0.319
0.0IleXaa: 0.0 ± 0.0
Lys
4.919LysAla: 4.919 ± 0.765
0.635LysCys: 0.635 ± 0.213
2.618LysAsp: 2.618 ± 0.411
2.301LysGlu: 2.301 ± 0.469
0.952LysPhe: 0.952 ± 0.273
2.697LysGly: 2.697 ± 0.519
1.031LysHis: 1.031 ± 0.263
2.221LysIle: 2.221 ± 0.442
1.031LysLys: 1.031 ± 0.306
4.522LysLeu: 4.522 ± 0.66
1.111LysMet: 1.111 ± 0.314
1.507LysAsn: 1.507 ± 0.408
1.825LysPro: 1.825 ± 0.305
1.349LysGln: 1.349 ± 0.411
2.935LysArg: 2.935 ± 0.472
2.063LysSer: 2.063 ± 0.325
2.697LysThr: 2.697 ± 0.343
3.094LysVal: 3.094 ± 0.432
0.714LysTrp: 0.714 ± 0.235
0.952LysTyr: 0.952 ± 0.234
0.0LysXaa: 0.0 ± 0.0
Leu
10.234LeuAla: 10.234 ± 1.112
0.793LeuCys: 0.793 ± 0.262
4.601LeuAsp: 4.601 ± 0.723
4.363LeuGlu: 4.363 ± 0.739
2.221LeuPhe: 2.221 ± 0.387
7.061LeuGly: 7.061 ± 0.905
1.19LeuHis: 1.19 ± 0.287
3.649LeuIle: 3.649 ± 0.419
4.205LeuLys: 4.205 ± 0.585
8.647LeuLeu: 8.647 ± 0.954
1.983LeuMet: 1.983 ± 0.454
2.539LeuAsn: 2.539 ± 0.458
4.76LeuPro: 4.76 ± 0.582
2.935LeuGln: 2.935 ± 0.471
6.981LeuArg: 6.981 ± 0.959
5.553LeuSer: 5.553 ± 0.857
4.839LeuThr: 4.839 ± 1.08
4.998LeuVal: 4.998 ± 0.546
1.19LeuTrp: 1.19 ± 0.33
1.745LeuTyr: 1.745 ± 0.342
0.0LeuXaa: 0.0 ± 0.0
Met
3.887MetAla: 3.887 ± 0.498
0.159MetCys: 0.159 ± 0.131
1.19MetAsp: 1.19 ± 0.305
0.952MetGlu: 0.952 ± 0.269
0.397MetPhe: 0.397 ± 0.231
1.666MetGly: 1.666 ± 0.345
0.317MetHis: 0.317 ± 0.151
1.19MetIle: 1.19 ± 0.282
0.873MetLys: 0.873 ± 0.264
2.459MetLeu: 2.459 ± 0.465
0.793MetMet: 0.793 ± 0.267
0.952MetAsn: 0.952 ± 0.288
1.269MetPro: 1.269 ± 0.3
1.269MetGln: 1.269 ± 0.261
2.221MetArg: 2.221 ± 0.475
2.142MetSer: 2.142 ± 0.302
1.825MetThr: 1.825 ± 0.47
1.507MetVal: 1.507 ± 0.382
0.317MetTrp: 0.317 ± 0.143
0.714MetTyr: 0.714 ± 0.243
0.0MetXaa: 0.0 ± 0.0
Asn
4.522AsnAla: 4.522 ± 0.62
0.476AsnCys: 0.476 ± 0.165
1.666AsnAsp: 1.666 ± 0.421
1.428AsnGlu: 1.428 ± 0.337
1.349AsnPhe: 1.349 ± 0.284
3.015AsnGly: 3.015 ± 0.414
0.476AsnHis: 0.476 ± 0.199
1.745AsnIle: 1.745 ± 0.382
1.428AsnLys: 1.428 ± 0.414
2.063AsnLeu: 2.063 ± 0.574
0.476AsnMet: 0.476 ± 0.219
1.507AsnAsn: 1.507 ± 0.386
2.142AsnPro: 2.142 ± 0.381
1.666AsnGln: 1.666 ± 0.328
2.856AsnArg: 2.856 ± 0.381
1.507AsnSer: 1.507 ± 0.512
2.539AsnThr: 2.539 ± 0.517
2.697AsnVal: 2.697 ± 0.522
0.555AsnTrp: 0.555 ± 0.213
1.269AsnTyr: 1.269 ± 0.397
0.0AsnXaa: 0.0 ± 0.0
Pro
5.791ProAla: 5.791 ± 0.778
0.714ProCys: 0.714 ± 0.26
3.253ProAsp: 3.253 ± 0.38
3.808ProGlu: 3.808 ± 0.645
1.983ProPhe: 1.983 ± 0.474
3.57ProGly: 3.57 ± 0.569
0.714ProHis: 0.714 ± 0.233
1.745ProIle: 1.745 ± 0.333
1.349ProLys: 1.349 ± 0.395
3.57ProLeu: 3.57 ± 0.555
0.714ProMet: 0.714 ± 0.216
1.349ProAsn: 1.349 ± 0.328
1.349ProPro: 1.349 ± 0.37
1.904ProGln: 1.904 ± 0.524
2.142ProArg: 2.142 ± 0.546
2.935ProSer: 2.935 ± 0.559
3.253ProThr: 3.253 ± 0.54
2.935ProVal: 2.935 ± 0.432
0.635ProTrp: 0.635 ± 0.217
1.031ProTyr: 1.031 ± 0.343
0.0ProXaa: 0.0 ± 0.0
Gln
7.537GlnAla: 7.537 ± 1.085
0.555GlnCys: 0.555 ± 0.264
1.983GlnAsp: 1.983 ± 0.329
3.015GlnGlu: 3.015 ± 0.657
1.031GlnPhe: 1.031 ± 0.306
4.046GlnGly: 4.046 ± 0.589
0.793GlnHis: 0.793 ± 0.28
1.031GlnIle: 1.031 ± 0.378
1.745GlnLys: 1.745 ± 0.381
3.173GlnLeu: 3.173 ± 0.497
1.587GlnMet: 1.587 ± 0.399
1.19GlnAsn: 1.19 ± 0.383
2.697GlnPro: 2.697 ± 0.468
4.76GlnGln: 4.76 ± 1.263
4.681GlnArg: 4.681 ± 0.672
2.697GlnSer: 2.697 ± 0.39
2.459GlnThr: 2.459 ± 0.364
3.411GlnVal: 3.411 ± 0.507
0.635GlnTrp: 0.635 ± 0.217
1.349GlnTyr: 1.349 ± 0.342
0.0GlnXaa: 0.0 ± 0.0
Arg
7.695ArgAla: 7.695 ± 1.126
0.873ArgCys: 0.873 ± 0.284
4.919ArgAsp: 4.919 ± 0.697
4.125ArgGlu: 4.125 ± 0.748
2.38ArgPhe: 2.38 ± 0.494
5.236ArgGly: 5.236 ± 0.757
1.269ArgHis: 1.269 ± 0.25
3.729ArgIle: 3.729 ± 0.469
2.856ArgLys: 2.856 ± 0.425
6.505ArgLeu: 6.505 ± 0.97
1.904ArgMet: 1.904 ± 0.308
2.38ArgAsn: 2.38 ± 0.485
2.618ArgPro: 2.618 ± 0.466
3.967ArgGln: 3.967 ± 0.657
6.267ArgArg: 6.267 ± 1.029
3.887ArgSer: 3.887 ± 0.552
3.887ArgThr: 3.887 ± 0.541
4.998ArgVal: 4.998 ± 0.767
1.269ArgTrp: 1.269 ± 0.287
2.618ArgTyr: 2.618 ± 0.476
0.0ArgXaa: 0.0 ± 0.0
Ser
6.505SerAla: 6.505 ± 0.794
0.317SerCys: 0.317 ± 0.173
2.935SerAsp: 2.935 ± 0.56
3.253SerGlu: 3.253 ± 0.431
2.301SerPhe: 2.301 ± 0.478
4.522SerGly: 4.522 ± 0.735
0.793SerHis: 0.793 ± 0.209
2.618SerIle: 2.618 ± 0.411
1.745SerLys: 1.745 ± 0.419
4.839SerLeu: 4.839 ± 0.526
2.301SerMet: 2.301 ± 0.48
1.825SerAsn: 1.825 ± 0.37
3.411SerPro: 3.411 ± 0.482
2.221SerGln: 2.221 ± 0.407
4.046SerArg: 4.046 ± 0.501
4.205SerSer: 4.205 ± 0.594
3.967SerThr: 3.967 ± 0.797
3.808SerVal: 3.808 ± 0.519
0.952SerTrp: 0.952 ± 0.258
2.459SerTyr: 2.459 ± 0.543
0.0SerXaa: 0.0 ± 0.0
Thr
6.743ThrAla: 6.743 ± 0.767
0.793ThrCys: 0.793 ± 0.235
3.491ThrAsp: 3.491 ± 0.527
3.332ThrGlu: 3.332 ± 0.5
1.507ThrPhe: 1.507 ± 0.347
5.474ThrGly: 5.474 ± 0.927
1.111ThrHis: 1.111 ± 0.369
2.697ThrIle: 2.697 ± 0.779
2.539ThrLys: 2.539 ± 0.418
4.125ThrLeu: 4.125 ± 0.638
0.873ThrMet: 0.873 ± 0.255
2.539ThrAsn: 2.539 ± 0.448
3.411ThrPro: 3.411 ± 0.551
3.094ThrGln: 3.094 ± 0.427
2.697ThrArg: 2.697 ± 0.592
3.094ThrSer: 3.094 ± 0.529
3.332ThrThr: 3.332 ± 0.725
3.094ThrVal: 3.094 ± 0.495
0.873ThrTrp: 0.873 ± 0.255
1.587ThrTyr: 1.587 ± 0.409
0.0ThrXaa: 0.0 ± 0.0
Val
6.664ValAla: 6.664 ± 0.739
0.555ValCys: 0.555 ± 0.266
3.967ValAsp: 3.967 ± 0.614
3.967ValGlu: 3.967 ± 0.527
1.904ValPhe: 1.904 ± 0.348
4.601ValGly: 4.601 ± 0.614
1.269ValHis: 1.269 ± 0.38
3.173ValIle: 3.173 ± 0.552
1.825ValLys: 1.825 ± 0.392
4.601ValLeu: 4.601 ± 0.696
1.111ValMet: 1.111 ± 0.251
2.221ValAsn: 2.221 ± 0.317
3.015ValPro: 3.015 ± 0.426
2.935ValGln: 2.935 ± 0.527
4.363ValArg: 4.363 ± 0.543
4.601ValSer: 4.601 ± 0.736
3.649ValThr: 3.649 ± 0.532
3.491ValVal: 3.491 ± 0.648
0.635ValTrp: 0.635 ± 0.203
1.507ValTyr: 1.507 ± 0.491
0.0ValXaa: 0.0 ± 0.0
Trp
0.952TrpAla: 0.952 ± 0.322
0.397TrpCys: 0.397 ± 0.16
0.714TrpAsp: 0.714 ± 0.229
1.031TrpGlu: 1.031 ± 0.306
0.476TrpPhe: 0.476 ± 0.225
1.428TrpGly: 1.428 ± 0.301
0.714TrpHis: 0.714 ± 0.185
0.635TrpIle: 0.635 ± 0.247
0.714TrpLys: 0.714 ± 0.261
1.111TrpLeu: 1.111 ± 0.344
0.555TrpMet: 0.555 ± 0.209
0.159TrpAsn: 0.159 ± 0.104
0.873TrpPro: 0.873 ± 0.301
1.269TrpGln: 1.269 ± 0.327
0.952TrpArg: 0.952 ± 0.262
1.349TrpSer: 1.349 ± 0.286
0.873TrpThr: 0.873 ± 0.254
1.111TrpVal: 1.111 ± 0.281
0.317TrpTrp: 0.317 ± 0.148
0.397TrpTyr: 0.397 ± 0.217
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.935TyrAla: 2.935 ± 0.447
0.397TyrCys: 0.397 ± 0.151
1.983TyrAsp: 1.983 ± 0.393
1.983TyrGlu: 1.983 ± 0.4
0.873TyrPhe: 0.873 ± 0.292
2.063TyrGly: 2.063 ± 0.42
0.635TyrHis: 0.635 ± 0.237
1.349TyrIle: 1.349 ± 0.343
0.952TyrLys: 0.952 ± 0.296
2.142TyrLeu: 2.142 ± 0.467
0.793TyrMet: 0.793 ± 0.27
0.635TyrAsn: 0.635 ± 0.147
1.111TyrPro: 1.111 ± 0.396
1.269TyrGln: 1.269 ± 0.263
2.539TyrArg: 2.539 ± 0.467
1.349TyrSer: 1.349 ± 0.413
1.666TyrThr: 1.666 ± 0.351
1.587TyrVal: 1.587 ± 0.427
0.397TyrTrp: 0.397 ± 0.167
0.873TyrTyr: 0.873 ± 0.293
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (12606 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski