Amino acid dipepetide frequency for Clostridium phage Clo-PEP-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.358AlaAla: 3.358 ± 0.613
0.517AlaCys: 0.517 ± 0.219
2.841AlaAsp: 2.841 ± 0.465
3.681AlaGlu: 3.681 ± 0.387
2.131AlaPhe: 2.131 ± 0.48
2.647AlaGly: 2.647 ± 0.528
0.904AlaHis: 0.904 ± 0.265
3.616AlaIle: 3.616 ± 0.519
6.199AlaLys: 6.199 ± 0.836
4.584AlaLeu: 4.584 ± 0.606
1.679AlaMet: 1.679 ± 0.291
2.712AlaAsn: 2.712 ± 0.359
1.421AlaPro: 1.421 ± 0.358
1.937AlaGln: 1.937 ± 0.317
2.647AlaArg: 2.647 ± 0.403
2.454AlaSer: 2.454 ± 0.508
3.035AlaThr: 3.035 ± 0.513
4.068AlaVal: 4.068 ± 0.518
0.452AlaTrp: 0.452 ± 0.163
2.002AlaTyr: 2.002 ± 0.276
0.0AlaXaa: 0.0 ± 0.0
Cys
0.452CysAla: 0.452 ± 0.182
0.194CysCys: 0.194 ± 0.118
0.581CysAsp: 0.581 ± 0.165
1.356CysGlu: 1.356 ± 0.272
0.323CysPhe: 0.323 ± 0.137
0.839CysGly: 0.839 ± 0.229
0.387CysHis: 0.387 ± 0.166
0.581CysIle: 0.581 ± 0.189
1.485CysLys: 1.485 ± 0.383
1.227CysLeu: 1.227 ± 0.326
0.194CysMet: 0.194 ± 0.112
1.356CysAsn: 1.356 ± 0.287
0.71CysPro: 0.71 ± 0.293
0.581CysGln: 0.581 ± 0.201
0.129CysArg: 0.129 ± 0.086
0.71CysSer: 0.71 ± 0.191
0.387CysThr: 0.387 ± 0.138
0.646CysVal: 0.646 ± 0.203
0.129CysTrp: 0.129 ± 0.087
0.969CysTyr: 0.969 ± 0.24
0.0CysXaa: 0.0 ± 0.0
Asp
1.356AspAla: 1.356 ± 0.286
0.646AspCys: 0.646 ± 0.19
4.132AspAsp: 4.132 ± 0.717
6.07AspGlu: 6.07 ± 1.006
3.099AspPhe: 3.099 ± 0.528
3.551AspGly: 3.551 ± 0.572
0.646AspHis: 0.646 ± 0.246
6.974AspIle: 6.974 ± 0.721
5.618AspLys: 5.618 ± 0.672
6.974AspLeu: 6.974 ± 0.843
0.904AspMet: 0.904 ± 0.197
3.616AspAsn: 3.616 ± 0.512
2.97AspPro: 2.97 ± 0.562
1.033AspGln: 1.033 ± 0.224
2.325AspArg: 2.325 ± 0.429
3.422AspSer: 3.422 ± 0.477
3.229AspThr: 3.229 ± 0.497
3.358AspVal: 3.358 ± 0.383
1.033AspTrp: 1.033 ± 0.224
3.745AspTyr: 3.745 ± 0.501
0.0AspXaa: 0.0 ± 0.0
Glu
5.618GluAla: 5.618 ± 0.721
0.839GluCys: 0.839 ± 0.289
6.586GluAsp: 6.586 ± 0.828
9.427GluGlu: 9.427 ± 1.559
2.647GluPhe: 2.647 ± 0.408
5.682GluGly: 5.682 ± 0.622
0.969GluHis: 0.969 ± 0.237
6.005GluIle: 6.005 ± 0.724
6.328GluLys: 6.328 ± 0.679
8.136GluLeu: 8.136 ± 0.82
2.647GluMet: 2.647 ± 0.378
3.487GluAsn: 3.487 ± 0.436
2.325GluPro: 2.325 ± 0.489
1.614GluGln: 1.614 ± 0.292
2.325GluArg: 2.325 ± 0.48
3.551GluSer: 3.551 ± 0.362
3.099GluThr: 3.099 ± 0.445
5.553GluVal: 5.553 ± 0.579
0.904GluTrp: 0.904 ± 0.215
2.777GluTyr: 2.777 ± 0.541
0.0GluXaa: 0.0 ± 0.0
Phe
1.421PheAla: 1.421 ± 0.218
0.258PheCys: 0.258 ± 0.12
2.195PheAsp: 2.195 ± 0.361
2.647PheGlu: 2.647 ± 0.47
0.969PhePhe: 0.969 ± 0.217
2.26PheGly: 2.26 ± 0.463
0.71PheHis: 0.71 ± 0.223
3.358PheIle: 3.358 ± 0.458
3.551PheLys: 3.551 ± 0.551
2.454PheLeu: 2.454 ± 0.367
0.775PheMet: 0.775 ± 0.25
3.099PheAsn: 3.099 ± 0.457
0.969PhePro: 0.969 ± 0.293
1.291PheGln: 1.291 ± 0.363
1.356PheArg: 1.356 ± 0.235
2.97PheSer: 2.97 ± 0.49
3.035PheThr: 3.035 ± 0.586
1.873PheVal: 1.873 ± 0.3
0.517PheTrp: 0.517 ± 0.181
1.162PheTyr: 1.162 ± 0.327
0.0PheXaa: 0.0 ± 0.0
Gly
2.325GlyAla: 2.325 ± 0.555
0.969GlyCys: 0.969 ± 0.293
4.068GlyAsp: 4.068 ± 0.457
4.197GlyGlu: 4.197 ± 0.513
2.26GlyPhe: 2.26 ± 0.365
4.778GlyGly: 4.778 ± 0.696
0.969GlyHis: 0.969 ± 0.242
4.843GlyIle: 4.843 ± 0.495
7.361GlyLys: 7.361 ± 0.567
5.618GlyLeu: 5.618 ± 0.65
1.679GlyMet: 1.679 ± 0.325
2.777GlyAsn: 2.777 ± 0.458
1.485GlyPro: 1.485 ± 0.327
2.066GlyGln: 2.066 ± 0.349
1.873GlyArg: 1.873 ± 0.385
4.197GlySer: 4.197 ± 0.486
3.939GlyThr: 3.939 ± 0.507
5.036GlyVal: 5.036 ± 0.625
0.969GlyTrp: 0.969 ± 0.239
3.229GlyTyr: 3.229 ± 0.448
0.0GlyXaa: 0.0 ± 0.0
His
0.387HisAla: 0.387 ± 0.185
0.387HisCys: 0.387 ± 0.179
0.969HisAsp: 0.969 ± 0.298
1.033HisGlu: 1.033 ± 0.245
0.71HisPhe: 0.71 ± 0.165
0.71HisGly: 0.71 ± 0.199
0.646HisHis: 0.646 ± 0.184
1.033HisIle: 1.033 ± 0.235
1.291HisLys: 1.291 ± 0.279
1.679HisLeu: 1.679 ± 0.322
0.129HisMet: 0.129 ± 0.086
1.098HisAsn: 1.098 ± 0.279
0.839HisPro: 0.839 ± 0.244
0.517HisGln: 0.517 ± 0.165
0.646HisArg: 0.646 ± 0.208
1.162HisSer: 1.162 ± 0.299
1.098HisThr: 1.098 ± 0.297
0.517HisVal: 0.517 ± 0.162
0.194HisTrp: 0.194 ± 0.107
0.969HisTyr: 0.969 ± 0.293
0.0HisXaa: 0.0 ± 0.0
Ile
3.551IleAla: 3.551 ± 0.417
1.227IleCys: 1.227 ± 0.3
5.23IleAsp: 5.23 ± 0.663
6.909IleGlu: 6.909 ± 0.683
2.26IlePhe: 2.26 ± 0.425
3.681IleGly: 3.681 ± 0.611
1.162IleHis: 1.162 ± 0.248
5.101IleIle: 5.101 ± 0.645
6.78IleLys: 6.78 ± 0.717
6.07IleLeu: 6.07 ± 0.618
1.743IleMet: 1.743 ± 0.365
5.553IleAsn: 5.553 ± 0.551
3.358IlePro: 3.358 ± 0.532
2.97IleGln: 2.97 ± 0.401
2.131IleArg: 2.131 ± 0.395
4.391IleSer: 4.391 ± 0.475
4.52IleThr: 4.52 ± 0.704
4.262IleVal: 4.262 ± 0.624
0.969IleTrp: 0.969 ± 0.239
3.099IleTyr: 3.099 ± 0.441
0.0IleXaa: 0.0 ± 0.0
Lys
8.071LysAla: 8.071 ± 0.899
1.033LysCys: 1.033 ± 0.253
5.94LysAsp: 5.94 ± 0.709
8.459LysGlu: 8.459 ± 1.093
3.229LysPhe: 3.229 ± 0.507
5.553LysGly: 5.553 ± 0.506
1.937LysHis: 1.937 ± 0.367
6.328LysIle: 6.328 ± 0.674
9.556LysLys: 9.556 ± 1.266
8.265LysLeu: 8.265 ± 0.732
2.777LysMet: 2.777 ± 0.412
5.101LysAsn: 5.101 ± 0.598
3.681LysPro: 3.681 ± 0.541
3.422LysGln: 3.422 ± 0.498
3.358LysArg: 3.358 ± 0.474
4.907LysSer: 4.907 ± 0.685
4.132LysThr: 4.132 ± 0.552
6.07LysVal: 6.07 ± 0.613
1.033LysTrp: 1.033 ± 0.307
4.262LysTyr: 4.262 ± 0.649
0.0LysXaa: 0.0 ± 0.0
Leu
4.972LeuAla: 4.972 ± 0.773
1.485LeuCys: 1.485 ± 0.387
5.101LeuAsp: 5.101 ± 0.498
7.103LeuGlu: 7.103 ± 0.885
2.389LeuPhe: 2.389 ± 0.474
5.618LeuGly: 5.618 ± 0.746
1.485LeuHis: 1.485 ± 0.287
6.134LeuIle: 6.134 ± 0.78
8.975LeuLys: 8.975 ± 0.934
6.134LeuLeu: 6.134 ± 0.63
1.808LeuMet: 1.808 ± 0.381
4.972LeuAsn: 4.972 ± 0.493
2.583LeuPro: 2.583 ± 0.477
3.099LeuGln: 3.099 ± 0.418
3.939LeuArg: 3.939 ± 0.564
5.036LeuSer: 5.036 ± 0.648
4.778LeuThr: 4.778 ± 0.552
4.52LeuVal: 4.52 ± 0.459
0.969LeuTrp: 0.969 ± 0.222
3.551LeuTyr: 3.551 ± 0.62
0.0LeuXaa: 0.0 ± 0.0
Met
1.614MetAla: 1.614 ± 0.293
0.258MetCys: 0.258 ± 0.137
1.614MetAsp: 1.614 ± 0.322
1.743MetGlu: 1.743 ± 0.374
0.969MetPhe: 0.969 ± 0.329
1.614MetGly: 1.614 ± 0.302
0.194MetHis: 0.194 ± 0.114
1.743MetIle: 1.743 ± 0.336
2.518MetLys: 2.518 ± 0.43
2.26MetLeu: 2.26 ± 0.481
0.323MetMet: 0.323 ± 0.148
1.679MetAsn: 1.679 ± 0.356
1.162MetPro: 1.162 ± 0.274
0.646MetGln: 0.646 ± 0.218
0.904MetArg: 0.904 ± 0.197
1.227MetSer: 1.227 ± 0.284
1.162MetThr: 1.162 ± 0.271
0.969MetVal: 0.969 ± 0.281
0.129MetTrp: 0.129 ± 0.101
0.71MetTyr: 0.71 ± 0.241
0.0MetXaa: 0.0 ± 0.0
Asn
2.518AsnAla: 2.518 ± 0.381
0.839AsnCys: 0.839 ± 0.253
2.712AsnAsp: 2.712 ± 0.462
3.616AsnGlu: 3.616 ± 0.488
2.195AsnPhe: 2.195 ± 0.393
3.874AsnGly: 3.874 ± 0.602
1.033AsnHis: 1.033 ± 0.361
4.778AsnIle: 4.778 ± 0.485
6.263AsnLys: 6.263 ± 0.763
4.649AsnLeu: 4.649 ± 0.5
1.162AsnMet: 1.162 ± 0.288
4.326AsnAsn: 4.326 ± 0.636
2.389AsnPro: 2.389 ± 0.503
2.131AsnGln: 2.131 ± 0.338
2.777AsnArg: 2.777 ± 0.472
4.907AsnSer: 4.907 ± 0.707
2.583AsnThr: 2.583 ± 0.372
2.583AsnVal: 2.583 ± 0.395
0.775AsnTrp: 0.775 ± 0.254
2.002AsnTyr: 2.002 ± 0.361
0.0AsnXaa: 0.0 ± 0.0
Pro
2.002ProAla: 2.002 ± 0.401
0.581ProCys: 0.581 ± 0.21
2.389ProAsp: 2.389 ± 0.304
3.616ProGlu: 3.616 ± 0.422
0.71ProPhe: 0.71 ± 0.176
2.712ProGly: 2.712 ± 0.444
0.452ProHis: 0.452 ± 0.145
2.777ProIle: 2.777 ± 0.499
3.551ProLys: 3.551 ± 0.477
2.066ProLeu: 2.066 ± 0.406
0.775ProMet: 0.775 ± 0.19
1.808ProAsn: 1.808 ± 0.342
0.581ProPro: 0.581 ± 0.2
1.421ProGln: 1.421 ± 0.301
1.679ProArg: 1.679 ± 0.339
1.614ProSer: 1.614 ± 0.354
2.389ProThr: 2.389 ± 0.556
2.583ProVal: 2.583 ± 0.4
0.71ProTrp: 0.71 ± 0.235
1.356ProTyr: 1.356 ± 0.326
0.0ProXaa: 0.0 ± 0.0
Gln
2.389GlnAla: 2.389 ± 0.458
0.517GlnCys: 0.517 ± 0.174
1.679GlnAsp: 1.679 ± 0.289
2.583GlnGlu: 2.583 ± 0.476
1.098GlnPhe: 1.098 ± 0.256
3.035GlnGly: 3.035 ± 0.489
0.323GlnHis: 0.323 ± 0.121
2.518GlnIle: 2.518 ± 0.364
2.841GlnLys: 2.841 ± 0.508
3.099GlnLeu: 3.099 ± 0.608
0.839GlnMet: 0.839 ± 0.229
1.614GlnAsn: 1.614 ± 0.251
1.098GlnPro: 1.098 ± 0.229
1.227GlnGln: 1.227 ± 0.271
1.162GlnArg: 1.162 ± 0.299
1.55GlnSer: 1.55 ± 0.293
1.55GlnThr: 1.55 ± 0.325
2.583GlnVal: 2.583 ± 0.438
0.581GlnTrp: 0.581 ± 0.182
1.356GlnTyr: 1.356 ± 0.275
0.0GlnXaa: 0.0 ± 0.0
Arg
2.712ArgAla: 2.712 ± 0.507
0.323ArgCys: 0.323 ± 0.126
2.647ArgAsp: 2.647 ± 0.338
3.229ArgGlu: 3.229 ± 0.61
2.002ArgPhe: 2.002 ± 0.276
2.325ArgGly: 2.325 ± 0.395
0.258ArgHis: 0.258 ± 0.134
2.647ArgIle: 2.647 ± 0.462
4.262ArgLys: 4.262 ± 0.721
3.551ArgLeu: 3.551 ± 0.635
0.969ArgMet: 0.969 ± 0.222
1.743ArgAsn: 1.743 ± 0.328
1.162ArgPro: 1.162 ± 0.269
0.904ArgGln: 0.904 ± 0.304
1.743ArgArg: 1.743 ± 0.378
2.26ArgSer: 2.26 ± 0.414
1.743ArgThr: 1.743 ± 0.317
2.002ArgVal: 2.002 ± 0.38
0.323ArgTrp: 0.323 ± 0.172
1.485ArgTyr: 1.485 ± 0.344
0.0ArgXaa: 0.0 ± 0.0
Ser
2.777SerAla: 2.777 ± 0.677
0.904SerCys: 0.904 ± 0.269
3.293SerAsp: 3.293 ± 0.419
2.906SerGlu: 2.906 ± 0.469
2.97SerPhe: 2.97 ± 0.519
5.101SerGly: 5.101 ± 0.689
1.227SerHis: 1.227 ± 0.282
4.003SerIle: 4.003 ± 0.517
5.488SerLys: 5.488 ± 0.617
4.714SerLeu: 4.714 ± 0.571
0.969SerMet: 0.969 ± 0.262
3.358SerAsn: 3.358 ± 0.588
2.066SerPro: 2.066 ± 0.456
2.712SerGln: 2.712 ± 0.34
2.26SerArg: 2.26 ± 0.452
3.551SerSer: 3.551 ± 0.621
2.841SerThr: 2.841 ± 0.438
3.035SerVal: 3.035 ± 0.379
0.839SerTrp: 0.839 ± 0.223
2.647SerTyr: 2.647 ± 0.442
0.0SerXaa: 0.0 ± 0.0
Thr
2.389ThrAla: 2.389 ± 0.447
0.581ThrCys: 0.581 ± 0.181
3.745ThrAsp: 3.745 ± 0.512
3.164ThrGlu: 3.164 ± 0.538
2.389ThrPhe: 2.389 ± 0.329
4.262ThrGly: 4.262 ± 0.546
0.904ThrHis: 0.904 ± 0.204
4.262ThrIle: 4.262 ± 0.468
4.326ThrLys: 4.326 ± 0.521
3.874ThrLeu: 3.874 ± 0.395
1.421ThrMet: 1.421 ± 0.348
2.26ThrAsn: 2.26 ± 0.346
2.583ThrPro: 2.583 ± 0.514
1.679ThrGln: 1.679 ± 0.316
2.389ThrArg: 2.389 ± 0.404
2.97ThrSer: 2.97 ± 0.544
2.454ThrThr: 2.454 ± 0.418
3.099ThrVal: 3.099 ± 0.545
0.581ThrTrp: 0.581 ± 0.2
2.131ThrTyr: 2.131 ± 0.319
0.0ThrXaa: 0.0 ± 0.0
Val
2.777ValAla: 2.777 ± 0.381
0.839ValCys: 0.839 ± 0.295
4.649ValAsp: 4.649 ± 0.622
4.843ValGlu: 4.843 ± 0.631
2.777ValPhe: 2.777 ± 0.394
3.164ValGly: 3.164 ± 0.451
0.646ValHis: 0.646 ± 0.2
4.649ValIle: 4.649 ± 0.599
6.328ValLys: 6.328 ± 0.689
4.778ValLeu: 4.778 ± 0.66
1.421ValMet: 1.421 ± 0.299
3.487ValAsn: 3.487 ± 0.658
2.454ValPro: 2.454 ± 0.409
2.26ValGln: 2.26 ± 0.365
2.647ValArg: 2.647 ± 0.418
3.035ValSer: 3.035 ± 0.434
2.518ValThr: 2.518 ± 0.37
4.391ValVal: 4.391 ± 0.588
0.775ValTrp: 0.775 ± 0.248
2.454ValTyr: 2.454 ± 0.444
0.0ValXaa: 0.0 ± 0.0
Trp
0.969TrpAla: 0.969 ± 0.248
0.258TrpCys: 0.258 ± 0.127
0.969TrpAsp: 0.969 ± 0.236
1.356TrpGlu: 1.356 ± 0.349
0.323TrpPhe: 0.323 ± 0.121
0.775TrpGly: 0.775 ± 0.188
0.194TrpHis: 0.194 ± 0.104
0.517TrpIle: 0.517 ± 0.179
1.033TrpLys: 1.033 ± 0.254
1.098TrpLeu: 1.098 ± 0.222
0.065TrpMet: 0.065 ± 0.064
1.033TrpAsn: 1.033 ± 0.26
0.258TrpPro: 0.258 ± 0.123
0.517TrpGln: 0.517 ± 0.168
0.387TrpArg: 0.387 ± 0.188
1.033TrpSer: 1.033 ± 0.277
0.258TrpThr: 0.258 ± 0.117
1.098TrpVal: 1.098 ± 0.225
0.194TrpTrp: 0.194 ± 0.098
0.517TrpTyr: 0.517 ± 0.199
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.227TyrAla: 1.227 ± 0.287
0.581TyrCys: 0.581 ± 0.184
3.358TyrAsp: 3.358 ± 0.493
2.647TyrGlu: 2.647 ± 0.436
1.421TyrPhe: 1.421 ± 0.429
2.454TyrGly: 2.454 ± 0.456
0.969TyrHis: 0.969 ± 0.261
3.099TyrIle: 3.099 ± 0.35
3.358TyrLys: 3.358 ± 0.611
3.422TyrLeu: 3.422 ± 0.459
1.098TyrMet: 1.098 ± 0.294
3.035TyrAsn: 3.035 ± 0.442
1.679TyrPro: 1.679 ± 0.313
1.614TyrGln: 1.614 ± 0.274
1.614TyrArg: 1.614 ± 0.284
2.712TyrSer: 2.712 ± 0.414
2.712TyrThr: 2.712 ± 0.424
2.647TyrVal: 2.647 ± 0.495
0.775TyrTrp: 0.775 ± 0.262
1.873TyrTyr: 1.873 ± 0.35
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (15488 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski