Amino acid dipepetide frequency for Pectobacterium phage PP16

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.742AlaAla: 11.742 ± 0.941
0.996AlaCys: 0.996 ± 0.258
6.761AlaAsp: 6.761 ± 0.804
5.693AlaGlu: 5.693 ± 0.76
3.131AlaPhe: 3.131 ± 0.383
8.042AlaGly: 8.042 ± 0.758
2.633AlaHis: 2.633 ± 0.471
4.128AlaIle: 4.128 ± 0.524
3.345AlaLys: 3.345 ± 0.453
8.682AlaLeu: 8.682 ± 0.997
2.277AlaMet: 2.277 ± 0.483
3.772AlaAsn: 3.772 ± 0.497
3.558AlaPro: 3.558 ± 0.489
4.768AlaGln: 4.768 ± 0.638
4.981AlaArg: 4.981 ± 0.636
4.697AlaSer: 4.697 ± 0.516
6.049AlaThr: 6.049 ± 0.865
7.33AlaVal: 7.33 ± 0.775
0.925AlaTrp: 0.925 ± 0.177
3.416AlaTyr: 3.416 ± 0.483
0.0AlaXaa: 0.0 ± 0.0
Cys
0.498CysAla: 0.498 ± 0.214
0.142CysCys: 0.142 ± 0.1
0.854CysAsp: 0.854 ± 0.286
0.427CysGlu: 0.427 ± 0.169
0.356CysPhe: 0.356 ± 0.151
0.569CysGly: 0.569 ± 0.22
0.498CysHis: 0.498 ± 0.223
0.783CysIle: 0.783 ± 0.291
0.498CysLys: 0.498 ± 0.173
0.712CysLeu: 0.712 ± 0.199
0.64CysMet: 0.64 ± 0.241
0.498CysAsn: 0.498 ± 0.198
0.498CysPro: 0.498 ± 0.171
0.498CysGln: 0.498 ± 0.163
0.285CysArg: 0.285 ± 0.194
0.712CysSer: 0.712 ± 0.285
0.996CysThr: 0.996 ± 0.334
1.21CysVal: 1.21 ± 0.329
0.213CysTrp: 0.213 ± 0.13
0.783CysTyr: 0.783 ± 0.232
0.0CysXaa: 0.0 ± 0.0
Asp
6.689AspAla: 6.689 ± 0.566
0.569AspCys: 0.569 ± 0.242
4.341AspAsp: 4.341 ± 0.61
3.914AspGlu: 3.914 ± 0.495
1.708AspPhe: 1.708 ± 0.274
5.124AspGly: 5.124 ± 0.817
0.64AspHis: 0.64 ± 0.225
2.775AspIle: 2.775 ± 0.378
3.202AspLys: 3.202 ± 0.456
4.626AspLeu: 4.626 ± 0.572
2.206AspMet: 2.206 ± 0.34
2.918AspAsn: 2.918 ± 0.469
1.993AspPro: 1.993 ± 0.331
1.067AspGln: 1.067 ± 0.335
2.918AspArg: 2.918 ± 0.562
4.483AspSer: 4.483 ± 0.574
5.48AspThr: 5.48 ± 0.599
5.551AspVal: 5.551 ± 0.53
0.925AspTrp: 0.925 ± 0.241
2.704AspTyr: 2.704 ± 0.434
0.0AspXaa: 0.0 ± 0.0
Glu
4.27GluAla: 4.27 ± 0.604
0.427GluCys: 0.427 ± 0.219
2.918GluAsp: 2.918 ± 0.358
3.131GluGlu: 3.131 ± 0.664
2.491GluPhe: 2.491 ± 0.423
2.989GluGly: 2.989 ± 0.516
1.352GluHis: 1.352 ± 0.309
1.921GluIle: 1.921 ± 0.318
2.135GluLys: 2.135 ± 0.481
5.408GluLeu: 5.408 ± 0.715
1.779GluMet: 1.779 ± 0.323
1.993GluAsn: 1.993 ± 0.339
0.854GluPro: 0.854 ± 0.223
2.775GluGln: 2.775 ± 0.514
3.558GluArg: 3.558 ± 0.499
3.629GluSer: 3.629 ± 0.484
3.274GluThr: 3.274 ± 0.47
3.843GluVal: 3.843 ± 0.62
0.925GluTrp: 0.925 ± 0.264
2.42GluTyr: 2.42 ± 0.408
0.0GluXaa: 0.0 ± 0.0
Phe
3.06PheAla: 3.06 ± 0.404
0.213PheCys: 0.213 ± 0.122
2.704PheAsp: 2.704 ± 0.416
1.067PheGlu: 1.067 ± 0.227
0.996PhePhe: 0.996 ± 0.269
2.989PheGly: 2.989 ± 0.462
0.498PheHis: 0.498 ± 0.164
0.925PheIle: 0.925 ± 0.246
1.993PheLys: 1.993 ± 0.395
2.277PheLeu: 2.277 ± 0.321
0.783PheMet: 0.783 ± 0.174
1.779PheAsn: 1.779 ± 0.377
1.494PhePro: 1.494 ± 0.324
0.996PheGln: 0.996 ± 0.255
1.779PheArg: 1.779 ± 0.385
2.135PheSer: 2.135 ± 0.471
1.566PheThr: 1.566 ± 0.296
2.562PheVal: 2.562 ± 0.43
0.498PheTrp: 0.498 ± 0.161
0.783PheTyr: 0.783 ± 0.22
0.0PheXaa: 0.0 ± 0.0
Gly
7.188GlyAla: 7.188 ± 0.928
1.281GlyCys: 1.281 ± 0.354
4.412GlyAsp: 4.412 ± 0.616
2.633GlyGlu: 2.633 ± 0.356
2.491GlyPhe: 2.491 ± 0.344
5.337GlyGly: 5.337 ± 0.74
0.712GlyHis: 0.712 ± 0.202
4.91GlyIle: 4.91 ± 0.509
3.914GlyLys: 3.914 ± 0.615
5.408GlyLeu: 5.408 ± 0.491
2.491GlyMet: 2.491 ± 0.381
2.491GlyAsn: 2.491 ± 0.413
1.637GlyPro: 1.637 ± 0.392
2.989GlyGln: 2.989 ± 0.439
4.056GlyArg: 4.056 ± 0.446
4.555GlySer: 4.555 ± 0.551
6.547GlyThr: 6.547 ± 0.791
5.764GlyVal: 5.764 ± 0.497
0.996GlyTrp: 0.996 ± 0.224
3.914GlyTyr: 3.914 ± 0.763
0.0GlyXaa: 0.0 ± 0.0
His
1.423HisAla: 1.423 ± 0.311
0.285HisCys: 0.285 ± 0.118
1.494HisAsp: 1.494 ± 0.293
1.139HisGlu: 1.139 ± 0.355
0.427HisPhe: 0.427 ± 0.208
1.921HisGly: 1.921 ± 0.428
0.569HisHis: 0.569 ± 0.201
1.21HisIle: 1.21 ± 0.235
0.854HisLys: 0.854 ± 0.26
2.135HisLeu: 2.135 ± 0.427
0.64HisMet: 0.64 ± 0.239
1.139HisAsn: 1.139 ± 0.299
1.21HisPro: 1.21 ± 0.276
0.783HisGln: 0.783 ± 0.198
1.921HisArg: 1.921 ± 0.335
1.21HisSer: 1.21 ± 0.271
0.854HisThr: 0.854 ± 0.318
1.352HisVal: 1.352 ± 0.356
0.427HisTrp: 0.427 ± 0.191
0.854HisTyr: 0.854 ± 0.35
0.0HisXaa: 0.0 ± 0.0
Ile
4.555IleAla: 4.555 ± 0.518
0.427IleCys: 0.427 ± 0.192
2.42IleAsp: 2.42 ± 0.355
2.704IleGlu: 2.704 ± 0.558
0.712IlePhe: 0.712 ± 0.253
2.775IleGly: 2.775 ± 0.4
1.281IleHis: 1.281 ± 0.31
0.854IleIle: 0.854 ± 0.225
2.704IleLys: 2.704 ± 0.437
3.914IleLeu: 3.914 ± 0.607
1.139IleMet: 1.139 ± 0.274
2.348IleAsn: 2.348 ± 0.416
2.348IlePro: 2.348 ± 0.274
1.85IleGln: 1.85 ± 0.449
2.135IleArg: 2.135 ± 0.405
2.491IleSer: 2.491 ± 0.396
4.128IleThr: 4.128 ± 0.749
2.348IleVal: 2.348 ± 0.369
0.356IleTrp: 0.356 ± 0.197
0.925IleTyr: 0.925 ± 0.248
0.0IleXaa: 0.0 ± 0.0
Lys
5.124LysAla: 5.124 ± 0.747
0.285LysCys: 0.285 ± 0.138
2.704LysAsp: 2.704 ± 0.467
2.704LysGlu: 2.704 ± 0.5
1.067LysPhe: 1.067 ± 0.332
2.633LysGly: 2.633 ± 0.368
1.067LysHis: 1.067 ± 0.319
1.139LysIle: 1.139 ± 0.3
1.494LysLys: 1.494 ± 0.466
4.412LysLeu: 4.412 ± 0.453
1.21LysMet: 1.21 ± 0.258
1.423LysAsn: 1.423 ± 0.259
2.135LysPro: 2.135 ± 0.39
2.348LysGln: 2.348 ± 0.437
3.416LysArg: 3.416 ± 0.547
2.064LysSer: 2.064 ± 0.365
1.779LysThr: 1.779 ± 0.362
3.06LysVal: 3.06 ± 0.489
0.427LysTrp: 0.427 ± 0.186
1.708LysTyr: 1.708 ± 0.374
0.0LysXaa: 0.0 ± 0.0
Leu
7.259LeuAla: 7.259 ± 0.78
1.423LeuCys: 1.423 ± 0.304
5.195LeuAsp: 5.195 ± 0.483
4.981LeuGlu: 4.981 ± 0.489
2.704LeuPhe: 2.704 ± 0.418
6.049LeuGly: 6.049 ± 0.668
2.348LeuHis: 2.348 ± 0.404
3.416LeuIle: 3.416 ± 0.454
3.345LeuLys: 3.345 ± 0.456
6.832LeuLeu: 6.832 ± 0.781
2.562LeuMet: 2.562 ± 0.345
3.416LeuAsn: 3.416 ± 0.504
4.697LeuPro: 4.697 ± 0.466
3.914LeuGln: 3.914 ± 0.458
6.974LeuArg: 6.974 ± 0.835
6.262LeuSer: 6.262 ± 0.802
4.768LeuThr: 4.768 ± 0.536
7.757LeuVal: 7.757 ± 0.814
0.854LeuTrp: 0.854 ± 0.255
2.918LeuTyr: 2.918 ± 0.454
0.0LeuXaa: 0.0 ± 0.0
Met
2.847MetAla: 2.847 ± 0.532
0.356MetCys: 0.356 ± 0.167
1.352MetAsp: 1.352 ± 0.375
1.139MetGlu: 1.139 ± 0.246
1.352MetPhe: 1.352 ± 0.307
1.993MetGly: 1.993 ± 0.347
0.925MetHis: 0.925 ± 0.246
0.925MetIle: 0.925 ± 0.233
0.925MetLys: 0.925 ± 0.227
2.491MetLeu: 2.491 ± 0.478
0.64MetMet: 0.64 ± 0.214
0.996MetAsn: 0.996 ± 0.25
1.779MetPro: 1.779 ± 0.443
2.206MetGln: 2.206 ± 0.401
2.135MetArg: 2.135 ± 0.385
1.708MetSer: 1.708 ± 0.436
1.708MetThr: 1.708 ± 0.373
2.348MetVal: 2.348 ± 0.404
0.142MetTrp: 0.142 ± 0.097
1.281MetTyr: 1.281 ± 0.252
0.0MetXaa: 0.0 ± 0.0
Asn
4.27AsnAla: 4.27 ± 0.613
0.285AsnCys: 0.285 ± 0.147
1.637AsnAsp: 1.637 ± 0.424
1.637AsnGlu: 1.637 ± 0.285
1.566AsnPhe: 1.566 ± 0.312
3.772AsnGly: 3.772 ± 0.506
0.712AsnHis: 0.712 ± 0.25
1.779AsnIle: 1.779 ± 0.412
2.348AsnLys: 2.348 ± 0.346
4.128AsnLeu: 4.128 ± 0.514
1.85AsnMet: 1.85 ± 0.322
1.779AsnAsn: 1.779 ± 0.298
1.993AsnPro: 1.993 ± 0.501
2.135AsnGln: 2.135 ± 0.735
2.633AsnArg: 2.633 ± 0.421
2.562AsnSer: 2.562 ± 0.507
2.918AsnThr: 2.918 ± 0.425
2.704AsnVal: 2.704 ± 0.421
0.498AsnTrp: 0.498 ± 0.21
0.996AsnTyr: 0.996 ± 0.346
0.0AsnXaa: 0.0 ± 0.0
Pro
4.128ProAla: 4.128 ± 0.49
0.356ProCys: 0.356 ± 0.147
3.416ProAsp: 3.416 ± 0.467
3.131ProGlu: 3.131 ± 0.434
0.925ProPhe: 0.925 ± 0.257
2.206ProGly: 2.206 ± 0.337
0.427ProHis: 0.427 ± 0.161
1.637ProIle: 1.637 ± 0.427
1.281ProLys: 1.281 ± 0.341
2.989ProLeu: 2.989 ± 0.51
1.637ProMet: 1.637 ± 0.309
1.921ProAsn: 1.921 ± 0.477
2.206ProPro: 2.206 ± 0.539
1.779ProGln: 1.779 ± 0.419
1.993ProArg: 1.993 ± 0.405
2.775ProSer: 2.775 ± 0.419
3.416ProThr: 3.416 ± 0.509
3.274ProVal: 3.274 ± 0.347
0.356ProTrp: 0.356 ± 0.179
1.566ProTyr: 1.566 ± 0.229
0.0ProXaa: 0.0 ± 0.0
Gln
4.483GlnAla: 4.483 ± 0.651
0.285GlnCys: 0.285 ± 0.185
1.779GlnAsp: 1.779 ± 0.393
2.918GlnGlu: 2.918 ± 0.486
1.708GlnPhe: 1.708 ± 0.336
4.128GlnGly: 4.128 ± 0.693
1.423GlnHis: 1.423 ± 0.4
1.85GlnIle: 1.85 ± 0.317
0.996GlnLys: 0.996 ± 0.196
3.843GlnLeu: 3.843 ± 0.546
1.566GlnMet: 1.566 ± 0.37
2.064GlnAsn: 2.064 ± 0.544
1.281GlnPro: 1.281 ± 0.433
2.847GlnGln: 2.847 ± 0.597
2.989GlnArg: 2.989 ± 0.527
2.562GlnSer: 2.562 ± 0.435
2.206GlnThr: 2.206 ± 0.372
3.345GlnVal: 3.345 ± 0.533
0.427GlnTrp: 0.427 ± 0.162
2.775GlnTyr: 2.775 ± 0.415
0.0GlnXaa: 0.0 ± 0.0
Arg
4.483ArgAla: 4.483 ± 0.515
0.925ArgCys: 0.925 ± 0.239
4.981ArgAsp: 4.981 ± 0.574
3.202ArgGlu: 3.202 ± 0.626
1.85ArgPhe: 1.85 ± 0.384
4.483ArgGly: 4.483 ± 0.535
1.708ArgHis: 1.708 ± 0.304
3.202ArgIle: 3.202 ± 0.485
2.42ArgLys: 2.42 ± 0.399
4.839ArgLeu: 4.839 ± 0.549
1.566ArgMet: 1.566 ± 0.395
2.704ArgAsn: 2.704 ± 0.545
1.85ArgPro: 1.85 ± 0.307
3.131ArgGln: 3.131 ± 0.475
4.412ArgArg: 4.412 ± 0.564
4.056ArgSer: 4.056 ± 0.628
3.701ArgThr: 3.701 ± 0.549
4.626ArgVal: 4.626 ± 0.596
0.783ArgTrp: 0.783 ± 0.199
2.491ArgTyr: 2.491 ± 0.436
0.0ArgXaa: 0.0 ± 0.0
Ser
7.757SerAla: 7.757 ± 0.831
0.569SerCys: 0.569 ± 0.172
3.558SerAsp: 3.558 ± 0.438
2.135SerGlu: 2.135 ± 0.329
1.85SerPhe: 1.85 ± 0.361
4.697SerGly: 4.697 ± 0.473
0.712SerHis: 0.712 ± 0.195
2.847SerIle: 2.847 ± 0.401
3.772SerLys: 3.772 ± 0.629
6.191SerLeu: 6.191 ± 0.621
1.281SerMet: 1.281 ± 0.355
2.847SerAsn: 2.847 ± 0.52
1.993SerPro: 1.993 ± 0.416
1.993SerGln: 1.993 ± 0.267
2.775SerArg: 2.775 ± 0.406
3.202SerSer: 3.202 ± 0.479
4.626SerThr: 4.626 ± 0.598
5.551SerVal: 5.551 ± 0.646
1.067SerTrp: 1.067 ± 0.262
1.921SerTyr: 1.921 ± 0.334
0.0SerXaa: 0.0 ± 0.0
Thr
6.974ThrAla: 6.974 ± 0.742
0.925ThrCys: 0.925 ± 0.289
4.483ThrAsp: 4.483 ± 0.631
3.06ThrGlu: 3.06 ± 0.495
1.779ThrPhe: 1.779 ± 0.398
5.48ThrGly: 5.48 ± 0.743
1.637ThrHis: 1.637 ± 0.363
2.42ThrIle: 2.42 ± 0.354
2.42ThrLys: 2.42 ± 0.469
6.262ThrLeu: 6.262 ± 0.957
0.854ThrMet: 0.854 ± 0.23
2.847ThrAsn: 2.847 ± 0.457
2.775ThrPro: 2.775 ± 0.445
2.42ThrGln: 2.42 ± 0.389
3.202ThrArg: 3.202 ± 0.505
3.985ThrSer: 3.985 ± 0.734
4.128ThrThr: 4.128 ± 0.772
6.405ThrVal: 6.405 ± 0.83
1.067ThrTrp: 1.067 ± 0.331
2.775ThrTyr: 2.775 ± 0.467
0.0ThrXaa: 0.0 ± 0.0
Val
7.188ValAla: 7.188 ± 0.625
1.352ValCys: 1.352 ± 0.309
5.408ValAsp: 5.408 ± 0.597
3.701ValGlu: 3.701 ± 0.586
2.206ValPhe: 2.206 ± 0.48
4.981ValGly: 4.981 ± 0.423
1.637ValHis: 1.637 ± 0.428
3.416ValIle: 3.416 ± 0.553
2.989ValLys: 2.989 ± 0.704
7.472ValLeu: 7.472 ± 0.805
2.206ValMet: 2.206 ± 0.367
2.775ValAsn: 2.775 ± 0.469
4.412ValPro: 4.412 ± 0.663
4.626ValGln: 4.626 ± 0.704
5.622ValArg: 5.622 ± 0.742
4.626ValSer: 4.626 ± 0.564
4.483ValThr: 4.483 ± 0.69
4.839ValVal: 4.839 ± 0.554
0.996ValTrp: 0.996 ± 0.252
2.775ValTyr: 2.775 ± 0.48
0.0ValXaa: 0.0 ± 0.0
Trp
0.854TrpAla: 0.854 ± 0.245
0.142TrpCys: 0.142 ± 0.113
0.712TrpAsp: 0.712 ± 0.244
0.854TrpGlu: 0.854 ± 0.32
0.854TrpPhe: 0.854 ± 0.301
0.712TrpGly: 0.712 ± 0.202
0.285TrpHis: 0.285 ± 0.126
0.213TrpIle: 0.213 ± 0.115
0.213TrpLys: 0.213 ± 0.123
1.423TrpLeu: 1.423 ± 0.395
0.427TrpMet: 0.427 ± 0.165
0.569TrpAsn: 0.569 ± 0.238
0.427TrpPro: 0.427 ± 0.156
0.854TrpGln: 0.854 ± 0.268
0.854TrpArg: 0.854 ± 0.278
0.427TrpSer: 0.427 ± 0.173
0.569TrpThr: 0.569 ± 0.175
1.139TrpVal: 1.139 ± 0.262
0.142TrpTrp: 0.142 ± 0.094
0.925TrpTyr: 0.925 ± 0.264
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.206TyrAla: 2.206 ± 0.301
0.427TyrCys: 0.427 ± 0.176
2.704TyrAsp: 2.704 ± 0.447
1.921TyrGlu: 1.921 ± 0.342
0.925TyrPhe: 0.925 ± 0.272
2.847TyrGly: 2.847 ± 0.449
0.783TyrHis: 0.783 ± 0.211
2.064TyrIle: 2.064 ± 0.375
1.352TyrLys: 1.352 ± 0.294
3.629TyrLeu: 3.629 ± 0.621
1.423TyrMet: 1.423 ± 0.341
1.921TyrAsn: 1.921 ± 0.38
2.277TyrPro: 2.277 ± 0.37
1.494TyrGln: 1.494 ± 0.286
2.775TyrArg: 2.775 ± 0.431
3.202TyrSer: 3.202 ± 0.452
2.704TyrThr: 2.704 ± 0.482
2.704TyrVal: 2.704 ± 0.43
0.569TyrTrp: 0.569 ± 0.227
1.281TyrTyr: 1.281 ± 0.274
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (14053 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski