Amino acid dipepetide frequency for Pectobacterium phage PP2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.904AlaAla: 12.904 ± 1.741
1.236AlaCys: 1.236 ± 0.4
5.331AlaAsp: 5.331 ± 0.758
6.568AlaGlu: 6.568 ± 0.742
2.936AlaPhe: 2.936 ± 0.396
7.263AlaGly: 7.263 ± 1.22
1.777AlaHis: 1.777 ± 0.374
4.172AlaIle: 4.172 ± 0.544
6.413AlaLys: 6.413 ± 0.81
8.422AlaLeu: 8.422 ± 0.814
3.323AlaMet: 3.323 ± 0.377
4.018AlaAsn: 4.018 ± 0.666
2.859AlaPro: 2.859 ± 0.472
4.713AlaGln: 4.713 ± 0.875
4.559AlaArg: 4.559 ± 0.59
5.641AlaSer: 5.641 ± 0.778
4.868AlaThr: 4.868 ± 0.576
6.954AlaVal: 6.954 ± 0.692
1.468AlaTrp: 1.468 ± 0.289
3.863AlaTyr: 3.863 ± 0.605
0.0AlaXaa: 0.0 ± 0.0
Cys
0.541CysAla: 0.541 ± 0.224
0.077CysCys: 0.077 ± 0.069
0.85CysAsp: 0.85 ± 0.267
0.386CysGlu: 0.386 ± 0.167
0.155CysPhe: 0.155 ± 0.092
0.695CysGly: 0.695 ± 0.217
0.232CysHis: 0.232 ± 0.168
0.386CysIle: 0.386 ± 0.176
0.386CysLys: 0.386 ± 0.184
0.85CysLeu: 0.85 ± 0.221
0.618CysMet: 0.618 ± 0.207
0.232CysAsn: 0.232 ± 0.138
0.695CysPro: 0.695 ± 0.312
0.309CysGln: 0.309 ± 0.147
0.618CysArg: 0.618 ± 0.218
0.386CysSer: 0.386 ± 0.169
0.464CysThr: 0.464 ± 0.175
1.082CysVal: 1.082 ± 0.317
0.0CysTrp: 0.0 ± 0.0
0.773CysTyr: 0.773 ± 0.239
0.0CysXaa: 0.0 ± 0.0
Asp
5.872AspAla: 5.872 ± 0.554
0.618AspCys: 0.618 ± 0.243
2.704AspAsp: 2.704 ± 0.296
3.632AspGlu: 3.632 ± 0.418
3.323AspPhe: 3.323 ± 0.624
5.177AspGly: 5.177 ± 0.678
0.773AspHis: 0.773 ± 0.305
3.091AspIle: 3.091 ± 0.471
2.473AspLys: 2.473 ± 0.367
4.559AspLeu: 4.559 ± 0.601
1.545AspMet: 1.545 ± 0.376
3.013AspAsn: 3.013 ± 0.576
2.627AspPro: 2.627 ± 0.418
0.927AspGln: 0.927 ± 0.205
2.859AspArg: 2.859 ± 0.603
4.636AspSer: 4.636 ± 0.653
4.095AspThr: 4.095 ± 0.801
4.559AspVal: 4.559 ± 0.623
0.695AspTrp: 0.695 ± 0.265
2.936AspTyr: 2.936 ± 0.522
0.0AspXaa: 0.0 ± 0.0
Glu
6.645GluAla: 6.645 ± 0.879
0.618GluCys: 0.618 ± 0.212
4.404GluAsp: 4.404 ± 0.673
4.327GluGlu: 4.327 ± 0.553
2.086GluPhe: 2.086 ± 0.488
5.795GluGly: 5.795 ± 0.437
1.159GluHis: 1.159 ± 0.271
2.241GluIle: 2.241 ± 0.531
2.859GluLys: 2.859 ± 0.555
5.95GluLeu: 5.95 ± 0.663
1.623GluMet: 1.623 ± 0.281
1.7GluAsn: 1.7 ± 0.473
1.314GluPro: 1.314 ± 0.288
3.168GluGln: 3.168 ± 0.517
3.786GluArg: 3.786 ± 0.498
2.782GluSer: 2.782 ± 0.367
2.782GluThr: 2.782 ± 0.444
3.863GluVal: 3.863 ± 0.618
1.082GluTrp: 1.082 ± 0.331
2.086GluTyr: 2.086 ± 0.465
0.0GluXaa: 0.0 ± 0.0
Phe
2.782PheAla: 2.782 ± 0.386
0.386PheCys: 0.386 ± 0.156
2.55PheAsp: 2.55 ± 0.471
2.163PheGlu: 2.163 ± 0.372
1.082PhePhe: 1.082 ± 0.277
2.782PheGly: 2.782 ± 0.452
0.695PheHis: 0.695 ± 0.253
2.163PheIle: 2.163 ± 0.411
2.395PheLys: 2.395 ± 0.416
3.323PheLeu: 3.323 ± 0.341
1.082PheMet: 1.082 ± 0.259
1.545PheAsn: 1.545 ± 0.383
1.468PhePro: 1.468 ± 0.246
1.391PheGln: 1.391 ± 0.272
2.318PheArg: 2.318 ± 0.442
2.241PheSer: 2.241 ± 0.386
2.704PheThr: 2.704 ± 0.451
2.009PheVal: 2.009 ± 0.343
0.386PheTrp: 0.386 ± 0.16
1.159PheTyr: 1.159 ± 0.348
0.0PheXaa: 0.0 ± 0.0
Gly
7.186GlyAla: 7.186 ± 1.152
1.082GlyCys: 1.082 ± 0.324
4.559GlyAsp: 4.559 ± 0.569
3.4GlyGlu: 3.4 ± 0.55
2.859GlyPhe: 2.859 ± 0.506
6.8GlyGly: 6.8 ± 1.011
1.391GlyHis: 1.391 ± 0.368
4.172GlyIle: 4.172 ± 0.527
5.177GlyLys: 5.177 ± 0.813
6.413GlyLeu: 6.413 ± 0.674
1.7GlyMet: 1.7 ± 0.362
3.013GlyAsn: 3.013 ± 0.423
1.391GlyPro: 1.391 ± 0.322
2.782GlyGln: 2.782 ± 0.456
5.022GlyArg: 5.022 ± 0.552
5.641GlySer: 5.641 ± 0.816
5.1GlyThr: 5.1 ± 0.756
5.486GlyVal: 5.486 ± 0.858
1.236GlyTrp: 1.236 ± 0.201
2.163GlyTyr: 2.163 ± 0.442
0.0GlyXaa: 0.0 ± 0.0
His
1.7HisAla: 1.7 ± 0.438
0.155HisCys: 0.155 ± 0.093
1.545HisAsp: 1.545 ± 0.326
0.695HisGlu: 0.695 ± 0.308
0.773HisPhe: 0.773 ± 0.193
1.7HisGly: 1.7 ± 0.412
0.464HisHis: 0.464 ± 0.208
0.541HisIle: 0.541 ± 0.172
1.082HisLys: 1.082 ± 0.301
1.468HisLeu: 1.468 ± 0.316
0.309HisMet: 0.309 ± 0.143
1.004HisAsn: 1.004 ± 0.369
0.773HisPro: 0.773 ± 0.236
1.004HisGln: 1.004 ± 0.256
1.159HisArg: 1.159 ± 0.317
1.468HisSer: 1.468 ± 0.358
1.314HisThr: 1.314 ± 0.322
1.623HisVal: 1.623 ± 0.359
0.232HisTrp: 0.232 ± 0.113
0.927HisTyr: 0.927 ± 0.268
0.0HisXaa: 0.0 ± 0.0
Ile
3.477IleAla: 3.477 ± 0.429
0.077IleCys: 0.077 ± 0.082
2.318IleAsp: 2.318 ± 0.416
2.627IleGlu: 2.627 ± 0.479
1.932IlePhe: 1.932 ± 0.409
4.018IleGly: 4.018 ± 0.58
1.391IleHis: 1.391 ± 0.33
1.777IleIle: 1.777 ± 0.461
2.163IleLys: 2.163 ± 0.515
3.013IleLeu: 3.013 ± 0.532
1.391IleMet: 1.391 ± 0.266
1.932IleAsn: 1.932 ± 0.328
1.854IlePro: 1.854 ± 0.334
2.009IleGln: 2.009 ± 0.415
2.936IleArg: 2.936 ± 0.481
3.323IleSer: 3.323 ± 0.59
3.168IleThr: 3.168 ± 0.534
2.782IleVal: 2.782 ± 0.526
0.618IleTrp: 0.618 ± 0.231
1.545IleTyr: 1.545 ± 0.321
0.0IleXaa: 0.0 ± 0.0
Lys
5.872LysAla: 5.872 ± 0.892
0.232LysCys: 0.232 ± 0.134
3.091LysAsp: 3.091 ± 0.437
3.091LysGlu: 3.091 ± 0.568
1.623LysPhe: 1.623 ± 0.435
4.095LysGly: 4.095 ± 0.45
1.854LysHis: 1.854 ± 0.376
1.777LysIle: 1.777 ± 0.469
3.013LysLys: 3.013 ± 0.719
5.254LysLeu: 5.254 ± 0.786
1.236LysMet: 1.236 ± 0.329
1.236LysAsn: 1.236 ± 0.365
2.704LysPro: 2.704 ± 0.551
2.318LysGln: 2.318 ± 0.378
2.55LysArg: 2.55 ± 0.4
2.55LysSer: 2.55 ± 0.455
3.168LysThr: 3.168 ± 0.422
5.486LysVal: 5.486 ± 0.57
0.695LysTrp: 0.695 ± 0.223
1.932LysTyr: 1.932 ± 0.429
0.0LysXaa: 0.0 ± 0.0
Leu
9.272LeuAla: 9.272 ± 0.987
0.927LeuCys: 0.927 ± 0.368
5.563LeuAsp: 5.563 ± 0.593
4.713LeuGlu: 4.713 ± 0.655
2.241LeuPhe: 2.241 ± 0.426
5.563LeuGly: 5.563 ± 0.58
2.395LeuHis: 2.395 ± 0.41
3.4LeuIle: 3.4 ± 0.57
4.868LeuLys: 4.868 ± 0.783
5.641LeuLeu: 5.641 ± 0.709
2.395LeuMet: 2.395 ± 0.496
3.786LeuAsn: 3.786 ± 0.517
4.095LeuPro: 4.095 ± 0.467
4.095LeuGln: 4.095 ± 0.845
5.022LeuArg: 5.022 ± 0.529
5.486LeuSer: 5.486 ± 0.526
5.331LeuThr: 5.331 ± 0.644
6.645LeuVal: 6.645 ± 0.68
0.773LeuTrp: 0.773 ± 0.274
2.241LeuTyr: 2.241 ± 0.434
0.0LeuXaa: 0.0 ± 0.0
Met
3.709MetAla: 3.709 ± 0.488
0.077MetCys: 0.077 ± 0.092
1.7MetAsp: 1.7 ± 0.378
1.545MetGlu: 1.545 ± 0.383
1.159MetPhe: 1.159 ± 0.306
1.7MetGly: 1.7 ± 0.351
0.541MetHis: 0.541 ± 0.175
0.927MetIle: 0.927 ± 0.206
1.545MetLys: 1.545 ± 0.424
2.473MetLeu: 2.473 ± 0.381
0.695MetMet: 0.695 ± 0.328
1.314MetAsn: 1.314 ± 0.412
1.082MetPro: 1.082 ± 0.298
2.086MetGln: 2.086 ± 0.397
1.7MetArg: 1.7 ± 0.311
2.086MetSer: 2.086 ± 0.459
1.468MetThr: 1.468 ± 0.377
1.777MetVal: 1.777 ± 0.451
0.541MetTrp: 0.541 ± 0.221
1.004MetTyr: 1.004 ± 0.264
0.0MetXaa: 0.0 ± 0.0
Asn
3.554AsnAla: 3.554 ± 0.516
0.464AsnCys: 0.464 ± 0.167
2.395AsnAsp: 2.395 ± 0.376
1.623AsnGlu: 1.623 ± 0.302
1.468AsnPhe: 1.468 ± 0.268
3.091AsnGly: 3.091 ± 0.519
0.695AsnHis: 0.695 ± 0.191
2.55AsnIle: 2.55 ± 0.392
2.086AsnLys: 2.086 ± 0.336
3.013AsnLeu: 3.013 ± 0.51
1.932AsnMet: 1.932 ± 0.378
1.545AsnAsn: 1.545 ± 0.478
1.854AsnPro: 1.854 ± 0.281
1.159AsnGln: 1.159 ± 0.321
3.091AsnArg: 3.091 ± 0.527
2.627AsnSer: 2.627 ± 0.451
2.859AsnThr: 2.859 ± 0.46
2.55AsnVal: 2.55 ± 0.516
0.309AsnTrp: 0.309 ± 0.186
1.854AsnTyr: 1.854 ± 0.447
0.0AsnXaa: 0.0 ± 0.0
Pro
3.323ProAla: 3.323 ± 0.506
0.618ProCys: 0.618 ± 0.24
2.704ProAsp: 2.704 ± 0.487
3.786ProGlu: 3.786 ± 0.513
1.236ProPhe: 1.236 ± 0.373
3.013ProGly: 3.013 ± 0.362
0.618ProHis: 0.618 ± 0.204
2.241ProIle: 2.241 ± 0.392
1.391ProLys: 1.391 ± 0.385
3.554ProLeu: 3.554 ± 0.441
0.927ProMet: 0.927 ± 0.247
2.163ProAsn: 2.163 ± 0.401
1.623ProPro: 1.623 ± 0.49
1.777ProGln: 1.777 ± 0.427
1.004ProArg: 1.004 ± 0.279
1.7ProSer: 1.7 ± 0.358
2.936ProThr: 2.936 ± 0.483
2.704ProVal: 2.704 ± 0.429
0.541ProTrp: 0.541 ± 0.23
1.7ProTyr: 1.7 ± 0.359
0.0ProXaa: 0.0 ± 0.0
Gln
4.945GlnAla: 4.945 ± 0.56
0.232GlnCys: 0.232 ± 0.13
2.163GlnAsp: 2.163 ± 0.382
3.091GlnGlu: 3.091 ± 0.536
1.854GlnPhe: 1.854 ± 0.373
2.936GlnGly: 2.936 ± 0.539
1.236GlnHis: 1.236 ± 0.385
1.854GlnIle: 1.854 ± 0.497
2.163GlnLys: 2.163 ± 0.398
4.018GlnLeu: 4.018 ± 0.473
1.545GlnMet: 1.545 ± 0.283
1.468GlnAsn: 1.468 ± 0.393
0.773GlnPro: 0.773 ± 0.257
2.627GlnGln: 2.627 ± 0.495
2.241GlnArg: 2.241 ± 0.41
2.395GlnSer: 2.395 ± 0.485
1.932GlnThr: 1.932 ± 0.416
3.4GlnVal: 3.4 ± 0.641
0.85GlnTrp: 0.85 ± 0.252
2.009GlnTyr: 2.009 ± 0.45
0.0GlnXaa: 0.0 ± 0.0
Arg
4.482ArgAla: 4.482 ± 0.766
0.386ArgCys: 0.386 ± 0.171
3.168ArgAsp: 3.168 ± 0.44
4.327ArgGlu: 4.327 ± 0.541
2.704ArgPhe: 2.704 ± 0.357
3.554ArgGly: 3.554 ± 0.445
1.082ArgHis: 1.082 ± 0.301
2.704ArgIle: 2.704 ± 0.525
2.627ArgLys: 2.627 ± 0.415
5.177ArgLeu: 5.177 ± 0.568
2.086ArgMet: 2.086 ± 0.332
2.318ArgAsn: 2.318 ± 0.388
2.55ArgPro: 2.55 ± 0.402
2.163ArgGln: 2.163 ± 0.49
4.095ArgArg: 4.095 ± 0.711
3.091ArgSer: 3.091 ± 0.536
3.554ArgThr: 3.554 ± 0.626
3.632ArgVal: 3.632 ± 0.424
0.695ArgTrp: 0.695 ± 0.285
2.473ArgTyr: 2.473 ± 0.383
0.0ArgXaa: 0.0 ± 0.0
Ser
7.186SerAla: 7.186 ± 0.772
0.386SerCys: 0.386 ± 0.142
3.245SerAsp: 3.245 ± 0.593
3.632SerGlu: 3.632 ± 0.507
2.086SerPhe: 2.086 ± 0.367
4.713SerGly: 4.713 ± 0.504
0.773SerHis: 0.773 ± 0.206
1.854SerIle: 1.854 ± 0.444
4.327SerLys: 4.327 ± 0.57
5.022SerLeu: 5.022 ± 0.67
1.777SerMet: 1.777 ± 0.393
2.859SerAsn: 2.859 ± 0.54
3.323SerPro: 3.323 ± 0.548
3.168SerGln: 3.168 ± 0.498
2.782SerArg: 2.782 ± 0.472
3.941SerSer: 3.941 ± 0.512
3.941SerThr: 3.941 ± 0.502
5.331SerVal: 5.331 ± 0.669
1.004SerTrp: 1.004 ± 0.255
1.623SerTyr: 1.623 ± 0.472
0.0SerXaa: 0.0 ± 0.0
Thr
5.254ThrAla: 5.254 ± 0.663
0.541ThrCys: 0.541 ± 0.215
3.632ThrAsp: 3.632 ± 0.569
3.941ThrGlu: 3.941 ± 0.605
2.241ThrPhe: 2.241 ± 0.478
5.1ThrGly: 5.1 ± 0.667
1.314ThrHis: 1.314 ± 0.314
2.936ThrIle: 2.936 ± 0.339
3.013ThrLys: 3.013 ± 0.452
5.022ThrLeu: 5.022 ± 0.584
2.163ThrMet: 2.163 ± 0.509
1.623ThrAsn: 1.623 ± 0.401
3.168ThrPro: 3.168 ± 0.528
2.627ThrGln: 2.627 ± 0.485
3.4ThrArg: 3.4 ± 0.426
4.636ThrSer: 4.636 ± 0.652
2.936ThrThr: 2.936 ± 0.793
3.863ThrVal: 3.863 ± 0.629
0.541ThrTrp: 0.541 ± 0.202
2.395ThrTyr: 2.395 ± 0.497
0.0ThrXaa: 0.0 ± 0.0
Val
6.413ValAla: 6.413 ± 0.688
0.85ValCys: 0.85 ± 0.235
4.095ValAsp: 4.095 ± 0.417
4.018ValGlu: 4.018 ± 0.593
2.473ValPhe: 2.473 ± 0.452
5.254ValGly: 5.254 ± 0.712
1.082ValHis: 1.082 ± 0.287
3.4ValIle: 3.4 ± 0.599
3.4ValLys: 3.4 ± 0.565
6.336ValLeu: 6.336 ± 0.683
1.932ValMet: 1.932 ± 0.323
3.863ValAsn: 3.863 ± 0.54
2.936ValPro: 2.936 ± 0.471
3.013ValGln: 3.013 ± 0.398
4.559ValArg: 4.559 ± 0.638
5.177ValSer: 5.177 ± 0.741
5.254ValThr: 5.254 ± 0.633
4.327ValVal: 4.327 ± 0.723
0.618ValTrp: 0.618 ± 0.209
2.704ValTyr: 2.704 ± 0.607
0.0ValXaa: 0.0 ± 0.0
Trp
1.082TrpAla: 1.082 ± 0.294
0.077TrpCys: 0.077 ± 0.08
1.082TrpAsp: 1.082 ± 0.351
0.773TrpGlu: 0.773 ± 0.293
1.236TrpPhe: 1.236 ± 0.264
0.386TrpGly: 0.386 ± 0.158
0.155TrpHis: 0.155 ± 0.084
0.309TrpIle: 0.309 ± 0.132
0.85TrpLys: 0.85 ± 0.266
2.086TrpLeu: 2.086 ± 0.433
0.077TrpMet: 0.077 ± 0.076
0.155TrpAsn: 0.155 ± 0.115
0.309TrpPro: 0.309 ± 0.135
0.232TrpGln: 0.232 ± 0.11
0.773TrpArg: 0.773 ± 0.247
1.004TrpSer: 1.004 ± 0.235
0.541TrpThr: 0.541 ± 0.177
1.236TrpVal: 1.236 ± 0.266
0.618TrpTrp: 0.618 ± 0.177
0.386TrpTyr: 0.386 ± 0.148
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.091TyrAla: 3.091 ± 0.456
0.695TyrCys: 0.695 ± 0.276
3.013TyrAsp: 3.013 ± 0.522
1.932TyrGlu: 1.932 ± 0.351
1.159TyrPhe: 1.159 ± 0.274
2.936TyrGly: 2.936 ± 0.364
0.386TyrHis: 0.386 ± 0.179
1.854TyrIle: 1.854 ± 0.376
1.545TyrLys: 1.545 ± 0.366
2.859TyrLeu: 2.859 ± 0.429
0.695TyrMet: 0.695 ± 0.171
1.932TyrAsn: 1.932 ± 0.553
2.163TyrPro: 2.163 ± 0.4
2.163TyrGln: 2.163 ± 0.441
2.395TyrArg: 2.395 ± 0.299
2.163TyrSer: 2.163 ± 0.344
1.932TyrThr: 1.932 ± 0.529
2.395TyrVal: 2.395 ± 0.398
0.464TyrTrp: 0.464 ± 0.211
0.695TyrTyr: 0.695 ± 0.244
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 47 proteins (12943 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski