Amino acid dipeptide frequency for Croceibacter phage P2559S

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
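As an illustration of the idea, the following is a minimal sketch of how dipeptide frequencies could be computed from a list of protein sequences. It is not the database's own code. The function name `dipeptide_frequencies`, the use of one-letter codes (with `X` standing for Xaa), the counting of overlapping pairs, and the normalisation by the total number of counted pairs are all assumptions made for this example; the normalisation used for the table may differ.

```python
from collections import Counter

# The 20 standard residues in one-letter code; anything else is treated as Xaa.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumptions of this sketch: overlapping pairs are counted, and the
    denominator is the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Pairs are taken within one protein only, never across two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Two toy sequences (hypothetical data, not taken from the phage proteome).
freqs = dipeptide_frequencies(["MKKLA", "MKA"])
```

Here `freqs["MK"]` is the per mille frequency of the Met-Lys pair, which corresponds to the MetLys entry in the three-letter notation used in the table.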

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
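The uniform baseline described above can be written down in a few lines. This is only a sketch: the function names are hypothetical, and the simple comparison against the baseline ignores the reported error, which the page's colouring may or may not take into account.

```python
def expected_uniform_permille(n_residues=20):
    """Expected frequency (per mille) if all dipeptides were equally likely."""
    return 1000.0 / (n_residues ** 2)

def classify(freq_permille, n_residues=20):
    """Label a dipeptide relative to the uniform baseline."""
    expected = expected_uniform_permille(n_residues)
    if freq_permille > expected:
        return "over"       # would be marked in red
    if freq_permille < expected:
        return "under"      # would be marked in blue
    return "expected"

print(expected_uniform_permille())   # 2.5 (i.e. 0.25%)
print(classify(8.274))               # over  (AlaAla in the table)
print(classify(0.21))                # under (AlaCys in the table)
```

With 21 residue types (Xaa included) the baseline becomes 1000/441, roughly 2.27‰.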

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
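For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are hypothetical and exist only for this example.

```python
def per_mille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def per_mille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

# AlaAla is listed as 8.274 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(8.274)   # about 0.008274
ala_ala_percent = per_mille_to_percent(8.274)     # about 0.8274
```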

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.274 ± 1.058
AlaCys: 0.21 ± 0.102
AlaAsp: 6.311 ± 0.801
AlaGlu: 6.101 ± 0.904
AlaPhe: 3.787 ± 0.579
AlaGly: 6.311 ± 0.653
AlaHis: 1.122 ± 0.282
AlaIle: 5.68 ± 0.659
AlaLys: 5.049 ± 0.799
AlaLeu: 6.451 ± 0.733
AlaMet: 1.473 ± 0.369
AlaAsn: 4.277 ± 0.574
AlaPro: 2.174 ± 0.409
AlaGln: 5.189 ± 0.547
AlaArg: 2.104 ± 0.457
AlaSer: 3.366 ± 0.711
AlaThr: 4.908 ± 0.799
AlaVal: 6.662 ± 0.802
AlaTrp: 0.841 ± 0.194
AlaTyr: 2.454 ± 0.393
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.351 ± 0.16
CysCys: 0.14 ± 0.134
CysAsp: 0.912 ± 0.26
CysGlu: 0.771 ± 0.252
CysPhe: 0.351 ± 0.179
CysGly: 0.28 ± 0.156
CysHis: 0.14 ± 0.097
CysIle: 0.421 ± 0.226
CysLys: 0.912 ± 0.237
CysLeu: 0.771 ± 0.302
CysMet: 0.28 ± 0.151
CysAsn: 0.351 ± 0.153
CysPro: 0.421 ± 0.189
CysGln: 0.28 ± 0.139
CysArg: 0.421 ± 0.163
CysSer: 0.561 ± 0.205
CysThr: 0.28 ± 0.158
CysVal: 0.491 ± 0.164
CysTrp: 0.21 ± 0.117
CysTyr: 0.561 ± 0.173
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.908 ± 0.532
AspCys: 0.491 ± 0.167
AspAsp: 3.787 ± 0.567
AspGlu: 4.277 ± 0.543
AspPhe: 2.594 ± 0.419
AspGly: 4.067 ± 0.414
AspHis: 0.21 ± 0.099
AspIle: 4.277 ± 0.539
AspLys: 4.838 ± 0.391
AspLeu: 6.381 ± 0.493
AspMet: 1.823 ± 0.34
AspAsn: 3.155 ± 0.361
AspPro: 1.753 ± 0.372
AspGln: 1.332 ± 0.358
AspArg: 2.104 ± 0.391
AspSer: 2.104 ± 0.4
AspThr: 4.558 ± 0.51
AspVal: 4.277 ± 0.588
AspTrp: 0.912 ± 0.236
AspTyr: 3.506 ± 0.587
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.329 ± 0.579
GluCys: 0.631 ± 0.197
GluAsp: 1.963 ± 0.378
GluGlu: 3.506 ± 0.572
GluPhe: 3.155 ± 0.389
GluGly: 2.524 ± 0.419
GluHis: 1.332 ± 0.341
GluIle: 5.61 ± 0.627
GluLys: 7.573 ± 1.015
GluLeu: 5.75 ± 0.53
GluMet: 2.454 ± 0.449
GluAsn: 3.927 ± 0.574
GluPro: 1.543 ± 0.299
GluGln: 4.277 ± 0.566
GluArg: 3.857 ± 0.644
GluSer: 3.646 ± 0.461
GluThr: 3.436 ± 0.438
GluVal: 4.838 ± 0.513
GluTrp: 0.982 ± 0.236
GluTyr: 3.296 ± 0.404
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.226 ± 0.446
PheCys: 0.491 ± 0.182
PheAsp: 2.875 ± 0.51
PheGlu: 3.015 ± 0.425
PhePhe: 1.613 ± 0.353
PheGly: 3.226 ± 0.39
PheHis: 0.351 ± 0.163
PheIle: 2.875 ± 0.386
PheLys: 3.296 ± 0.566
PheLeu: 2.244 ± 0.363
PheMet: 0.771 ± 0.208
PheAsn: 3.085 ± 0.489
PhePro: 0.912 ± 0.298
PheGln: 0.701 ± 0.172
PheArg: 0.561 ± 0.231
PheSer: 2.314 ± 0.275
PheThr: 3.296 ± 0.556
PheVal: 2.244 ± 0.321
PheTrp: 0.631 ± 0.218
PheTyr: 2.034 ± 0.383
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.469 ± 0.644
GlyCys: 0.631 ± 0.225
GlyAsp: 3.997 ± 0.571
GlyGlu: 3.576 ± 0.461
GlyPhe: 2.524 ± 0.399
GlyGly: 5.329 ± 0.923
GlyHis: 0.982 ± 0.289
GlyIle: 3.646 ± 0.517
GlyLys: 6.101 ± 0.812
GlyLeu: 5.68 ± 0.749
GlyMet: 1.262 ± 0.267
GlyAsn: 3.155 ± 0.884
GlyPro: 0.0 ± 0.0
GlyGln: 3.296 ± 0.477
GlyArg: 2.454 ± 0.351
GlySer: 3.576 ± 0.476
GlyThr: 4.137 ± 0.726
GlyVal: 4.908 ± 0.463
GlyTrp: 1.473 ± 0.319
GlyTyr: 2.805 ± 0.435
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.771 ± 0.239
HisCys: 0.0 ± 0.0
HisAsp: 1.473 ± 0.307
HisGlu: 0.771 ± 0.251
HisPhe: 1.192 ± 0.3
HisGly: 0.982 ± 0.219
HisHis: 0.491 ± 0.19
HisIle: 0.841 ± 0.278
HisLys: 0.982 ± 0.3
HisLeu: 1.052 ± 0.28
HisMet: 0.07 ± 0.077
HisAsn: 0.912 ± 0.249
HisPro: 0.841 ± 0.198
HisGln: 0.561 ± 0.2
HisArg: 1.052 ± 0.253
HisSer: 0.841 ± 0.219
HisThr: 1.262 ± 0.327
HisVal: 0.841 ± 0.183
HisTrp: 0.14 ± 0.094
HisTyr: 0.631 ± 0.216
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.049 ± 0.662
IleCys: 0.841 ± 0.284
IleAsp: 4.277 ± 0.56
IleGlu: 4.908 ± 0.694
IlePhe: 1.332 ± 0.37
IleGly: 3.716 ± 0.534
IleHis: 0.912 ± 0.269
IleIle: 4.207 ± 0.727
IleLys: 7.222 ± 0.716
IleLeu: 4.418 ± 0.534
IleMet: 1.262 ± 0.281
IleAsn: 3.997 ± 0.564
IlePro: 1.753 ± 0.285
IleGln: 2.524 ± 0.412
IleArg: 1.893 ± 0.436
IleSer: 3.857 ± 0.47
IleThr: 4.838 ± 0.567
IleVal: 2.805 ± 0.367
IleTrp: 0.491 ± 0.161
IleTyr: 2.454 ± 0.422
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.695 ± 0.975
LysCys: 0.21 ± 0.11
LysAsp: 5.61 ± 0.604
LysGlu: 7.293 ± 0.902
LysPhe: 2.244 ± 0.416
LysGly: 4.558 ± 0.647
LysHis: 1.613 ± 0.401
LysIle: 3.997 ± 0.428
LysLys: 7.082 ± 0.939
LysLeu: 7.433 ± 0.717
LysMet: 2.174 ± 0.532
LysAsn: 4.137 ± 0.642
LysPro: 2.945 ± 0.564
LysGln: 5.259 ± 0.642
LysArg: 3.927 ± 0.553
LysSer: 4.908 ± 0.697
LysThr: 5.189 ± 0.614
LysVal: 3.716 ± 0.538
LysTrp: 1.262 ± 0.292
LysTyr: 3.155 ± 0.485
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.189 ± 0.722
LeuCys: 0.982 ± 0.259
LeuAsp: 4.908 ± 0.631
LeuGlu: 5.399 ± 0.609
LeuPhe: 2.735 ± 0.359
LeuGly: 4.838 ± 0.57
LeuHis: 2.174 ± 0.4
LeuIle: 5.61 ± 0.719
LeuLys: 8.204 ± 0.988
LeuLeu: 6.03 ± 0.657
LeuMet: 1.823 ± 0.385
LeuAsn: 5.469 ± 0.458
LeuPro: 3.436 ± 0.561
LeuGln: 3.716 ± 0.533
LeuArg: 3.576 ± 0.436
LeuSer: 5.329 ± 0.615
LeuThr: 5.329 ± 0.739
LeuVal: 4.137 ± 0.6
LeuTrp: 0.841 ± 0.248
LeuTyr: 2.805 ± 0.399
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.174 ± 0.469
MetCys: 0.14 ± 0.102
MetAsp: 1.052 ± 0.272
MetGlu: 2.314 ± 0.482
MetPhe: 0.491 ± 0.175
MetGly: 1.052 ± 0.246
MetHis: 0.491 ± 0.19
MetIle: 1.052 ± 0.275
MetLys: 1.543 ± 0.323
MetLeu: 2.594 ± 0.404
MetMet: 0.491 ± 0.191
MetAsn: 1.122 ± 0.312
MetPro: 1.332 ± 0.23
MetGln: 2.665 ± 0.424
MetArg: 1.052 ± 0.331
MetSer: 0.701 ± 0.251
MetThr: 1.052 ± 0.27
MetVal: 0.841 ± 0.185
MetTrp: 0.351 ± 0.155
MetTyr: 0.491 ± 0.191
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.908 ± 0.618
AsnCys: 0.771 ± 0.246
AsnAsp: 2.665 ± 0.36
AsnGlu: 4.838 ± 0.635
AsnPhe: 2.875 ± 0.41
AsnGly: 4.137 ± 0.849
AsnHis: 0.701 ± 0.228
AsnIle: 3.226 ± 0.479
AsnLys: 3.927 ± 0.502
AsnLeu: 4.067 ± 0.426
AsnMet: 1.402 ± 0.316
AsnAsn: 3.576 ± 0.597
AsnPro: 2.524 ± 0.512
AsnGln: 1.963 ± 0.302
AsnArg: 2.314 ± 0.355
AsnSer: 3.015 ± 0.381
AsnThr: 3.576 ± 0.529
AsnVal: 3.085 ± 0.447
AsnTrp: 0.841 ± 0.23
AsnTyr: 1.963 ± 0.423
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.997 ± 0.668
ProCys: 0.421 ± 0.207
ProAsp: 1.753 ± 0.258
ProGlu: 1.893 ± 0.334
ProPhe: 2.034 ± 0.394
ProGly: 0.841 ± 0.192
ProHis: 0.631 ± 0.202
ProIle: 2.454 ± 0.398
ProLys: 2.735 ± 0.459
ProLeu: 2.384 ± 0.402
ProMet: 0.561 ± 0.199
ProAsn: 1.543 ± 0.346
ProPro: 0.982 ± 0.285
ProGln: 1.613 ± 0.369
ProArg: 1.332 ± 0.247
ProSer: 0.982 ± 0.26
ProThr: 1.613 ± 0.365
ProVal: 2.104 ± 0.669
ProTrp: 0.351 ± 0.155
ProTyr: 1.473 ± 0.371
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.277 ± 0.619
GlnCys: 0.14 ± 0.11
GlnAsp: 3.436 ± 0.509
GlnGlu: 3.015 ± 0.473
GlnPhe: 1.753 ± 0.371
GlnGly: 3.436 ± 0.413
GlnHis: 0.701 ± 0.277
GlnIle: 2.805 ± 0.459
GlnLys: 3.366 ± 0.442
GlnLeu: 5.61 ± 0.673
GlnMet: 1.262 ± 0.31
GlnAsn: 2.454 ± 0.37
GlnPro: 2.314 ± 0.51
GlnGln: 2.875 ± 0.578
GlnArg: 2.314 ± 0.452
GlnSer: 1.683 ± 0.414
GlnThr: 1.893 ± 0.389
GlnVal: 3.155 ± 0.484
GlnTrp: 0.351 ± 0.157
GlnTyr: 1.753 ± 0.36
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.015 ± 0.377
ArgCys: 0.07 ± 0.077
ArgAsp: 1.753 ± 0.306
ArgGlu: 2.735 ± 0.417
ArgPhe: 1.753 ± 0.328
ArgGly: 2.594 ± 0.476
ArgHis: 0.561 ± 0.196
ArgIle: 2.945 ± 0.461
ArgLys: 4.277 ± 0.631
ArgLeu: 4.628 ± 0.531
ArgMet: 0.912 ± 0.313
ArgAsn: 2.034 ± 0.383
ArgPro: 0.701 ± 0.167
ArgGln: 2.104 ± 0.53
ArgArg: 1.753 ± 0.376
ArgSer: 1.893 ± 0.291
ArgThr: 2.594 ± 0.382
ArgVal: 2.524 ± 0.398
ArgTrp: 0.21 ± 0.129
ArgTyr: 1.893 ± 0.329
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.226 ± 0.42
SerCys: 0.631 ± 0.222
SerAsp: 3.436 ± 0.519
SerGlu: 2.875 ± 0.444
SerPhe: 2.384 ± 0.483
SerGly: 4.418 ± 0.511
SerHis: 0.841 ± 0.215
SerIle: 2.735 ± 0.331
SerLys: 4.137 ± 0.597
SerLeu: 3.436 ± 0.443
SerMet: 0.982 ± 0.277
SerAsn: 3.927 ± 0.611
SerPro: 1.893 ± 0.326
SerGln: 2.524 ± 0.311
SerArg: 2.735 ± 0.43
SerSer: 2.945 ± 0.371
SerThr: 3.296 ± 0.455
SerVal: 3.015 ± 0.371
SerTrp: 0.982 ± 0.314
SerTyr: 1.963 ± 0.31
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.732 ± 0.985
ThrCys: 0.28 ± 0.12
ThrAsp: 4.207 ± 0.508
ThrGlu: 4.348 ± 0.572
ThrPhe: 2.735 ± 0.44
ThrGly: 4.908 ± 0.543
ThrHis: 0.561 ± 0.203
ThrIle: 3.857 ± 0.463
ThrLys: 4.348 ± 0.459
ThrLeu: 4.277 ± 0.588
ThrMet: 0.912 ± 0.327
ThrAsn: 2.454 ± 0.485
ThrPro: 2.384 ± 0.516
ThrGln: 2.524 ± 0.397
ThrArg: 2.665 ± 0.423
ThrSer: 3.436 ± 0.538
ThrThr: 2.524 ± 0.378
ThrVal: 3.716 ± 0.379
ThrTrp: 0.631 ± 0.226
ThrTyr: 3.155 ± 0.609
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.436 ± 0.547
ValCys: 0.841 ± 0.247
ValAsp: 3.085 ± 0.57
ValGlu: 3.927 ± 0.582
ValPhe: 2.174 ± 0.313
ValGly: 4.207 ± 0.713
ValHis: 0.701 ± 0.209
ValIle: 3.296 ± 0.438
ValLys: 5.399 ± 0.737
ValLeu: 5.259 ± 0.547
ValMet: 1.893 ± 0.312
ValAsn: 3.716 ± 0.532
ValPro: 2.454 ± 0.426
ValGln: 2.665 ± 0.433
ValArg: 2.314 ± 0.474
ValSer: 4.067 ± 0.636
ValThr: 3.997 ± 0.8
ValVal: 2.524 ± 0.374
ValTrp: 0.491 ± 0.179
ValTyr: 2.174 ± 0.496
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.771 ± 0.197
TrpCys: 0.07 ± 0.068
TrpAsp: 0.771 ± 0.189
TrpGlu: 1.262 ± 0.293
TrpPhe: 0.491 ± 0.158
TrpGly: 0.771 ± 0.207
TrpHis: 0.491 ± 0.14
TrpIle: 1.052 ± 0.306
TrpLys: 1.052 ± 0.268
TrpLeu: 1.262 ± 0.286
TrpMet: 0.561 ± 0.196
TrpAsn: 0.631 ± 0.228
TrpPro: 0.07 ± 0.069
TrpGln: 0.561 ± 0.198
TrpArg: 0.701 ± 0.196
TrpSer: 0.701 ± 0.208
TrpThr: 0.421 ± 0.17
TrpVal: 0.912 ± 0.246
TrpTrp: 0.14 ± 0.103
TrpTyr: 0.28 ± 0.128
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.945 ± 0.423
TyrCys: 0.912 ± 0.228
TyrAsp: 3.085 ± 0.63
TyrGlu: 2.805 ± 0.457
TyrPhe: 1.543 ± 0.282
TyrGly: 2.805 ± 0.391
TyrHis: 0.701 ± 0.201
TyrIle: 2.174 ± 0.475
TyrLys: 3.366 ± 0.541
TyrLeu: 2.875 ± 0.391
TyrMet: 0.561 ± 0.172
TyrAsn: 2.454 ± 0.452
TyrPro: 1.262 ± 0.279
TyrGln: 1.963 ± 0.425
TyrArg: 1.823 ± 0.434
TyrSer: 2.454 ± 0.371
TyrThr: 2.454 ± 0.454
TyrVal: 1.823 ± 0.364
TyrTrp: 0.841 ± 0.235
TyrTyr: 2.104 ± 0.42
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (14262 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
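To make the note above concrete, here is a minimal sketch of a protein-level bootstrap. The source states only that the error comes from bootstrapping (x100) at the protein level; taking the standard deviation of the replicate estimates as the error, the fixed seed, and the function names are assumptions of this example, not a description of the database's code.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins, pair):
    """Per mille frequency of one dipeptide (one-letter codes) in a protein set."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the reported error is the standard deviation of the
    replicate estimates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences (hypothetical data, not taken from the phage proteome).
toy = ["MKKLA", "MKA", "AAKK", "GGMK"]
err_mk = bootstrap_error(toy, "MK")
err_ww = bootstrap_error(toy, "WW")
```

A dipeptide that never occurs has a frequency of zero in every replicate, so its estimated error is also zero, which matches the 0.0 ± 0.0 entries in the table.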

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski