Amino acid dipepetide frequency for Pseudomonas phage ZC08

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.296AlaAla: 6.296 ± 0.631
0.643AlaCys: 0.643 ± 0.174
3.447AlaAsp: 3.447 ± 0.36
5.193AlaGlu: 5.193 ± 0.516
2.895AlaPhe: 2.895 ± 0.321
4.09AlaGly: 4.09 ± 0.575
1.103AlaHis: 1.103 ± 0.209
5.423AlaIle: 5.423 ± 0.422
4.274AlaLys: 4.274 ± 0.428
6.802AlaLeu: 6.802 ± 0.518
1.838AlaMet: 1.838 ± 0.3
3.171AlaAsn: 3.171 ± 0.497
2.62AlaPro: 2.62 ± 0.379
3.217AlaGln: 3.217 ± 0.415
3.769AlaArg: 3.769 ± 0.425
4.412AlaSer: 4.412 ± 0.58
3.769AlaThr: 3.769 ± 0.396
4.963AlaVal: 4.963 ± 0.552
1.057AlaTrp: 1.057 ± 0.216
2.528AlaTyr: 2.528 ± 0.341
0.046AlaXaa: 0.046 ± 0.045
Cys
0.506CysAla: 0.506 ± 0.174
0.046CysCys: 0.046 ± 0.052
0.506CysAsp: 0.506 ± 0.184
0.46CysGlu: 0.46 ± 0.196
0.414CysPhe: 0.414 ± 0.146
0.368CysGly: 0.368 ± 0.162
0.138CysHis: 0.138 ± 0.079
0.276CysIle: 0.276 ± 0.123
0.781CysLys: 0.781 ± 0.22
0.919CysLeu: 0.919 ± 0.27
0.23CysMet: 0.23 ± 0.13
0.46CysAsn: 0.46 ± 0.174
0.46CysPro: 0.46 ± 0.173
0.092CysGln: 0.092 ± 0.075
0.735CysArg: 0.735 ± 0.219
0.781CysSer: 0.781 ± 0.271
0.322CysThr: 0.322 ± 0.143
0.184CysVal: 0.184 ± 0.093
0.184CysTrp: 0.184 ± 0.097
0.184CysTyr: 0.184 ± 0.101
0.0CysXaa: 0.0 ± 0.0
Asp
5.147AspAla: 5.147 ± 0.405
0.551AspCys: 0.551 ± 0.195
2.849AspAsp: 2.849 ± 0.371
4.32AspGlu: 4.32 ± 0.435
2.298AspPhe: 2.298 ± 0.356
3.493AspGly: 3.493 ± 0.476
1.149AspHis: 1.149 ± 0.294
4.412AspIle: 4.412 ± 0.44
2.941AspLys: 2.941 ± 0.355
5.423AspLeu: 5.423 ± 0.602
1.287AspMet: 1.287 ± 0.28
2.895AspAsn: 2.895 ± 0.297
3.401AspPro: 3.401 ± 0.35
2.482AspGln: 2.482 ± 0.321
2.298AspArg: 2.298 ± 0.315
4.228AspSer: 4.228 ± 0.381
2.666AspThr: 2.666 ± 0.298
4.412AspVal: 4.412 ± 0.387
0.827AspTrp: 0.827 ± 0.207
2.206AspTyr: 2.206 ± 0.26
0.0AspXaa: 0.0 ± 0.0
Glu
6.664GluAla: 6.664 ± 0.636
0.873GluCys: 0.873 ± 0.286
4.044GluAsp: 4.044 ± 0.541
7.307GluGlu: 7.307 ± 1.199
3.493GluPhe: 3.493 ± 0.467
3.86GluGly: 3.86 ± 0.416
1.792GluHis: 1.792 ± 0.291
5.147GluIle: 5.147 ± 0.466
4.918GluLys: 4.918 ± 0.522
6.296GluLeu: 6.296 ± 0.585
1.884GluMet: 1.884 ± 0.296
4.366GluAsn: 4.366 ± 0.486
3.677GluPro: 3.677 ± 0.554
2.803GluGln: 2.803 ± 0.593
2.528GluArg: 2.528 ± 0.276
4.642GluSer: 4.642 ± 0.787
4.688GluThr: 4.688 ± 0.544
5.101GluVal: 5.101 ± 0.477
1.103GluTrp: 1.103 ± 0.254
2.987GluTyr: 2.987 ± 0.346
0.0GluXaa: 0.0 ± 0.0
Phe
2.022PheAla: 2.022 ± 0.286
0.23PheCys: 0.23 ± 0.135
2.803PheAsp: 2.803 ± 0.355
2.252PheGlu: 2.252 ± 0.32
1.333PhePhe: 1.333 ± 0.277
2.436PheGly: 2.436 ± 0.309
0.735PheHis: 0.735 ± 0.177
2.528PheIle: 2.528 ± 0.293
3.125PheLys: 3.125 ± 0.362
3.401PheLeu: 3.401 ± 0.487
1.057PheMet: 1.057 ± 0.229
2.344PheAsn: 2.344 ± 0.313
1.425PhePro: 1.425 ± 0.254
1.379PheGln: 1.379 ± 0.232
2.482PheArg: 2.482 ± 0.272
2.482PheSer: 2.482 ± 0.395
2.436PheThr: 2.436 ± 0.338
2.298PheVal: 2.298 ± 0.436
0.322PheTrp: 0.322 ± 0.137
1.746PheTyr: 1.746 ± 0.378
0.0PheXaa: 0.0 ± 0.0
Gly
3.263GlyAla: 3.263 ± 0.436
0.322GlyCys: 0.322 ± 0.139
2.895GlyAsp: 2.895 ± 0.464
4.09GlyGlu: 4.09 ± 0.457
3.125GlyPhe: 3.125 ± 0.366
3.86GlyGly: 3.86 ± 0.611
0.643GlyHis: 0.643 ± 0.18
4.044GlyIle: 4.044 ± 0.415
4.32GlyLys: 4.32 ± 0.417
4.642GlyLeu: 4.642 ± 0.582
2.068GlyMet: 2.068 ± 0.302
3.723GlyAsn: 3.723 ± 0.311
1.333GlyPro: 1.333 ± 0.245
1.884GlyGln: 1.884 ± 0.256
3.355GlyArg: 3.355 ± 0.381
5.377GlySer: 5.377 ± 0.586
3.815GlyThr: 3.815 ± 0.447
3.952GlyVal: 3.952 ± 0.416
0.781GlyTrp: 0.781 ± 0.258
2.574GlyTyr: 2.574 ± 0.383
0.0GlyXaa: 0.0 ± 0.0
His
1.425HisAla: 1.425 ± 0.249
0.184HisCys: 0.184 ± 0.09
0.965HisAsp: 0.965 ± 0.243
1.654HisGlu: 1.654 ± 0.297
0.919HisPhe: 0.919 ± 0.223
0.873HisGly: 0.873 ± 0.173
0.414HisHis: 0.414 ± 0.129
1.379HisIle: 1.379 ± 0.237
0.735HisLys: 0.735 ± 0.195
1.792HisLeu: 1.792 ± 0.379
0.414HisMet: 0.414 ± 0.128
0.873HisAsn: 0.873 ± 0.195
1.011HisPro: 1.011 ± 0.221
0.551HisGln: 0.551 ± 0.148
1.011HisArg: 1.011 ± 0.196
1.011HisSer: 1.011 ± 0.195
0.735HisThr: 0.735 ± 0.18
1.011HisVal: 1.011 ± 0.232
0.322HisTrp: 0.322 ± 0.095
0.827HisTyr: 0.827 ± 0.227
0.0HisXaa: 0.0 ± 0.0
Ile
4.412IleAla: 4.412 ± 0.467
0.643IleCys: 0.643 ± 0.237
4.044IleAsp: 4.044 ± 0.462
5.423IleGlu: 5.423 ± 0.449
1.838IlePhe: 1.838 ± 0.23
3.631IleGly: 3.631 ± 0.634
1.471IleHis: 1.471 ± 0.272
2.849IleIle: 2.849 ± 0.444
4.918IleLys: 4.918 ± 0.62
5.929IleLeu: 5.929 ± 0.502
0.827IleMet: 0.827 ± 0.189
4.274IleAsn: 4.274 ± 0.361
3.079IlePro: 3.079 ± 0.31
2.803IleGln: 2.803 ± 0.239
3.677IleArg: 3.677 ± 0.384
4.504IleSer: 4.504 ± 0.466
3.86IleThr: 3.86 ± 0.617
3.217IleVal: 3.217 ± 0.42
0.781IleTrp: 0.781 ± 0.219
1.838IleTyr: 1.838 ± 0.252
0.0IleXaa: 0.0 ± 0.0
Lys
5.147LysAla: 5.147 ± 0.49
0.414LysCys: 0.414 ± 0.136
4.182LysAsp: 4.182 ± 0.582
5.101LysGlu: 5.101 ± 0.52
2.482LysPhe: 2.482 ± 0.296
3.263LysGly: 3.263 ± 0.371
1.241LysHis: 1.241 ± 0.289
3.079LysIle: 3.079 ± 0.394
4.228LysLys: 4.228 ± 0.455
6.204LysLeu: 6.204 ± 0.483
1.379LysMet: 1.379 ± 0.273
4.136LysAsn: 4.136 ± 0.414
2.666LysPro: 2.666 ± 0.278
2.482LysGln: 2.482 ± 0.307
2.987LysArg: 2.987 ± 0.399
3.539LysSer: 3.539 ± 0.438
3.998LysThr: 3.998 ± 0.401
3.631LysVal: 3.631 ± 0.484
0.643LysTrp: 0.643 ± 0.168
2.941LysTyr: 2.941 ± 0.43
0.046LysXaa: 0.046 ± 0.05
Leu
6.342LeuAla: 6.342 ± 0.61
0.551LeuCys: 0.551 ± 0.193
6.388LeuAsp: 6.388 ± 0.52
6.986LeuGlu: 6.986 ± 0.518
3.033LeuPhe: 3.033 ± 0.356
5.745LeuGly: 5.745 ± 0.519
1.884LeuHis: 1.884 ± 0.333
5.285LeuIle: 5.285 ± 0.502
5.607LeuLys: 5.607 ± 0.617
6.066LeuLeu: 6.066 ± 0.465
2.114LeuMet: 2.114 ± 0.28
5.331LeuAsn: 5.331 ± 0.463
3.86LeuPro: 3.86 ± 0.488
3.815LeuGln: 3.815 ± 0.367
5.147LeuArg: 5.147 ± 0.391
6.434LeuSer: 6.434 ± 0.735
5.239LeuThr: 5.239 ± 0.402
5.331LeuVal: 5.331 ± 0.416
0.689LeuTrp: 0.689 ± 0.189
3.217LeuTyr: 3.217 ± 0.377
0.046LeuXaa: 0.046 ± 0.042
Met
1.379MetAla: 1.379 ± 0.273
0.138MetCys: 0.138 ± 0.09
1.425MetAsp: 1.425 ± 0.281
1.654MetGlu: 1.654 ± 0.263
0.506MetPhe: 0.506 ± 0.154
1.333MetGly: 1.333 ± 0.215
0.092MetHis: 0.092 ± 0.066
1.149MetIle: 1.149 ± 0.184
1.884MetLys: 1.884 ± 0.415
2.482MetLeu: 2.482 ± 0.359
0.643MetMet: 0.643 ± 0.155
1.654MetAsn: 1.654 ± 0.378
1.241MetPro: 1.241 ± 0.272
1.333MetGln: 1.333 ± 0.263
1.333MetArg: 1.333 ± 0.206
2.068MetSer: 2.068 ± 0.339
1.93MetThr: 1.93 ± 0.281
1.241MetVal: 1.241 ± 0.192
0.184MetTrp: 0.184 ± 0.096
0.827MetTyr: 0.827 ± 0.211
0.0MetXaa: 0.0 ± 0.0
Asn
4.09AsnAla: 4.09 ± 0.427
0.368AsnCys: 0.368 ± 0.14
3.125AsnAsp: 3.125 ± 0.325
3.493AsnGlu: 3.493 ± 0.392
1.884AsnPhe: 1.884 ± 0.292
3.539AsnGly: 3.539 ± 0.642
0.919AsnHis: 0.919 ± 0.244
3.769AsnIle: 3.769 ± 0.318
3.998AsnLys: 3.998 ± 0.492
6.25AsnLeu: 6.25 ± 0.752
1.379AsnMet: 1.379 ± 0.255
3.401AsnAsn: 3.401 ± 0.453
3.309AsnPro: 3.309 ± 0.393
3.033AsnGln: 3.033 ± 0.317
3.447AsnArg: 3.447 ± 0.481
3.447AsnSer: 3.447 ± 0.408
3.447AsnThr: 3.447 ± 0.414
2.712AsnVal: 2.712 ± 0.417
0.322AsnTrp: 0.322 ± 0.14
2.068AsnTyr: 2.068 ± 0.357
0.0AsnXaa: 0.0 ± 0.0
Pro
2.298ProAla: 2.298 ± 0.304
0.23ProCys: 0.23 ± 0.11
2.803ProAsp: 2.803 ± 0.29
6.158ProGlu: 6.158 ± 0.566
1.746ProPhe: 1.746 ± 0.386
2.62ProGly: 2.62 ± 0.293
0.827ProHis: 0.827 ± 0.171
2.712ProIle: 2.712 ± 0.363
1.976ProLys: 1.976 ± 0.298
3.539ProLeu: 3.539 ± 0.357
1.103ProMet: 1.103 ± 0.244
1.884ProAsn: 1.884 ± 0.269
1.287ProPro: 1.287 ± 0.263
1.333ProGln: 1.333 ± 0.22
1.838ProArg: 1.838 ± 0.283
2.482ProSer: 2.482 ± 0.248
2.528ProThr: 2.528 ± 0.332
3.539ProVal: 3.539 ± 0.4
0.092ProTrp: 0.092 ± 0.055
1.654ProTyr: 1.654 ± 0.311
0.0ProXaa: 0.0 ± 0.0
Gln
3.263GlnAla: 3.263 ± 0.438
0.23GlnCys: 0.23 ± 0.13
2.206GlnAsp: 2.206 ± 0.375
3.033GlnGlu: 3.033 ± 0.447
1.379GlnPhe: 1.379 ± 0.274
2.39GlnGly: 2.39 ± 0.359
0.643GlnHis: 0.643 ± 0.21
3.309GlnIle: 3.309 ± 0.414
2.803GlnLys: 2.803 ± 0.355
3.769GlnLeu: 3.769 ± 0.377
1.379GlnMet: 1.379 ± 0.174
2.206GlnAsn: 2.206 ± 0.314
1.471GlnPro: 1.471 ± 0.275
2.022GlnGln: 2.022 ± 0.317
1.746GlnArg: 1.746 ± 0.309
2.068GlnSer: 2.068 ± 0.317
1.7GlnThr: 1.7 ± 0.279
2.022GlnVal: 2.022 ± 0.298
0.184GlnTrp: 0.184 ± 0.084
1.149GlnTyr: 1.149 ± 0.183
0.0GlnXaa: 0.0 ± 0.0
Arg
3.952ArgAla: 3.952 ± 0.591
0.414ArgCys: 0.414 ± 0.157
2.849ArgAsp: 2.849 ± 0.318
4.55ArgGlu: 4.55 ± 0.527
2.528ArgPhe: 2.528 ± 0.32
2.62ArgGly: 2.62 ± 0.304
0.506ArgHis: 0.506 ± 0.177
3.217ArgIle: 3.217 ± 0.401
3.493ArgLys: 3.493 ± 0.504
4.32ArgLeu: 4.32 ± 0.375
1.471ArgMet: 1.471 ± 0.235
3.125ArgAsn: 3.125 ± 0.349
1.471ArgPro: 1.471 ± 0.251
1.7ArgGln: 1.7 ± 0.36
2.298ArgArg: 2.298 ± 0.374
2.62ArgSer: 2.62 ± 0.45
2.941ArgThr: 2.941 ± 0.385
3.631ArgVal: 3.631 ± 0.492
0.643ArgTrp: 0.643 ± 0.21
1.7ArgTyr: 1.7 ± 0.267
0.0ArgXaa: 0.0 ± 0.0
Ser
4.228SerAla: 4.228 ± 0.555
0.46SerCys: 0.46 ± 0.196
3.906SerAsp: 3.906 ± 0.368
4.274SerGlu: 4.274 ± 0.432
2.298SerPhe: 2.298 ± 0.401
4.734SerGly: 4.734 ± 0.449
1.241SerHis: 1.241 ± 0.231
3.906SerIle: 3.906 ± 0.52
3.815SerLys: 3.815 ± 0.493
6.434SerLeu: 6.434 ± 0.452
2.022SerMet: 2.022 ± 0.313
3.906SerAsn: 3.906 ± 0.553
2.712SerPro: 2.712 ± 0.471
1.93SerGln: 1.93 ± 0.322
3.263SerArg: 3.263 ± 0.335
4.826SerSer: 4.826 ± 0.577
3.86SerThr: 3.86 ± 0.456
5.009SerVal: 5.009 ± 0.426
0.965SerTrp: 0.965 ± 0.241
2.344SerTyr: 2.344 ± 0.378
0.0SerXaa: 0.0 ± 0.0
Thr
3.952ThrAla: 3.952 ± 0.447
0.276ThrCys: 0.276 ± 0.107
3.769ThrAsp: 3.769 ± 0.338
4.458ThrGlu: 4.458 ± 0.43
2.068ThrPhe: 2.068 ± 0.282
4.274ThrGly: 4.274 ± 0.389
1.287ThrHis: 1.287 ± 0.233
3.952ThrIle: 3.952 ± 0.356
2.941ThrLys: 2.941 ± 0.368
4.274ThrLeu: 4.274 ± 0.398
1.057ThrMet: 1.057 ± 0.238
2.987ThrAsn: 2.987 ± 0.337
2.941ThrPro: 2.941 ± 0.323
1.517ThrGln: 1.517 ± 0.23
3.355ThrArg: 3.355 ± 0.43
4.228ThrSer: 4.228 ± 0.423
2.895ThrThr: 2.895 ± 0.434
3.86ThrVal: 3.86 ± 0.368
0.689ThrTrp: 0.689 ± 0.18
2.068ThrTyr: 2.068 ± 0.321
0.046ThrXaa: 0.046 ± 0.05
Val
4.963ValAla: 4.963 ± 0.49
0.827ValCys: 0.827 ± 0.289
3.998ValAsp: 3.998 ± 0.403
4.228ValGlu: 4.228 ± 0.368
2.62ValPhe: 2.62 ± 0.348
3.815ValGly: 3.815 ± 0.457
1.195ValHis: 1.195 ± 0.304
4.366ValIle: 4.366 ± 0.546
3.723ValLys: 3.723 ± 0.471
5.883ValLeu: 5.883 ± 0.526
1.287ValMet: 1.287 ± 0.199
4.55ValAsn: 4.55 ± 0.484
3.033ValPro: 3.033 ± 0.382
2.39ValGln: 2.39 ± 0.331
2.482ValArg: 2.482 ± 0.402
3.723ValSer: 3.723 ± 0.353
3.815ValThr: 3.815 ± 0.317
3.998ValVal: 3.998 ± 0.432
0.735ValTrp: 0.735 ± 0.204
2.252ValTyr: 2.252 ± 0.337
0.0ValXaa: 0.0 ± 0.0
Trp
0.506TrpAla: 0.506 ± 0.138
0.092TrpCys: 0.092 ± 0.069
0.873TrpAsp: 0.873 ± 0.23
1.057TrpGlu: 1.057 ± 0.182
0.689TrpPhe: 0.689 ± 0.189
0.781TrpGly: 0.781 ± 0.208
0.23TrpHis: 0.23 ± 0.118
0.597TrpIle: 0.597 ± 0.207
0.827TrpLys: 0.827 ± 0.253
0.965TrpLeu: 0.965 ± 0.24
0.322TrpMet: 0.322 ± 0.099
0.735TrpAsn: 0.735 ± 0.157
0.046TrpPro: 0.046 ± 0.05
0.597TrpGln: 0.597 ± 0.156
0.46TrpArg: 0.46 ± 0.124
0.689TrpSer: 0.689 ± 0.2
0.551TrpThr: 0.551 ± 0.152
0.827TrpVal: 0.827 ± 0.223
0.184TrpTrp: 0.184 ± 0.091
0.23TrpTyr: 0.23 ± 0.092
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.7TyrAla: 1.7 ± 0.3
0.551TyrCys: 0.551 ± 0.203
2.252TyrAsp: 2.252 ± 0.255
2.482TyrGlu: 2.482 ± 0.37
1.241TyrPhe: 1.241 ± 0.292
2.298TyrGly: 2.298 ± 0.406
0.597TyrHis: 0.597 ± 0.178
2.62TyrIle: 2.62 ± 0.391
2.482TyrLys: 2.482 ± 0.389
3.723TyrLeu: 3.723 ± 0.407
0.689TyrMet: 0.689 ± 0.199
1.976TyrAsn: 1.976 ± 0.345
1.609TyrPro: 1.609 ± 0.341
1.609TyrGln: 1.609 ± 0.261
1.792TyrArg: 1.792 ± 0.252
2.528TyrSer: 2.528 ± 0.333
1.7TyrThr: 1.7 ± 0.27
2.941TyrVal: 2.941 ± 0.405
0.551TyrTrp: 0.551 ± 0.209
1.149TyrTyr: 1.149 ± 0.222
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.046XaaPhe: 0.046 ± 0.05
0.046XaaGly: 0.046 ± 0.042
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.046XaaArg: 0.046 ± 0.045
0.046XaaSer: 0.046 ± 0.05
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 83 proteins (21760 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski