Amino acid dipepetide frequency for Gordonia phage BrutonGaster

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.405AlaAla: 9.405 ± 0.996
0.731AlaCys: 0.731 ± 0.181
6.897AlaAsp: 6.897 ± 0.463
6.2AlaGlu: 6.2 ± 0.593
2.543AlaPhe: 2.543 ± 0.322
7.489AlaGly: 7.489 ± 0.687
1.498AlaHis: 1.498 ± 0.287
4.598AlaIle: 4.598 ± 0.438
4.389AlaLys: 4.389 ± 0.579
7.768AlaLeu: 7.768 ± 0.696
2.578AlaMet: 2.578 ± 0.303
3.727AlaAsn: 3.727 ± 0.426
3.1AlaPro: 3.1 ± 0.315
3.901AlaGln: 3.901 ± 0.483
5.921AlaArg: 5.921 ± 0.451
5.155AlaSer: 5.155 ± 0.407
5.887AlaThr: 5.887 ± 0.488
6.305AlaVal: 6.305 ± 0.479
1.916AlaTrp: 1.916 ± 0.264
3.17AlaTyr: 3.17 ± 0.373
0.0AlaXaa: 0.0 ± 0.0
Cys
1.358CysAla: 1.358 ± 0.303
0.244CysCys: 0.244 ± 0.1
0.906CysAsp: 0.906 ± 0.23
0.592CysGlu: 0.592 ± 0.155
0.279CysPhe: 0.279 ± 0.112
1.289CysGly: 1.289 ± 0.247
0.522CysHis: 0.522 ± 0.16
0.279CysIle: 0.279 ± 0.1
0.488CysLys: 0.488 ± 0.159
0.383CysLeu: 0.383 ± 0.144
0.104CysMet: 0.104 ± 0.058
0.662CysAsn: 0.662 ± 0.192
0.453CysPro: 0.453 ± 0.155
0.348CysGln: 0.348 ± 0.133
0.871CysArg: 0.871 ± 0.293
0.592CysSer: 0.592 ± 0.224
0.488CysThr: 0.488 ± 0.149
0.488CysVal: 0.488 ± 0.143
0.209CysTrp: 0.209 ± 0.089
0.313CysTyr: 0.313 ± 0.117
0.0CysXaa: 0.0 ± 0.0
Asp
7.001AspAla: 7.001 ± 0.404
0.592AspCys: 0.592 ± 0.161
5.469AspAsp: 5.469 ± 0.547
4.772AspGlu: 4.772 ± 0.497
2.055AspPhe: 2.055 ± 0.251
5.956AspGly: 5.956 ± 0.553
2.16AspHis: 2.16 ± 0.276
3.135AspIle: 3.135 ± 0.336
2.682AspLys: 2.682 ± 0.275
5.852AspLeu: 5.852 ± 0.49
1.811AspMet: 1.811 ± 0.227
1.951AspAsn: 1.951 ± 0.261
4.633AspPro: 4.633 ± 0.422
1.811AspGln: 1.811 ± 0.209
4.006AspArg: 4.006 ± 0.339
2.961AspSer: 2.961 ± 0.303
3.239AspThr: 3.239 ± 0.341
5.295AspVal: 5.295 ± 0.523
1.916AspTrp: 1.916 ± 0.231
2.543AspTyr: 2.543 ± 0.287
0.0AspXaa: 0.0 ± 0.0
Glu
7.593GluAla: 7.593 ± 0.579
0.836GluCys: 0.836 ± 0.235
3.971GluAsp: 3.971 ± 0.416
5.608GluGlu: 5.608 ± 0.59
2.578GluPhe: 2.578 ± 0.312
4.842GluGly: 4.842 ± 0.466
1.742GluHis: 1.742 ± 0.253
4.215GluIle: 4.215 ± 0.489
3.135GluLys: 3.135 ± 0.409
6.479GluLeu: 6.479 ± 0.493
1.707GluMet: 1.707 ± 0.233
2.682GluAsn: 2.682 ± 0.285
2.717GluPro: 2.717 ± 0.344
2.856GluGln: 2.856 ± 0.335
3.901GluArg: 3.901 ± 0.416
2.961GluSer: 2.961 ± 0.301
3.623GluThr: 3.623 ± 0.374
4.772GluVal: 4.772 ± 0.39
1.672GluTrp: 1.672 ± 0.226
2.682GluTyr: 2.682 ± 0.302
0.0GluXaa: 0.0 ± 0.0
Phe
2.578PheAla: 2.578 ± 0.319
0.488PheCys: 0.488 ± 0.139
2.647PheAsp: 2.647 ± 0.301
2.055PheGlu: 2.055 ± 0.274
0.871PhePhe: 0.871 ± 0.198
2.787PheGly: 2.787 ± 0.349
0.627PheHis: 0.627 ± 0.151
1.463PheIle: 1.463 ± 0.243
0.975PheLys: 0.975 ± 0.193
1.951PheLeu: 1.951 ± 0.275
1.01PheMet: 1.01 ± 0.186
1.428PheAsn: 1.428 ± 0.206
1.08PhePro: 1.08 ± 0.204
0.731PheGln: 0.731 ± 0.188
1.428PheArg: 1.428 ± 0.209
2.125PheSer: 2.125 ± 0.239
1.846PheThr: 1.846 ± 0.202
2.299PheVal: 2.299 ± 0.328
0.627PheTrp: 0.627 ± 0.144
0.975PheTyr: 0.975 ± 0.194
0.0PheXaa: 0.0 ± 0.0
Gly
5.329GlyAla: 5.329 ± 0.748
0.766GlyCys: 0.766 ± 0.218
5.434GlyAsp: 5.434 ± 0.587
5.329GlyGlu: 5.329 ± 0.447
2.508GlyPhe: 2.508 ± 0.312
6.444GlyGly: 6.444 ± 0.556
1.498GlyHis: 1.498 ± 0.207
3.553GlyIle: 3.553 ± 0.352
3.971GlyLys: 3.971 ± 0.426
6.305GlyLeu: 6.305 ± 0.592
2.16GlyMet: 2.16 ± 0.382
3.657GlyAsn: 3.657 ± 0.421
3.553GlyPro: 3.553 ± 0.408
2.891GlyGln: 2.891 ± 0.273
4.493GlyArg: 4.493 ± 0.361
4.842GlySer: 4.842 ± 0.471
5.086GlyThr: 5.086 ± 0.523
6.061GlyVal: 6.061 ± 0.47
1.811GlyTrp: 1.811 ± 0.254
2.717GlyTyr: 2.717 ± 0.368
0.0GlyXaa: 0.0 ± 0.0
His
1.811HisAla: 1.811 ± 0.276
0.313HisCys: 0.313 ± 0.111
1.254HisAsp: 1.254 ± 0.211
1.324HisGlu: 1.324 ± 0.212
1.115HisPhe: 1.115 ± 0.237
1.358HisGly: 1.358 ± 0.184
0.313HisHis: 0.313 ± 0.106
1.115HisIle: 1.115 ± 0.245
0.94HisLys: 0.94 ± 0.19
1.916HisLeu: 1.916 ± 0.237
0.488HisMet: 0.488 ± 0.141
0.906HisAsn: 0.906 ± 0.178
1.254HisPro: 1.254 ± 0.25
0.871HisGln: 0.871 ± 0.147
1.672HisArg: 1.672 ± 0.274
0.975HisSer: 0.975 ± 0.218
1.602HisThr: 1.602 ± 0.24
1.672HisVal: 1.672 ± 0.288
0.383HisTrp: 0.383 ± 0.158
1.219HisTyr: 1.219 ± 0.235
0.0HisXaa: 0.0 ± 0.0
Ile
5.329IleAla: 5.329 ± 0.402
0.557IleCys: 0.557 ± 0.147
4.145IleAsp: 4.145 ± 0.384
4.424IleGlu: 4.424 ± 0.418
1.045IlePhe: 1.045 ± 0.154
3.623IleGly: 3.623 ± 0.397
1.08IleHis: 1.08 ± 0.255
2.194IleIle: 2.194 ± 0.293
2.612IleLys: 2.612 ± 0.36
3.239IleLeu: 3.239 ± 0.335
1.01IleMet: 1.01 ± 0.187
2.055IleAsn: 2.055 ± 0.315
2.508IlePro: 2.508 ± 0.397
1.951IleGln: 1.951 ± 0.32
3.448IleArg: 3.448 ± 0.343
2.299IleSer: 2.299 ± 0.303
3.03IleThr: 3.03 ± 0.361
3.239IleVal: 3.239 ± 0.347
0.731IleTrp: 0.731 ± 0.148
1.358IleTyr: 1.358 ± 0.224
0.0IleXaa: 0.0 ± 0.0
Lys
5.051LysAla: 5.051 ± 0.633
0.348LysCys: 0.348 ± 0.133
2.508LysAsp: 2.508 ± 0.349
3.065LysGlu: 3.065 ± 0.36
1.254LysPhe: 1.254 ± 0.24
3.518LysGly: 3.518 ± 0.51
1.115LysHis: 1.115 ± 0.184
2.16LysIle: 2.16 ± 0.239
2.508LysLys: 2.508 ± 0.441
4.18LysLeu: 4.18 ± 0.601
1.254LysMet: 1.254 ± 0.203
1.533LysAsn: 1.533 ± 0.233
2.403LysPro: 2.403 ± 0.263
1.358LysGln: 1.358 ± 0.22
3.448LysArg: 3.448 ± 0.385
2.682LysSer: 2.682 ± 0.347
2.194LysThr: 2.194 ± 0.302
3.553LysVal: 3.553 ± 0.312
0.801LysTrp: 0.801 ± 0.159
0.94LysTyr: 0.94 ± 0.21
0.0LysXaa: 0.0 ± 0.0
Leu
6.897LeuAla: 6.897 ± 0.596
1.08LeuCys: 1.08 ± 0.288
5.852LeuAsp: 5.852 ± 0.36
5.887LeuGlu: 5.887 ± 0.519
2.369LeuPhe: 2.369 ± 0.254
5.329LeuGly: 5.329 ± 0.464
2.02LeuHis: 2.02 ± 0.289
3.657LeuIle: 3.657 ± 0.293
3.03LeuLys: 3.03 ± 0.444
4.946LeuLeu: 4.946 ± 0.514
1.602LeuMet: 1.602 ± 0.169
3.274LeuAsn: 3.274 ± 0.586
4.006LeuPro: 4.006 ± 0.511
2.891LeuGln: 2.891 ± 0.296
4.981LeuArg: 4.981 ± 0.394
4.807LeuSer: 4.807 ± 0.399
5.712LeuThr: 5.712 ± 0.532
4.981LeuVal: 4.981 ± 0.533
1.742LeuTrp: 1.742 ± 0.29
2.299LeuTyr: 2.299 ± 0.285
0.0LeuXaa: 0.0 ± 0.0
Met
3.03MetAla: 3.03 ± 0.359
0.313MetCys: 0.313 ± 0.107
1.254MetAsp: 1.254 ± 0.209
1.463MetGlu: 1.463 ± 0.203
0.697MetPhe: 0.697 ± 0.16
1.916MetGly: 1.916 ± 0.296
0.348MetHis: 0.348 ± 0.107
1.428MetIle: 1.428 ± 0.224
0.836MetLys: 0.836 ± 0.182
1.567MetLeu: 1.567 ± 0.22
0.557MetMet: 0.557 ± 0.127
1.358MetAsn: 1.358 ± 0.202
1.324MetPro: 1.324 ± 0.177
0.94MetGln: 0.94 ± 0.195
1.602MetArg: 1.602 ± 0.255
2.229MetSer: 2.229 ± 0.289
2.055MetThr: 2.055 ± 0.324
1.08MetVal: 1.08 ± 0.199
0.488MetTrp: 0.488 ± 0.131
0.522MetTyr: 0.522 ± 0.14
0.0MetXaa: 0.0 ± 0.0
Asn
3.414AsnAla: 3.414 ± 0.442
0.488AsnCys: 0.488 ± 0.15
2.473AsnAsp: 2.473 ± 0.241
2.752AsnGlu: 2.752 ± 0.266
1.184AsnPhe: 1.184 ± 0.211
3.379AsnGly: 3.379 ± 0.411
0.731AsnHis: 0.731 ± 0.16
1.742AsnIle: 1.742 ± 0.235
1.672AsnLys: 1.672 ± 0.351
3.17AsnLeu: 3.17 ± 0.225
0.801AsnMet: 0.801 ± 0.193
1.498AsnAsn: 1.498 ± 0.273
2.787AsnPro: 2.787 ± 0.293
1.358AsnGln: 1.358 ± 0.237
2.578AsnArg: 2.578 ± 0.278
1.985AsnSer: 1.985 ± 0.315
2.508AsnThr: 2.508 ± 0.355
2.299AsnVal: 2.299 ± 0.355
0.766AsnTrp: 0.766 ± 0.169
0.94AsnTyr: 0.94 ± 0.171
0.0AsnXaa: 0.0 ± 0.0
Pro
4.215ProAla: 4.215 ± 0.378
0.488ProCys: 0.488 ± 0.14
3.239ProAsp: 3.239 ± 0.302
4.041ProGlu: 4.041 ± 0.471
1.811ProPhe: 1.811 ± 0.203
4.459ProGly: 4.459 ± 0.541
0.801ProHis: 0.801 ± 0.166
2.264ProIle: 2.264 ± 0.32
2.926ProLys: 2.926 ± 0.42
2.821ProLeu: 2.821 ± 0.292
1.08ProMet: 1.08 ± 0.162
1.602ProAsn: 1.602 ± 0.233
2.055ProPro: 2.055 ± 0.321
1.254ProGln: 1.254 ± 0.252
2.334ProArg: 2.334 ± 0.359
2.926ProSer: 2.926 ± 0.372
3.135ProThr: 3.135 ± 0.436
3.379ProVal: 3.379 ± 0.367
0.94ProTrp: 0.94 ± 0.164
1.324ProTyr: 1.324 ± 0.194
0.0ProXaa: 0.0 ± 0.0
Gln
3.762GlnAla: 3.762 ± 0.528
0.383GlnCys: 0.383 ± 0.125
1.881GlnAsp: 1.881 ± 0.3
1.985GlnGlu: 1.985 ± 0.299
1.08GlnPhe: 1.08 ± 0.172
2.612GlnGly: 2.612 ± 0.322
0.418GlnHis: 0.418 ± 0.118
1.985GlnIle: 1.985 ± 0.299
1.707GlnLys: 1.707 ± 0.312
3.623GlnLeu: 3.623 ± 0.359
1.01GlnMet: 1.01 ± 0.155
1.219GlnAsn: 1.219 ± 0.247
1.742GlnPro: 1.742 ± 0.292
1.602GlnGln: 1.602 ± 0.228
2.612GlnArg: 2.612 ± 0.397
2.055GlnSer: 2.055 ± 0.311
2.264GlnThr: 2.264 ± 0.271
2.961GlnVal: 2.961 ± 0.358
0.731GlnTrp: 0.731 ± 0.188
0.836GlnTyr: 0.836 ± 0.178
0.0GlnXaa: 0.0 ± 0.0
Arg
5.434ArgAla: 5.434 ± 0.575
0.801ArgCys: 0.801 ± 0.241
4.493ArgAsp: 4.493 ± 0.391
4.911ArgGlu: 4.911 ± 0.483
1.742ArgPhe: 1.742 ± 0.243
3.971ArgGly: 3.971 ± 0.325
2.229ArgHis: 2.229 ± 0.41
3.623ArgIle: 3.623 ± 0.375
3.762ArgLys: 3.762 ± 0.362
5.051ArgLeu: 5.051 ± 0.418
1.637ArgMet: 1.637 ± 0.219
2.299ArgAsn: 2.299 ± 0.238
2.438ArgPro: 2.438 ± 0.309
3.065ArgGln: 3.065 ± 0.377
4.563ArgArg: 4.563 ± 0.581
3.239ArgSer: 3.239 ± 0.306
2.647ArgThr: 2.647 ± 0.293
4.598ArgVal: 4.598 ± 0.475
1.602ArgTrp: 1.602 ± 0.282
2.229ArgTyr: 2.229 ± 0.286
0.0ArgXaa: 0.0 ± 0.0
Ser
3.971SerAla: 3.971 ± 0.417
0.557SerCys: 0.557 ± 0.162
3.832SerAsp: 3.832 ± 0.381
3.17SerGlu: 3.17 ± 0.34
1.776SerPhe: 1.776 ± 0.237
4.737SerGly: 4.737 ± 0.512
1.602SerHis: 1.602 ± 0.306
2.717SerIle: 2.717 ± 0.318
2.438SerLys: 2.438 ± 0.33
4.319SerLeu: 4.319 ± 0.326
1.637SerMet: 1.637 ± 0.253
2.578SerAsn: 2.578 ± 0.315
2.194SerPro: 2.194 ± 0.243
1.846SerGln: 1.846 ± 0.317
3.901SerArg: 3.901 ± 0.365
3.344SerSer: 3.344 ± 0.335
3.309SerThr: 3.309 ± 0.304
4.075SerVal: 4.075 ± 0.346
1.324SerTrp: 1.324 ± 0.209
2.334SerTyr: 2.334 ± 0.325
0.0SerXaa: 0.0 ± 0.0
Thr
6.061ThrAla: 6.061 ± 0.519
0.383ThrCys: 0.383 ± 0.13
3.901ThrAsp: 3.901 ± 0.389
4.145ThrGlu: 4.145 ± 0.415
1.916ThrPhe: 1.916 ± 0.273
5.434ThrGly: 5.434 ± 0.527
0.94ThrHis: 0.94 ± 0.202
3.309ThrIle: 3.309 ± 0.386
2.752ThrLys: 2.752 ± 0.395
4.633ThrLeu: 4.633 ± 0.409
1.393ThrMet: 1.393 ± 0.246
1.916ThrAsn: 1.916 ± 0.26
3.414ThrPro: 3.414 ± 0.377
2.194ThrGln: 2.194 ± 0.247
3.762ThrArg: 3.762 ± 0.424
3.762ThrSer: 3.762 ± 0.398
3.936ThrThr: 3.936 ± 0.451
4.145ThrVal: 4.145 ± 0.508
0.906ThrTrp: 0.906 ± 0.166
2.09ThrTyr: 2.09 ± 0.31
0.0ThrXaa: 0.0 ± 0.0
Val
6.862ValAla: 6.862 ± 0.5
0.906ValCys: 0.906 ± 0.219
6.096ValAsp: 6.096 ± 0.485
5.329ValGlu: 5.329 ± 0.489
1.463ValPhe: 1.463 ± 0.242
5.434ValGly: 5.434 ± 0.379
1.289ValHis: 1.289 ± 0.21
3.971ValIle: 3.971 ± 0.419
3.065ValLys: 3.065 ± 0.271
5.155ValLeu: 5.155 ± 0.389
1.916ValMet: 1.916 ± 0.296
1.846ValAsn: 1.846 ± 0.265
3.239ValPro: 3.239 ± 0.38
2.369ValGln: 2.369 ± 0.329
4.702ValArg: 4.702 ± 0.41
4.041ValSer: 4.041 ± 0.515
4.633ValThr: 4.633 ± 0.529
4.737ValVal: 4.737 ± 0.501
1.08ValTrp: 1.08 ± 0.212
2.16ValTyr: 2.16 ± 0.325
0.0ValXaa: 0.0 ± 0.0
Trp
1.916TrpAla: 1.916 ± 0.216
0.174TrpCys: 0.174 ± 0.092
1.707TrpAsp: 1.707 ± 0.243
1.254TrpGlu: 1.254 ± 0.243
0.697TrpPhe: 0.697 ± 0.173
1.08TrpGly: 1.08 ± 0.219
0.766TrpHis: 0.766 ± 0.186
1.149TrpIle: 1.149 ± 0.187
0.801TrpLys: 0.801 ± 0.134
1.846TrpLeu: 1.846 ± 0.323
0.418TrpMet: 0.418 ± 0.14
0.94TrpAsn: 0.94 ± 0.148
0.697TrpPro: 0.697 ± 0.169
0.836TrpGln: 0.836 ± 0.177
1.324TrpArg: 1.324 ± 0.18
1.115TrpSer: 1.115 ± 0.2
1.533TrpThr: 1.533 ± 0.217
1.707TrpVal: 1.707 ± 0.226
0.697TrpTrp: 0.697 ± 0.206
0.522TrpTyr: 0.522 ± 0.133
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.194TyrAla: 2.194 ± 0.274
0.418TyrCys: 0.418 ± 0.152
1.881TyrAsp: 1.881 ± 0.352
2.264TyrGlu: 2.264 ± 0.278
1.01TyrPhe: 1.01 ± 0.159
2.752TyrGly: 2.752 ± 0.306
0.801TyrHis: 0.801 ± 0.207
1.324TyrIle: 1.324 ± 0.205
1.184TyrLys: 1.184 ± 0.201
2.16TyrLeu: 2.16 ± 0.269
0.801TyrMet: 0.801 ± 0.173
1.393TyrAsn: 1.393 ± 0.215
1.498TyrPro: 1.498 ± 0.266
1.393TyrGln: 1.393 ± 0.224
2.787TyrArg: 2.787 ± 0.328
1.567TyrSer: 1.567 ± 0.267
2.264TyrThr: 2.264 ± 0.253
2.647TyrVal: 2.647 ± 0.355
0.836TyrTrp: 0.836 ± 0.18
0.801TyrTyr: 0.801 ± 0.178
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 166 proteins (28710 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski