Amino acid dipepetide frequency for Gordonia phage GMA6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.136AlaAla: 9.136 ± 0.792
1.045AlaCys: 1.045 ± 0.239
5.187AlaAsp: 5.187 ± 0.531
5.458AlaGlu: 5.458 ± 0.57
2.903AlaPhe: 2.903 ± 0.404
7.897AlaGly: 7.897 ± 0.68
1.665AlaHis: 1.665 ± 0.262
4.723AlaIle: 4.723 ± 0.654
5.419AlaLys: 5.419 ± 0.474
7.045AlaLeu: 7.045 ± 0.536
2.477AlaMet: 2.477 ± 0.24
3.561AlaAsn: 3.561 ± 0.361
4.336AlaPro: 4.336 ± 0.491
3.716AlaGln: 3.716 ± 0.402
5.845AlaArg: 5.845 ± 0.403
5.032AlaSer: 5.032 ± 0.517
6.194AlaThr: 6.194 ± 0.648
6.736AlaVal: 6.736 ± 0.773
1.123AlaTrp: 1.123 ± 0.232
1.974AlaTyr: 1.974 ± 0.321
0.0AlaXaa: 0.0 ± 0.0
Cys
0.503CysAla: 0.503 ± 0.139
0.039CysCys: 0.039 ± 0.04
0.31CysAsp: 0.31 ± 0.107
0.426CysGlu: 0.426 ± 0.122
0.116CysPhe: 0.116 ± 0.066
0.929CysGly: 0.929 ± 0.209
0.194CysHis: 0.194 ± 0.094
0.426CysIle: 0.426 ± 0.117
0.542CysLys: 0.542 ± 0.157
0.813CysLeu: 0.813 ± 0.167
0.039CysMet: 0.039 ± 0.038
0.465CysAsn: 0.465 ± 0.144
0.619CysPro: 0.619 ± 0.172
0.426CysGln: 0.426 ± 0.138
0.89CysArg: 0.89 ± 0.214
0.697CysSer: 0.697 ± 0.175
0.735CysThr: 0.735 ± 0.212
0.852CysVal: 0.852 ± 0.168
0.271CysTrp: 0.271 ± 0.089
0.348CysTyr: 0.348 ± 0.131
0.0CysXaa: 0.0 ± 0.0
Asp
6.271AspAla: 6.271 ± 0.59
0.426AspCys: 0.426 ± 0.129
5.497AspAsp: 5.497 ± 0.73
5.536AspGlu: 5.536 ± 0.989
1.703AspPhe: 1.703 ± 0.262
5.226AspGly: 5.226 ± 0.479
1.123AspHis: 1.123 ± 0.22
2.826AspIle: 2.826 ± 0.296
2.942AspLys: 2.942 ± 0.337
4.645AspLeu: 4.645 ± 0.368
1.742AspMet: 1.742 ± 0.289
2.477AspAsn: 2.477 ± 0.306
3.329AspPro: 3.329 ± 0.386
2.245AspGln: 2.245 ± 0.298
3.6AspArg: 3.6 ± 0.358
3.794AspSer: 3.794 ± 0.375
4.065AspThr: 4.065 ± 0.387
4.8AspVal: 4.8 ± 0.443
1.239AspTrp: 1.239 ± 0.267
2.09AspTyr: 2.09 ± 0.29
0.0AspXaa: 0.0 ± 0.0
Glu
5.148GluAla: 5.148 ± 0.559
0.929GluCys: 0.929 ± 0.239
5.11GluAsp: 5.11 ± 0.852
4.645GluGlu: 4.645 ± 0.805
1.703GluPhe: 1.703 ± 0.248
4.336GluGly: 4.336 ± 0.472
1.277GluHis: 1.277 ± 0.208
2.284GluIle: 2.284 ± 0.239
2.787GluLys: 2.787 ± 0.381
5.148GluLeu: 5.148 ± 0.476
1.665GluMet: 1.665 ± 0.272
2.4GluAsn: 2.4 ± 0.278
2.942GluPro: 2.942 ± 0.414
2.516GluGln: 2.516 ± 0.31
4.684GluArg: 4.684 ± 0.488
3.252GluSer: 3.252 ± 0.386
3.368GluThr: 3.368 ± 0.346
4.916GluVal: 4.916 ± 0.573
1.665GluTrp: 1.665 ± 0.271
2.477GluTyr: 2.477 ± 0.263
0.0GluXaa: 0.0 ± 0.0
Phe
2.516PheAla: 2.516 ± 0.404
0.116PheCys: 0.116 ± 0.072
2.129PheAsp: 2.129 ± 0.304
1.974PheGlu: 1.974 ± 0.267
0.929PhePhe: 0.929 ± 0.184
2.748PheGly: 2.748 ± 0.334
0.619PheHis: 0.619 ± 0.238
1.239PheIle: 1.239 ± 0.188
1.084PheLys: 1.084 ± 0.196
1.897PheLeu: 1.897 ± 0.272
0.774PheMet: 0.774 ± 0.182
1.587PheAsn: 1.587 ± 0.361
1.51PhePro: 1.51 ± 0.204
1.239PheGln: 1.239 ± 0.167
2.168PheArg: 2.168 ± 0.231
1.936PheSer: 1.936 ± 0.276
1.974PheThr: 1.974 ± 0.251
2.4PheVal: 2.4 ± 0.299
0.387PheTrp: 0.387 ± 0.134
0.387PheTyr: 0.387 ± 0.119
0.0PheXaa: 0.0 ± 0.0
Gly
7.2GlyAla: 7.2 ± 0.571
0.735GlyCys: 0.735 ± 0.218
5.768GlyAsp: 5.768 ± 0.451
5.148GlyGlu: 5.148 ± 0.447
2.671GlyPhe: 2.671 ± 0.327
8.013GlyGly: 8.013 ± 0.861
1.316GlyHis: 1.316 ± 0.201
3.677GlyIle: 3.677 ± 0.389
4.413GlyLys: 4.413 ± 0.515
6.155GlyLeu: 6.155 ± 0.494
2.477GlyMet: 2.477 ± 0.297
3.368GlyAsn: 3.368 ± 0.374
2.516GlyPro: 2.516 ± 0.28
2.71GlyGln: 2.71 ± 0.328
4.529GlyArg: 4.529 ± 0.405
6.116GlySer: 6.116 ± 0.518
6.232GlyThr: 6.232 ± 0.66
6.697GlyVal: 6.697 ± 0.611
1.781GlyTrp: 1.781 ± 0.347
3.213GlyTyr: 3.213 ± 0.349
0.0GlyXaa: 0.0 ± 0.0
His
1.006HisAla: 1.006 ± 0.174
0.426HisCys: 0.426 ± 0.115
1.045HisAsp: 1.045 ± 0.179
1.045HisGlu: 1.045 ± 0.232
0.581HisPhe: 0.581 ± 0.153
1.316HisGly: 1.316 ± 0.251
0.465HisHis: 0.465 ± 0.125
0.503HisIle: 0.503 ± 0.14
0.852HisLys: 0.852 ± 0.191
1.471HisLeu: 1.471 ± 0.227
0.426HisMet: 0.426 ± 0.137
0.852HisAsn: 0.852 ± 0.149
1.045HisPro: 1.045 ± 0.205
0.31HisGln: 0.31 ± 0.106
0.968HisArg: 0.968 ± 0.245
0.697HisSer: 0.697 ± 0.137
1.045HisThr: 1.045 ± 0.179
1.626HisVal: 1.626 ± 0.299
0.387HisTrp: 0.387 ± 0.119
0.89HisTyr: 0.89 ± 0.201
0.0HisXaa: 0.0 ± 0.0
Ile
4.761IleAla: 4.761 ± 0.491
0.348IleCys: 0.348 ± 0.093
3.871IleAsp: 3.871 ± 0.314
3.639IleGlu: 3.639 ± 0.382
0.813IlePhe: 0.813 ± 0.201
3.948IleGly: 3.948 ± 0.463
0.426IleHis: 0.426 ± 0.127
1.974IleIle: 1.974 ± 0.357
2.555IleLys: 2.555 ± 0.404
2.671IleLeu: 2.671 ± 0.343
1.355IleMet: 1.355 ± 0.246
1.858IleAsn: 1.858 ± 0.232
2.09IlePro: 2.09 ± 0.309
1.665IleGln: 1.665 ± 0.294
2.865IleArg: 2.865 ± 0.337
2.284IleSer: 2.284 ± 0.254
3.561IleThr: 3.561 ± 0.416
3.368IleVal: 3.368 ± 0.472
0.465IleTrp: 0.465 ± 0.14
1.161IleTyr: 1.161 ± 0.194
0.0IleXaa: 0.0 ± 0.0
Lys
5.807LysAla: 5.807 ± 0.433
0.194LysCys: 0.194 ± 0.092
2.71LysAsp: 2.71 ± 0.329
2.942LysGlu: 2.942 ± 0.377
1.471LysPhe: 1.471 ± 0.214
4.103LysGly: 4.103 ± 0.455
0.929LysHis: 0.929 ± 0.159
2.323LysIle: 2.323 ± 0.319
4.568LysLys: 4.568 ± 0.786
3.987LysLeu: 3.987 ± 0.407
1.858LysMet: 1.858 ± 0.241
1.703LysAsn: 1.703 ± 0.274
2.671LysPro: 2.671 ± 0.406
2.052LysGln: 2.052 ± 0.241
4.8LysArg: 4.8 ± 0.475
3.677LysSer: 3.677 ± 0.389
3.368LysThr: 3.368 ± 0.369
3.329LysVal: 3.329 ± 0.332
0.929LysTrp: 0.929 ± 0.174
1.471LysTyr: 1.471 ± 0.215
0.0LysXaa: 0.0 ± 0.0
Leu
7.278LeuAla: 7.278 ± 0.556
0.735LeuCys: 0.735 ± 0.218
5.458LeuAsp: 5.458 ± 0.571
3.794LeuGlu: 3.794 ± 0.371
2.206LeuPhe: 2.206 ± 0.308
5.807LeuGly: 5.807 ± 0.474
0.968LeuHis: 0.968 ± 0.167
4.065LeuIle: 4.065 ± 0.44
3.523LeuLys: 3.523 ± 0.365
5.226LeuLeu: 5.226 ± 0.469
2.206LeuMet: 2.206 ± 0.262
3.445LeuAsn: 3.445 ± 0.395
3.716LeuPro: 3.716 ± 0.318
2.168LeuGln: 2.168 ± 0.314
5.148LeuArg: 5.148 ± 0.437
5.845LeuSer: 5.845 ± 0.533
4.142LeuThr: 4.142 ± 0.448
4.955LeuVal: 4.955 ± 0.416
0.774LeuTrp: 0.774 ± 0.174
2.206LeuTyr: 2.206 ± 0.271
0.0LeuXaa: 0.0 ± 0.0
Met
2.787MetAla: 2.787 ± 0.419
0.155MetCys: 0.155 ± 0.087
1.355MetAsp: 1.355 ± 0.214
1.665MetGlu: 1.665 ± 0.261
0.581MetPhe: 0.581 ± 0.141
2.09MetGly: 2.09 ± 0.349
0.465MetHis: 0.465 ± 0.144
0.697MetIle: 0.697 ± 0.167
1.51MetLys: 1.51 ± 0.324
2.168MetLeu: 2.168 ± 0.27
0.387MetMet: 0.387 ± 0.116
0.968MetAsn: 0.968 ± 0.231
1.742MetPro: 1.742 ± 0.276
0.852MetGln: 0.852 ± 0.198
1.703MetArg: 1.703 ± 0.247
2.245MetSer: 2.245 ± 0.299
2.245MetThr: 2.245 ± 0.309
1.51MetVal: 1.51 ± 0.267
0.155MetTrp: 0.155 ± 0.071
0.697MetTyr: 0.697 ± 0.195
0.0MetXaa: 0.0 ± 0.0
Asn
3.252AsnAla: 3.252 ± 0.494
0.542AsnCys: 0.542 ± 0.162
2.748AsnAsp: 2.748 ± 0.394
2.4AsnGlu: 2.4 ± 0.325
1.084AsnPhe: 1.084 ± 0.205
4.645AsnGly: 4.645 ± 0.532
0.658AsnHis: 0.658 ± 0.134
1.936AsnIle: 1.936 ± 0.242
1.974AsnLys: 1.974 ± 0.256
2.555AsnLeu: 2.555 ± 0.325
0.658AsnMet: 0.658 ± 0.201
1.781AsnAsn: 1.781 ± 0.315
2.361AsnPro: 2.361 ± 0.32
1.123AsnGln: 1.123 ± 0.18
2.671AsnArg: 2.671 ± 0.29
1.819AsnSer: 1.819 ± 0.252
3.213AsnThr: 3.213 ± 0.409
3.213AsnVal: 3.213 ± 0.373
0.503AsnTrp: 0.503 ± 0.142
1.742AsnTyr: 1.742 ± 0.216
0.0AsnXaa: 0.0 ± 0.0
Pro
4.607ProAla: 4.607 ± 0.437
0.348ProCys: 0.348 ± 0.11
3.097ProAsp: 3.097 ± 0.32
4.103ProGlu: 4.103 ± 0.366
1.394ProPhe: 1.394 ± 0.218
4.49ProGly: 4.49 ± 0.48
0.89ProHis: 0.89 ± 0.139
2.052ProIle: 2.052 ± 0.284
2.787ProLys: 2.787 ± 0.416
2.71ProLeu: 2.71 ± 0.327
0.968ProMet: 0.968 ± 0.184
2.206ProAsn: 2.206 ± 0.333
1.819ProPro: 1.819 ± 0.309
1.2ProGln: 1.2 ± 0.182
2.206ProArg: 2.206 ± 0.266
3.252ProSer: 3.252 ± 0.335
3.136ProThr: 3.136 ± 0.388
3.445ProVal: 3.445 ± 0.302
0.852ProTrp: 0.852 ± 0.164
1.161ProTyr: 1.161 ± 0.248
0.0ProXaa: 0.0 ± 0.0
Gln
3.329GlnAla: 3.329 ± 0.361
0.387GlnCys: 0.387 ± 0.099
1.471GlnAsp: 1.471 ± 0.295
1.819GlnGlu: 1.819 ± 0.324
1.51GlnPhe: 1.51 ± 0.233
1.781GlnGly: 1.781 ± 0.279
0.619GlnHis: 0.619 ± 0.145
2.245GlnIle: 2.245 ± 0.271
1.432GlnLys: 1.432 ± 0.273
3.445GlnLeu: 3.445 ± 0.37
0.852GlnMet: 0.852 ± 0.196
1.665GlnAsn: 1.665 ± 0.252
1.548GlnPro: 1.548 ± 0.262
1.432GlnGln: 1.432 ± 0.308
2.71GlnArg: 2.71 ± 0.44
2.361GlnSer: 2.361 ± 0.277
1.665GlnThr: 1.665 ± 0.258
2.323GlnVal: 2.323 ± 0.249
1.006GlnTrp: 1.006 ± 0.224
1.161GlnTyr: 1.161 ± 0.211
0.0GlnXaa: 0.0 ± 0.0
Arg
4.761ArgAla: 4.761 ± 0.515
0.89ArgCys: 0.89 ± 0.21
4.026ArgAsp: 4.026 ± 0.342
4.026ArgGlu: 4.026 ± 0.476
2.477ArgPhe: 2.477 ± 0.299
4.916ArgGly: 4.916 ± 0.399
1.123ArgHis: 1.123 ± 0.199
2.942ArgIle: 2.942 ± 0.266
5.342ArgLys: 5.342 ± 0.429
5.303ArgLeu: 5.303 ± 0.563
2.052ArgMet: 2.052 ± 0.317
2.555ArgAsn: 2.555 ± 0.296
2.477ArgPro: 2.477 ± 0.314
2.206ArgGln: 2.206 ± 0.321
5.071ArgArg: 5.071 ± 0.496
4.413ArgSer: 4.413 ± 0.5
3.484ArgThr: 3.484 ± 0.424
4.103ArgVal: 4.103 ± 0.398
1.006ArgTrp: 1.006 ± 0.192
1.936ArgTyr: 1.936 ± 0.33
0.0ArgXaa: 0.0 ± 0.0
Ser
5.536SerAla: 5.536 ± 0.543
0.348SerCys: 0.348 ± 0.133
4.181SerAsp: 4.181 ± 0.434
3.6SerGlu: 3.6 ± 0.434
1.742SerPhe: 1.742 ± 0.293
6.852SerGly: 6.852 ± 0.678
0.968SerHis: 0.968 ± 0.246
2.826SerIle: 2.826 ± 0.253
3.832SerLys: 3.832 ± 0.373
5.071SerLeu: 5.071 ± 0.497
1.548SerMet: 1.548 ± 0.21
2.71SerAsn: 2.71 ± 0.328
2.052SerPro: 2.052 ± 0.266
2.516SerGln: 2.516 ± 0.309
3.523SerArg: 3.523 ± 0.32
4.026SerSer: 4.026 ± 0.549
4.684SerThr: 4.684 ± 0.453
4.723SerVal: 4.723 ± 0.517
0.89SerTrp: 0.89 ± 0.184
2.129SerTyr: 2.129 ± 0.248
0.0SerXaa: 0.0 ± 0.0
Thr
6.929ThrAla: 6.929 ± 0.696
0.619ThrCys: 0.619 ± 0.18
3.406ThrAsp: 3.406 ± 0.312
3.677ThrGlu: 3.677 ± 0.386
2.129ThrPhe: 2.129 ± 0.298
6.077ThrGly: 6.077 ± 0.525
1.084ThrHis: 1.084 ± 0.212
3.832ThrIle: 3.832 ± 0.501
3.058ThrLys: 3.058 ± 0.397
4.336ThrLeu: 4.336 ± 0.434
1.626ThrMet: 1.626 ± 0.213
2.555ThrAsn: 2.555 ± 0.39
3.91ThrPro: 3.91 ± 0.426
2.284ThrGln: 2.284 ± 0.34
3.368ThrArg: 3.368 ± 0.387
4.374ThrSer: 4.374 ± 0.535
5.187ThrThr: 5.187 ± 0.649
5.071ThrVal: 5.071 ± 0.498
1.045ThrTrp: 1.045 ± 0.224
2.284ThrTyr: 2.284 ± 0.317
0.0ThrXaa: 0.0 ± 0.0
Val
6.968ValAla: 6.968 ± 0.616
0.774ValCys: 0.774 ± 0.181
4.645ValAsp: 4.645 ± 0.433
4.607ValGlu: 4.607 ± 0.485
2.323ValPhe: 2.323 ± 0.401
5.226ValGly: 5.226 ± 0.491
1.239ValHis: 1.239 ± 0.221
3.019ValIle: 3.019 ± 0.358
3.987ValLys: 3.987 ± 0.415
5.419ValLeu: 5.419 ± 0.388
1.703ValMet: 1.703 ± 0.318
2.71ValAsn: 2.71 ± 0.333
3.948ValPro: 3.948 ± 0.364
2.245ValGln: 2.245 ± 0.327
4.8ValArg: 4.8 ± 0.444
5.148ValSer: 5.148 ± 0.405
5.187ValThr: 5.187 ± 0.498
5.265ValVal: 5.265 ± 0.613
1.161ValTrp: 1.161 ± 0.197
2.323ValTyr: 2.323 ± 0.279
0.0ValXaa: 0.0 ± 0.0
Trp
1.2TrpAla: 1.2 ± 0.222
0.116TrpCys: 0.116 ± 0.065
1.394TrpAsp: 1.394 ± 0.267
1.084TrpGlu: 1.084 ± 0.18
0.697TrpPhe: 0.697 ± 0.164
1.045TrpGly: 1.045 ± 0.194
0.387TrpHis: 0.387 ± 0.138
0.852TrpIle: 0.852 ± 0.159
0.968TrpLys: 0.968 ± 0.216
1.084TrpLeu: 1.084 ± 0.244
0.271TrpMet: 0.271 ± 0.102
0.89TrpAsn: 0.89 ± 0.251
0.619TrpPro: 0.619 ± 0.185
0.774TrpGln: 0.774 ± 0.186
1.084TrpArg: 1.084 ± 0.268
1.006TrpSer: 1.006 ± 0.176
1.045TrpThr: 1.045 ± 0.183
1.045TrpVal: 1.045 ± 0.199
0.426TrpTrp: 0.426 ± 0.13
0.465TrpTyr: 0.465 ± 0.119
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.477TyrAla: 2.477 ± 0.363
0.465TyrCys: 0.465 ± 0.127
2.168TyrAsp: 2.168 ± 0.324
1.703TyrGlu: 1.703 ± 0.276
0.581TyrPhe: 0.581 ± 0.146
3.29TyrGly: 3.29 ± 0.417
0.542TyrHis: 0.542 ± 0.133
1.239TyrIle: 1.239 ± 0.237
1.316TyrLys: 1.316 ± 0.212
2.516TyrLeu: 2.516 ± 0.323
0.852TyrMet: 0.852 ± 0.186
1.123TyrAsn: 1.123 ± 0.207
1.471TyrPro: 1.471 ± 0.263
1.084TyrGln: 1.084 ± 0.201
2.477TyrArg: 2.477 ± 0.333
1.781TyrSer: 1.781 ± 0.24
2.323TyrThr: 2.323 ± 0.274
2.323TyrVal: 2.323 ± 0.29
0.387TyrTrp: 0.387 ± 0.135
1.2TyrTyr: 1.2 ± 0.233
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 115 proteins (25834 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski