Amino acid dipepetide frequency for Arthrobacter phage Tophat

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.07AlaAla: 18.07 ± 1.336
0.85AlaCys: 0.85 ± 0.208
7.067AlaAsp: 7.067 ± 0.506
8.409AlaGlu: 8.409 ± 0.755
2.863AlaPhe: 2.863 ± 0.433
8.409AlaGly: 8.409 ± 0.728
2.415AlaHis: 2.415 ± 0.377
4.383AlaIle: 4.383 ± 0.432
6.217AlaLys: 6.217 ± 0.597
9.885AlaLeu: 9.885 ± 0.838
3.623AlaMet: 3.623 ± 0.403
4.83AlaAsn: 4.83 ± 0.508
5.904AlaPro: 5.904 ± 0.439
3.802AlaGln: 3.802 ± 0.421
6.53AlaArg: 6.53 ± 0.667
4.92AlaSer: 4.92 ± 0.489
9.08AlaThr: 9.08 ± 0.77
8.409AlaVal: 8.409 ± 0.71
1.521AlaTrp: 1.521 ± 0.3
2.639AlaTyr: 2.639 ± 0.371
0.0AlaXaa: 0.0 ± 0.0
Cys
0.403CysAla: 0.403 ± 0.141
0.268CysCys: 0.268 ± 0.09
0.179CysAsp: 0.179 ± 0.094
0.581CysGlu: 0.581 ± 0.147
0.268CysPhe: 0.268 ± 0.106
0.939CysGly: 0.939 ± 0.236
0.447CysHis: 0.447 ± 0.153
0.492CysIle: 0.492 ± 0.158
0.224CysLys: 0.224 ± 0.104
0.537CysLeu: 0.537 ± 0.154
0.134CysMet: 0.134 ± 0.073
0.268CysAsn: 0.268 ± 0.121
1.029CysPro: 1.029 ± 0.29
0.805CysGln: 0.805 ± 0.183
0.895CysArg: 0.895 ± 0.197
0.447CysSer: 0.447 ± 0.149
0.671CysThr: 0.671 ± 0.175
0.581CysVal: 0.581 ± 0.137
0.268CysTrp: 0.268 ± 0.138
0.134CysTyr: 0.134 ± 0.072
0.0CysXaa: 0.0 ± 0.0
Asp
8.096AspAla: 8.096 ± 0.624
0.581AspCys: 0.581 ± 0.192
2.236AspAsp: 2.236 ± 0.364
2.907AspGlu: 2.907 ± 0.376
1.655AspPhe: 1.655 ± 0.354
6.083AspGly: 6.083 ± 0.542
1.565AspHis: 1.565 ± 0.25
2.415AspIle: 2.415 ± 0.283
2.773AspLys: 2.773 ± 0.381
5.188AspLeu: 5.188 ± 0.463
1.387AspMet: 1.387 ± 0.219
1.879AspAsn: 1.879 ± 0.253
5.009AspPro: 5.009 ± 0.548
1.521AspGln: 1.521 ± 0.244
2.236AspArg: 2.236 ± 0.331
2.639AspSer: 2.639 ± 0.323
3.891AspThr: 3.891 ± 0.401
4.025AspVal: 4.025 ± 0.428
1.387AspTrp: 1.387 ± 0.285
2.147AspTyr: 2.147 ± 0.289
0.0AspXaa: 0.0 ± 0.0
Glu
7.112GluAla: 7.112 ± 0.678
0.716GluCys: 0.716 ± 0.202
3.668GluAsp: 3.668 ± 0.506
4.025GluGlu: 4.025 ± 0.468
1.521GluPhe: 1.521 ± 0.333
4.83GluGly: 4.83 ± 0.442
1.252GluHis: 1.252 ± 0.233
3.22GluIle: 3.22 ± 0.329
2.236GluLys: 2.236 ± 0.304
5.054GluLeu: 5.054 ± 0.571
1.655GluMet: 1.655 ± 0.259
2.281GluAsn: 2.281 ± 0.364
3.578GluPro: 3.578 ± 0.434
2.594GluGln: 2.594 ± 0.417
3.936GluArg: 3.936 ± 0.385
3.623GluSer: 3.623 ± 0.416
3.489GluThr: 3.489 ± 0.412
4.607GluVal: 4.607 ± 0.457
1.252GluTrp: 1.252 ± 0.225
1.655GluTyr: 1.655 ± 0.318
0.0GluXaa: 0.0 ± 0.0
Phe
2.728PheAla: 2.728 ± 0.349
0.313PheCys: 0.313 ± 0.117
1.789PheAsp: 1.789 ± 0.254
2.057PheGlu: 2.057 ± 0.326
0.85PhePhe: 0.85 ± 0.155
2.102PheGly: 2.102 ± 0.302
0.626PheHis: 0.626 ± 0.19
1.208PheIle: 1.208 ± 0.261
1.297PheLys: 1.297 ± 0.227
1.968PheLeu: 1.968 ± 0.323
1.029PheMet: 1.029 ± 0.22
1.342PheAsn: 1.342 ± 0.251
1.476PhePro: 1.476 ± 0.22
1.789PheGln: 1.789 ± 0.26
1.565PheArg: 1.565 ± 0.267
1.163PheSer: 1.163 ± 0.255
2.057PheThr: 2.057 ± 0.287
1.7PheVal: 1.7 ± 0.258
0.537PheTrp: 0.537 ± 0.182
0.716PheTyr: 0.716 ± 0.189
0.0PheXaa: 0.0 ± 0.0
Gly
7.738GlyAla: 7.738 ± 0.652
0.716GlyCys: 0.716 ± 0.19
5.412GlyAsp: 5.412 ± 0.49
3.668GlyGlu: 3.668 ± 0.395
3.176GlyPhe: 3.176 ± 0.564
8.096GlyGly: 8.096 ± 0.92
2.594GlyHis: 2.594 ± 0.349
3.846GlyIle: 3.846 ± 0.54
4.249GlyLys: 4.249 ± 0.49
6.128GlyLeu: 6.128 ± 0.61
1.655GlyMet: 1.655 ± 0.246
3.131GlyAsn: 3.131 ± 0.383
3.22GlyPro: 3.22 ± 0.325
2.415GlyGln: 2.415 ± 0.309
5.322GlyArg: 5.322 ± 0.549
5.546GlySer: 5.546 ± 0.465
5.636GlyThr: 5.636 ± 0.517
6.172GlyVal: 6.172 ± 0.637
2.147GlyTrp: 2.147 ± 0.292
2.236GlyTyr: 2.236 ± 0.35
0.0GlyXaa: 0.0 ± 0.0
His
2.415HisAla: 2.415 ± 0.329
0.447HisCys: 0.447 ± 0.16
1.61HisAsp: 1.61 ± 0.339
1.879HisGlu: 1.879 ± 0.32
0.447HisPhe: 0.447 ± 0.152
1.387HisGly: 1.387 ± 0.259
0.581HisHis: 0.581 ± 0.15
1.073HisIle: 1.073 ± 0.219
1.163HisLys: 1.163 ± 0.262
1.879HisLeu: 1.879 ± 0.285
0.581HisMet: 0.581 ± 0.17
0.492HisAsn: 0.492 ± 0.143
1.521HisPro: 1.521 ± 0.296
0.85HisGln: 0.85 ± 0.224
1.297HisArg: 1.297 ± 0.243
1.073HisSer: 1.073 ± 0.25
1.297HisThr: 1.297 ± 0.266
1.7HisVal: 1.7 ± 0.277
0.403HisTrp: 0.403 ± 0.124
0.447HisTyr: 0.447 ± 0.152
0.0HisXaa: 0.0 ± 0.0
Ile
4.83IleAla: 4.83 ± 0.519
0.134IleCys: 0.134 ± 0.08
2.549IleAsp: 2.549 ± 0.351
3.041IleGlu: 3.041 ± 0.361
1.163IlePhe: 1.163 ± 0.196
2.684IleGly: 2.684 ± 0.495
0.716IleHis: 0.716 ± 0.202
1.7IleIle: 1.7 ± 0.372
1.744IleLys: 1.744 ± 0.292
3.086IleLeu: 3.086 ± 0.384
1.073IleMet: 1.073 ± 0.223
1.61IleAsn: 1.61 ± 0.262
2.102IlePro: 2.102 ± 0.332
1.7IleGln: 1.7 ± 0.257
2.549IleArg: 2.549 ± 0.423
1.968IleSer: 1.968 ± 0.263
3.31IleThr: 3.31 ± 0.389
3.22IleVal: 3.22 ± 0.362
0.671IleTrp: 0.671 ± 0.173
0.939IleTyr: 0.939 ± 0.193
0.0IleXaa: 0.0 ± 0.0
Lys
6.977LysAla: 6.977 ± 0.769
0.179LysCys: 0.179 ± 0.071
2.281LysAsp: 2.281 ± 0.351
2.371LysGlu: 2.371 ± 0.352
0.895LysPhe: 0.895 ± 0.198
4.204LysGly: 4.204 ± 0.449
1.163LysHis: 1.163 ± 0.24
1.521LysIle: 1.521 ± 0.263
1.744LysLys: 1.744 ± 0.362
3.668LysLeu: 3.668 ± 0.423
1.163LysMet: 1.163 ± 0.216
1.342LysAsn: 1.342 ± 0.232
3.176LysPro: 3.176 ± 0.439
1.521LysGln: 1.521 ± 0.275
2.639LysArg: 2.639 ± 0.404
2.192LysSer: 2.192 ± 0.314
2.728LysThr: 2.728 ± 0.37
3.176LysVal: 3.176 ± 0.362
0.537LysTrp: 0.537 ± 0.164
1.208LysTyr: 1.208 ± 0.222
0.0LysXaa: 0.0 ± 0.0
Leu
8.364LeuAla: 8.364 ± 0.673
0.537LeuCys: 0.537 ± 0.122
4.607LeuAsp: 4.607 ± 0.57
5.233LeuGlu: 5.233 ± 0.529
2.057LeuPhe: 2.057 ± 0.298
5.457LeuGly: 5.457 ± 0.724
1.744LeuHis: 1.744 ± 0.308
2.863LeuIle: 2.863 ± 0.394
3.399LeuLys: 3.399 ± 0.408
6.128LeuLeu: 6.128 ± 0.693
1.655LeuMet: 1.655 ± 0.198
2.997LeuAsn: 2.997 ± 0.462
5.233LeuPro: 5.233 ± 0.49
2.192LeuGln: 2.192 ± 0.392
4.786LeuArg: 4.786 ± 0.538
4.204LeuSer: 4.204 ± 0.444
5.591LeuThr: 5.591 ± 0.429
5.457LeuVal: 5.457 ± 0.536
1.297LeuTrp: 1.297 ± 0.235
1.387LeuTyr: 1.387 ± 0.262
0.0LeuXaa: 0.0 ± 0.0
Met
2.728MetAla: 2.728 ± 0.342
0.134MetCys: 0.134 ± 0.075
1.297MetAsp: 1.297 ± 0.244
1.297MetGlu: 1.297 ± 0.225
0.671MetPhe: 0.671 ± 0.145
2.326MetGly: 2.326 ± 0.42
0.581MetHis: 0.581 ± 0.15
0.447MetIle: 0.447 ± 0.152
1.297MetLys: 1.297 ± 0.241
1.834MetLeu: 1.834 ± 0.276
0.268MetMet: 0.268 ± 0.107
0.76MetAsn: 0.76 ± 0.147
1.387MetPro: 1.387 ± 0.254
0.895MetGln: 0.895 ± 0.237
1.923MetArg: 1.923 ± 0.305
1.789MetSer: 1.789 ± 0.273
2.728MetThr: 2.728 ± 0.348
1.744MetVal: 1.744 ± 0.225
0.313MetTrp: 0.313 ± 0.161
0.313MetTyr: 0.313 ± 0.108
0.0MetXaa: 0.0 ± 0.0
Asn
3.981AsnAla: 3.981 ± 0.307
0.358AsnCys: 0.358 ± 0.13
1.521AsnAsp: 1.521 ± 0.259
1.789AsnGlu: 1.789 ± 0.257
0.805AsnPhe: 0.805 ± 0.169
4.16AsnGly: 4.16 ± 0.363
0.671AsnHis: 0.671 ± 0.177
1.431AsnIle: 1.431 ± 0.292
1.476AsnLys: 1.476 ± 0.27
2.371AsnLeu: 2.371 ± 0.346
0.939AsnMet: 0.939 ± 0.21
1.7AsnAsn: 1.7 ± 0.327
3.489AsnPro: 3.489 ± 0.409
1.431AsnGln: 1.431 ± 0.258
2.192AsnArg: 2.192 ± 0.286
1.744AsnSer: 1.744 ± 0.271
2.057AsnThr: 2.057 ± 0.384
2.371AsnVal: 2.371 ± 0.316
1.029AsnTrp: 1.029 ± 0.223
1.252AsnTyr: 1.252 ± 0.238
0.0AsnXaa: 0.0 ± 0.0
Pro
7.201ProAla: 7.201 ± 0.598
0.895ProCys: 0.895 ± 0.258
4.562ProAsp: 4.562 ± 0.516
4.428ProGlu: 4.428 ± 0.523
1.476ProPhe: 1.476 ± 0.243
5.725ProGly: 5.725 ± 0.523
1.521ProHis: 1.521 ± 0.226
2.505ProIle: 2.505 ± 0.367
2.773ProLys: 2.773 ± 0.359
3.757ProLeu: 3.757 ± 0.358
1.342ProMet: 1.342 ± 0.238
2.236ProAsn: 2.236 ± 0.298
2.907ProPro: 2.907 ± 0.405
1.744ProGln: 1.744 ± 0.265
3.31ProArg: 3.31 ± 0.408
2.863ProSer: 2.863 ± 0.401
4.294ProThr: 4.294 ± 0.66
4.517ProVal: 4.517 ± 0.396
0.805ProTrp: 0.805 ± 0.186
1.252ProTyr: 1.252 ± 0.28
0.0ProXaa: 0.0 ± 0.0
Gln
4.204GlnAla: 4.204 ± 0.494
0.358GlnCys: 0.358 ± 0.166
1.744GlnAsp: 1.744 ± 0.263
1.879GlnGlu: 1.879 ± 0.281
1.297GlnPhe: 1.297 ± 0.274
2.907GlnGly: 2.907 ± 0.381
0.492GlnHis: 0.492 ± 0.167
1.342GlnIle: 1.342 ± 0.24
1.073GlnLys: 1.073 ± 0.209
1.968GlnLeu: 1.968 ± 0.236
0.716GlnMet: 0.716 ± 0.145
0.895GlnAsn: 0.895 ± 0.261
1.923GlnPro: 1.923 ± 0.261
1.163GlnGln: 1.163 ± 0.246
2.684GlnArg: 2.684 ± 0.375
1.789GlnSer: 1.789 ± 0.367
2.549GlnThr: 2.549 ± 0.348
3.399GlnVal: 3.399 ± 0.287
0.895GlnTrp: 0.895 ± 0.22
0.984GlnTyr: 0.984 ± 0.206
0.0GlnXaa: 0.0 ± 0.0
Arg
7.335ArgAla: 7.335 ± 0.572
0.716ArgCys: 0.716 ± 0.185
4.741ArgAsp: 4.741 ± 0.36
4.428ArgGlu: 4.428 ± 0.498
2.236ArgPhe: 2.236 ± 0.379
3.623ArgGly: 3.623 ± 0.439
1.163ArgHis: 1.163 ± 0.23
2.684ArgIle: 2.684 ± 0.348
3.086ArgLys: 3.086 ± 0.487
4.517ArgLeu: 4.517 ± 0.508
1.476ArgMet: 1.476 ± 0.261
2.147ArgAsn: 2.147 ± 0.274
2.818ArgPro: 2.818 ± 0.386
1.7ArgGln: 1.7 ± 0.279
4.652ArgArg: 4.652 ± 0.611
2.863ArgSer: 2.863 ± 0.321
3.846ArgThr: 3.846 ± 0.396
4.115ArgVal: 4.115 ± 0.362
1.297ArgTrp: 1.297 ± 0.228
1.565ArgTyr: 1.565 ± 0.257
0.0ArgXaa: 0.0 ± 0.0
Ser
6.485SerAla: 6.485 ± 0.773
0.313SerCys: 0.313 ± 0.113
2.684SerAsp: 2.684 ± 0.313
3.131SerGlu: 3.131 ± 0.412
1.163SerPhe: 1.163 ± 0.185
5.009SerGly: 5.009 ± 0.675
1.252SerHis: 1.252 ± 0.233
2.057SerIle: 2.057 ± 0.333
2.326SerLys: 2.326 ± 0.486
3.802SerLeu: 3.802 ± 0.354
1.431SerMet: 1.431 ± 0.25
2.013SerAsn: 2.013 ± 0.283
2.684SerPro: 2.684 ± 0.301
1.521SerGln: 1.521 ± 0.27
3.22SerArg: 3.22 ± 0.335
2.281SerSer: 2.281 ± 0.279
3.176SerThr: 3.176 ± 0.468
3.623SerVal: 3.623 ± 0.455
0.805SerTrp: 0.805 ± 0.2
1.923SerTyr: 1.923 ± 0.287
0.0SerXaa: 0.0 ± 0.0
Thr
8.453ThrAla: 8.453 ± 0.684
0.716ThrCys: 0.716 ± 0.188
3.444ThrAsp: 3.444 ± 0.282
3.355ThrGlu: 3.355 ± 0.358
2.147ThrPhe: 2.147 ± 0.344
5.68ThrGly: 5.68 ± 0.66
1.7ThrHis: 1.7 ± 0.285
2.684ThrIle: 2.684 ± 0.333
2.46ThrLys: 2.46 ± 0.307
4.83ThrLeu: 4.83 ± 0.405
1.252ThrMet: 1.252 ± 0.241
2.415ThrAsn: 2.415 ± 0.307
5.501ThrPro: 5.501 ± 0.737
2.505ThrGln: 2.505 ± 0.317
4.16ThrArg: 4.16 ± 0.5
3.936ThrSer: 3.936 ± 0.441
4.652ThrThr: 4.652 ± 0.601
6.351ThrVal: 6.351 ± 0.541
1.387ThrTrp: 1.387 ± 0.321
1.7ThrTyr: 1.7 ± 0.3
0.0ThrXaa: 0.0 ± 0.0
Val
8.677ValAla: 8.677 ± 0.614
0.581ValCys: 0.581 ± 0.165
5.457ValAsp: 5.457 ± 0.522
5.233ValGlu: 5.233 ± 0.462
2.236ValPhe: 2.236 ± 0.305
5.233ValGly: 5.233 ± 0.65
1.163ValHis: 1.163 ± 0.28
3.22ValIle: 3.22 ± 0.393
3.22ValLys: 3.22 ± 0.388
5.278ValLeu: 5.278 ± 0.562
2.192ValMet: 2.192 ± 0.297
2.147ValAsn: 2.147 ± 0.311
4.473ValPro: 4.473 ± 0.428
2.057ValGln: 2.057 ± 0.247
4.652ValArg: 4.652 ± 0.481
4.338ValSer: 4.338 ± 0.421
5.814ValThr: 5.814 ± 0.569
5.501ValVal: 5.501 ± 0.517
0.76ValTrp: 0.76 ± 0.175
1.431ValTyr: 1.431 ± 0.277
0.0ValXaa: 0.0 ± 0.0
Trp
1.61TrpAla: 1.61 ± 0.259
0.492TrpCys: 0.492 ± 0.167
1.073TrpAsp: 1.073 ± 0.211
1.029TrpGlu: 1.029 ± 0.222
0.805TrpPhe: 0.805 ± 0.304
1.387TrpGly: 1.387 ± 0.194
0.671TrpHis: 0.671 ± 0.194
0.895TrpIle: 0.895 ± 0.289
0.716TrpLys: 0.716 ± 0.158
1.61TrpLeu: 1.61 ± 0.344
0.447TrpMet: 0.447 ± 0.143
1.163TrpAsn: 1.163 ± 0.302
0.984TrpPro: 0.984 ± 0.219
0.85TrpGln: 0.85 ± 0.228
1.118TrpArg: 1.118 ± 0.211
0.626TrpSer: 0.626 ± 0.144
0.939TrpThr: 0.939 ± 0.198
1.118TrpVal: 1.118 ± 0.26
0.537TrpTrp: 0.537 ± 0.202
0.268TrpTyr: 0.268 ± 0.112
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.415TyrAla: 2.415 ± 0.365
0.268TyrCys: 0.268 ± 0.115
1.7TyrAsp: 1.7 ± 0.239
1.565TyrGlu: 1.565 ± 0.247
0.581TyrPhe: 0.581 ± 0.137
2.863TyrGly: 2.863 ± 0.382
0.224TyrHis: 0.224 ± 0.111
0.805TyrIle: 0.805 ± 0.184
1.208TyrLys: 1.208 ± 0.246
1.7TyrLeu: 1.7 ± 0.3
0.671TyrMet: 0.671 ± 0.157
1.118TyrAsn: 1.118 ± 0.21
1.7TyrPro: 1.7 ± 0.297
1.208TyrGln: 1.208 ± 0.227
1.387TyrArg: 1.387 ± 0.29
0.895TyrSer: 0.895 ± 0.209
1.521TyrThr: 1.521 ± 0.264
1.879TyrVal: 1.879 ± 0.331
0.492TyrTrp: 0.492 ± 0.17
0.76TyrTyr: 0.76 ± 0.217
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 111 proteins (22359 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski