Amino acid dipepetide frequency for Microbacterium phage ValentiniPuff

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.255AlaAla: 17.255 ± 1.599
0.558AlaCys: 0.558 ± 0.207
7.816AlaAsp: 7.816 ± 0.632
7.257AlaGlu: 7.257 ± 0.887
2.385AlaPhe: 2.385 ± 0.383
9.998AlaGly: 9.998 ± 1.142
2.335AlaHis: 2.335 ± 0.314
5.075AlaIle: 5.075 ± 0.601
4.06AlaLys: 4.06 ± 0.529
8.069AlaLeu: 8.069 ± 0.729
3.654AlaMet: 3.654 ± 0.455
2.436AlaAsn: 2.436 ± 0.341
5.887AlaPro: 5.887 ± 0.715
4.162AlaGln: 4.162 ± 0.425
7.41AlaArg: 7.41 ± 0.62
5.583AlaSer: 5.583 ± 0.742
7.054AlaThr: 7.054 ± 0.632
10.099AlaVal: 10.099 ± 1.242
2.233AlaTrp: 2.233 ± 0.364
2.741AlaTyr: 2.741 ± 0.3
0.0AlaXaa: 0.0 ± 0.0
Cys
0.203CysAla: 0.203 ± 0.093
0.102CysCys: 0.102 ± 0.081
0.355CysAsp: 0.355 ± 0.158
0.457CysGlu: 0.457 ± 0.165
0.102CysPhe: 0.102 ± 0.079
1.218CysGly: 1.218 ± 0.346
0.355CysHis: 0.355 ± 0.142
0.254CysIle: 0.254 ± 0.117
0.152CysLys: 0.152 ± 0.089
0.203CysLeu: 0.203 ± 0.105
0.152CysMet: 0.152 ± 0.082
0.051CysAsn: 0.051 ± 0.057
0.305CysPro: 0.305 ± 0.128
0.152CysGln: 0.152 ± 0.095
0.558CysArg: 0.558 ± 0.202
0.508CysSer: 0.508 ± 0.206
0.254CysThr: 0.254 ± 0.121
0.406CysVal: 0.406 ± 0.142
0.305CysTrp: 0.305 ± 0.116
0.152CysTyr: 0.152 ± 0.084
0.0CysXaa: 0.0 ± 0.0
Asp
9.084AspAla: 9.084 ± 0.697
0.254AspCys: 0.254 ± 0.123
5.278AspAsp: 5.278 ± 0.683
4.771AspGlu: 4.771 ± 0.632
1.979AspPhe: 1.979 ± 0.307
6.293AspGly: 6.293 ± 0.581
1.117AspHis: 1.117 ± 0.274
2.69AspIle: 2.69 ± 0.393
1.117AspLys: 1.117 ± 0.222
4.771AspLeu: 4.771 ± 0.611
1.269AspMet: 1.269 ± 0.224
1.472AspAsn: 1.472 ± 0.214
4.821AspPro: 4.821 ± 0.637
1.878AspGln: 1.878 ± 0.298
4.06AspArg: 4.06 ± 0.487
3.197AspSer: 3.197 ± 0.433
4.162AspThr: 4.162 ± 0.436
5.887AspVal: 5.887 ± 0.559
1.624AspTrp: 1.624 ± 0.287
1.675AspTyr: 1.675 ± 0.378
0.0AspXaa: 0.0 ± 0.0
Glu
6.902GluAla: 6.902 ± 0.659
0.203GluCys: 0.203 ± 0.094
3.908GluAsp: 3.908 ± 0.599
3.603GluGlu: 3.603 ± 0.57
2.233GluPhe: 2.233 ± 0.295
3.451GluGly: 3.451 ± 0.387
1.776GluHis: 1.776 ± 0.375
4.06GluIle: 4.06 ± 0.392
1.929GluLys: 1.929 ± 0.323
4.06GluLeu: 4.06 ± 0.408
1.675GluMet: 1.675 ± 0.297
1.979GluAsn: 1.979 ± 0.358
3.045GluPro: 3.045 ± 0.439
2.791GluGln: 2.791 ± 0.541
4.872GluArg: 4.872 ± 0.623
2.741GluSer: 2.741 ± 0.388
3.299GluThr: 3.299 ± 0.389
5.735GluVal: 5.735 ± 0.564
1.37GluTrp: 1.37 ± 0.254
1.573GluTyr: 1.573 ± 0.286
0.0GluXaa: 0.0 ± 0.0
Phe
3.147PheAla: 3.147 ± 0.397
0.203PheCys: 0.203 ± 0.108
2.233PheAsp: 2.233 ± 0.32
1.726PheGlu: 1.726 ± 0.287
0.761PhePhe: 0.761 ± 0.325
2.69PheGly: 2.69 ± 0.396
0.508PheHis: 0.508 ± 0.171
1.573PheIle: 1.573 ± 0.255
0.508PheLys: 0.508 ± 0.146
1.827PheLeu: 1.827 ± 0.351
0.914PheMet: 0.914 ± 0.201
1.066PheAsn: 1.066 ± 0.216
1.117PhePro: 1.117 ± 0.246
1.015PheGln: 1.015 ± 0.259
1.218PheArg: 1.218 ± 0.234
1.472PheSer: 1.472 ± 0.239
1.878PheThr: 1.878 ± 0.306
2.335PheVal: 2.335 ± 0.338
0.609PheTrp: 0.609 ± 0.179
0.457PheTyr: 0.457 ± 0.126
0.0PheXaa: 0.0 ± 0.0
Gly
7.156GlyAla: 7.156 ± 0.92
0.558GlyCys: 0.558 ± 0.189
4.263GlyAsp: 4.263 ± 0.385
4.466GlyGlu: 4.466 ± 0.387
2.385GlyPhe: 2.385 ± 0.346
5.887GlyGly: 5.887 ± 0.796
1.37GlyHis: 1.37 ± 0.253
4.009GlyIle: 4.009 ± 0.529
3.451GlyLys: 3.451 ± 0.409
7.004GlyLeu: 7.004 ± 0.435
1.776GlyMet: 1.776 ± 0.273
2.132GlyAsn: 2.132 ± 0.38
3.4GlyPro: 3.4 ± 0.452
2.385GlyGln: 2.385 ± 0.327
5.887GlyArg: 5.887 ± 0.646
5.735GlySer: 5.735 ± 1.344
7.054GlyThr: 7.054 ± 0.85
5.735GlyVal: 5.735 ± 0.491
1.776GlyTrp: 1.776 ± 0.277
2.791GlyTyr: 2.791 ± 0.454
0.0GlyXaa: 0.0 ± 0.0
His
2.182HisAla: 2.182 ± 0.425
0.152HisCys: 0.152 ± 0.099
1.523HisAsp: 1.523 ± 0.251
1.269HisGlu: 1.269 ± 0.261
0.66HisPhe: 0.66 ± 0.182
1.37HisGly: 1.37 ± 0.249
0.406HisHis: 0.406 ± 0.145
1.269HisIle: 1.269 ± 0.25
0.558HisLys: 0.558 ± 0.164
1.218HisLeu: 1.218 ± 0.276
0.254HisMet: 0.254 ± 0.104
0.558HisAsn: 0.558 ± 0.157
1.878HisPro: 1.878 ± 0.388
0.711HisGln: 0.711 ± 0.183
1.776HisArg: 1.776 ± 0.289
1.066HisSer: 1.066 ± 0.252
1.269HisThr: 1.269 ± 0.264
1.573HisVal: 1.573 ± 0.274
0.355HisTrp: 0.355 ± 0.131
0.558HisTyr: 0.558 ± 0.182
0.0HisXaa: 0.0 ± 0.0
Ile
6.75IleAla: 6.75 ± 0.596
0.102IleCys: 0.102 ± 0.08
3.959IleAsp: 3.959 ± 0.487
3.451IleGlu: 3.451 ± 0.497
0.863IlePhe: 0.863 ± 0.161
4.263IleGly: 4.263 ± 0.486
1.218IleHis: 1.218 ± 0.293
2.132IleIle: 2.132 ± 0.371
1.523IleLys: 1.523 ± 0.351
3.553IleLeu: 3.553 ± 0.476
0.812IleMet: 0.812 ± 0.24
1.726IleAsn: 1.726 ± 0.273
2.944IlePro: 2.944 ± 0.447
1.32IleGln: 1.32 ± 0.294
3.502IleArg: 3.502 ± 0.386
2.741IleSer: 2.741 ± 0.499
3.197IleThr: 3.197 ± 0.463
4.314IleVal: 4.314 ± 0.569
1.066IleTrp: 1.066 ± 0.221
1.015IleTyr: 1.015 ± 0.254
0.0IleXaa: 0.0 ± 0.0
Lys
4.568LysAla: 4.568 ± 0.627
0.051LysCys: 0.051 ± 0.052
1.675LysAsp: 1.675 ± 0.277
1.421LysGlu: 1.421 ± 0.298
0.964LysPhe: 0.964 ± 0.236
2.538LysGly: 2.538 ± 0.423
0.558LysHis: 0.558 ± 0.233
1.979LysIle: 1.979 ± 0.279
1.573LysLys: 1.573 ± 0.353
3.502LysLeu: 3.502 ± 0.475
0.914LysMet: 0.914 ± 0.216
1.117LysAsn: 1.117 ± 0.219
1.827LysPro: 1.827 ± 0.295
1.32LysGln: 1.32 ± 0.242
2.081LysArg: 2.081 ± 0.341
1.979LysSer: 1.979 ± 0.313
2.03LysThr: 2.03 ± 0.339
2.335LysVal: 2.335 ± 0.325
0.812LysTrp: 0.812 ± 0.207
0.812LysTyr: 0.812 ± 0.203
0.0LysXaa: 0.0 ± 0.0
Leu
8.932LeuAla: 8.932 ± 0.695
0.457LeuCys: 0.457 ± 0.172
5.532LeuAsp: 5.532 ± 0.506
5.329LeuGlu: 5.329 ± 0.564
1.37LeuPhe: 1.37 ± 0.276
5.684LeuGly: 5.684 ± 0.586
1.523LeuHis: 1.523 ± 0.311
4.618LeuIle: 4.618 ± 0.49
2.335LeuLys: 2.335 ± 0.393
5.786LeuLeu: 5.786 ± 0.816
1.573LeuMet: 1.573 ± 0.285
2.03LeuAsn: 2.03 ± 0.295
4.517LeuPro: 4.517 ± 0.57
2.944LeuGln: 2.944 ± 0.411
5.329LeuArg: 5.329 ± 0.551
3.959LeuSer: 3.959 ± 0.417
5.583LeuThr: 5.583 ± 0.415
5.989LeuVal: 5.989 ± 0.671
1.218LeuTrp: 1.218 ± 0.2
0.863LeuTyr: 0.863 ± 0.242
0.0LeuXaa: 0.0 ± 0.0
Met
2.639MetAla: 2.639 ± 0.356
0.355MetCys: 0.355 ± 0.17
1.218MetAsp: 1.218 ± 0.295
0.964MetGlu: 0.964 ± 0.242
0.761MetPhe: 0.761 ± 0.2
1.675MetGly: 1.675 ± 0.277
0.305MetHis: 0.305 ± 0.155
0.964MetIle: 0.964 ± 0.198
1.117MetLys: 1.117 ± 0.253
2.284MetLeu: 2.284 ± 0.309
0.558MetMet: 0.558 ± 0.145
0.66MetAsn: 0.66 ± 0.192
1.675MetPro: 1.675 ± 0.38
0.863MetGln: 0.863 ± 0.231
1.167MetArg: 1.167 ± 0.263
2.081MetSer: 2.081 ± 0.37
2.893MetThr: 2.893 ± 0.427
1.624MetVal: 1.624 ± 0.301
0.711MetTrp: 0.711 ± 0.187
0.305MetTyr: 0.305 ± 0.132
0.0MetXaa: 0.0 ± 0.0
Asn
4.009AsnAla: 4.009 ± 0.479
0.051AsnCys: 0.051 ± 0.04
1.979AsnAsp: 1.979 ± 0.41
1.421AsnGlu: 1.421 ± 0.3
0.609AsnPhe: 0.609 ± 0.151
3.147AsnGly: 3.147 ± 0.415
0.254AsnHis: 0.254 ± 0.133
0.609AsnIle: 0.609 ± 0.15
0.812AsnLys: 0.812 ± 0.211
1.878AsnLeu: 1.878 ± 0.324
0.355AsnMet: 0.355 ± 0.14
0.711AsnAsn: 0.711 ± 0.192
2.588AsnPro: 2.588 ± 0.423
0.558AsnGln: 0.558 ± 0.178
1.218AsnArg: 1.218 ± 0.236
1.776AsnSer: 1.776 ± 0.253
1.167AsnThr: 1.167 ± 0.246
2.741AsnVal: 2.741 ± 0.306
0.508AsnTrp: 0.508 ± 0.127
0.812AsnTyr: 0.812 ± 0.174
0.0AsnXaa: 0.0 ± 0.0
Pro
7.156ProAla: 7.156 ± 0.924
0.254ProCys: 0.254 ± 0.128
4.821ProAsp: 4.821 ± 0.48
4.72ProGlu: 4.72 ± 0.673
1.573ProPhe: 1.573 ± 0.317
3.197ProGly: 3.197 ± 0.412
1.269ProHis: 1.269 ± 0.292
2.944ProIle: 2.944 ± 0.452
2.03ProLys: 2.03 ± 0.292
3.908ProLeu: 3.908 ± 0.491
1.472ProMet: 1.472 ± 0.283
1.523ProAsn: 1.523 ± 0.259
2.944ProPro: 2.944 ± 0.424
1.421ProGln: 1.421 ± 0.305
3.147ProArg: 3.147 ± 0.456
3.553ProSer: 3.553 ± 0.41
4.162ProThr: 4.162 ± 0.517
4.314ProVal: 4.314 ± 0.455
1.117ProTrp: 1.117 ± 0.251
1.269ProTyr: 1.269 ± 0.257
0.0ProXaa: 0.0 ± 0.0
Gln
3.299GlnAla: 3.299 ± 0.392
0.102GlnCys: 0.102 ± 0.073
1.472GlnAsp: 1.472 ± 0.278
2.03GlnGlu: 2.03 ± 0.332
0.863GlnPhe: 0.863 ± 0.209
2.182GlnGly: 2.182 ± 0.336
0.964GlnHis: 0.964 ± 0.26
1.979GlnIle: 1.979 ± 0.339
1.218GlnLys: 1.218 ± 0.267
3.045GlnLeu: 3.045 ± 0.446
1.066GlnMet: 1.066 ± 0.256
1.117GlnAsn: 1.117 ± 0.217
2.284GlnPro: 2.284 ± 0.327
2.132GlnGln: 2.132 ± 0.322
2.893GlnArg: 2.893 ± 0.317
2.03GlnSer: 2.03 ± 0.32
1.929GlnThr: 1.929 ± 0.342
2.284GlnVal: 2.284 ± 0.324
0.66GlnTrp: 0.66 ± 0.146
1.32GlnTyr: 1.32 ± 0.324
0.0GlnXaa: 0.0 ± 0.0
Arg
5.938ArgAla: 5.938 ± 0.525
0.812ArgCys: 0.812 ± 0.242
3.248ArgAsp: 3.248 ± 0.412
4.06ArgGlu: 4.06 ± 0.46
2.284ArgPhe: 2.284 ± 0.315
5.329ArgGly: 5.329 ± 0.603
1.573ArgHis: 1.573 ± 0.358
3.654ArgIle: 3.654 ± 0.442
2.182ArgLys: 2.182 ± 0.323
5.887ArgLeu: 5.887 ± 0.591
2.335ArgMet: 2.335 ± 0.343
1.827ArgAsn: 1.827 ± 0.283
2.994ArgPro: 2.994 ± 0.43
2.182ArgGln: 2.182 ± 0.39
5.329ArgArg: 5.329 ± 0.755
3.603ArgSer: 3.603 ± 0.467
4.415ArgThr: 4.415 ± 0.509
4.517ArgVal: 4.517 ± 0.557
1.015ArgTrp: 1.015 ± 0.217
1.37ArgTyr: 1.37 ± 0.256
0.0ArgXaa: 0.0 ± 0.0
Ser
7.004SerAla: 7.004 ± 1.049
0.66SerCys: 0.66 ± 0.173
3.096SerAsp: 3.096 ± 0.447
2.944SerGlu: 2.944 ± 0.338
1.979SerPhe: 1.979 ± 0.34
5.177SerGly: 5.177 ± 0.773
0.812SerHis: 0.812 ± 0.197
2.842SerIle: 2.842 ± 0.356
2.588SerLys: 2.588 ± 0.341
4.669SerLeu: 4.669 ± 0.485
1.675SerMet: 1.675 ± 0.296
1.624SerAsn: 1.624 ± 0.258
3.603SerPro: 3.603 ± 0.383
2.03SerGln: 2.03 ± 0.452
2.893SerArg: 2.893 ± 0.299
2.994SerSer: 2.994 ± 0.46
4.974SerThr: 4.974 ± 0.578
3.502SerVal: 3.502 ± 0.506
1.218SerTrp: 1.218 ± 0.245
0.964SerTyr: 0.964 ± 0.256
0.0SerXaa: 0.0 ± 0.0
Thr
6.902ThrAla: 6.902 ± 0.721
0.609ThrCys: 0.609 ± 0.191
4.923ThrAsp: 4.923 ± 0.671
2.791ThrGlu: 2.791 ± 0.47
2.335ThrPhe: 2.335 ± 0.365
6.801ThrGly: 6.801 ± 0.817
1.624ThrHis: 1.624 ± 0.295
4.618ThrIle: 4.618 ± 0.594
2.69ThrLys: 2.69 ± 0.382
5.024ThrLeu: 5.024 ± 0.453
1.421ThrMet: 1.421 ± 0.262
1.878ThrAsn: 1.878 ± 0.281
5.633ThrPro: 5.633 ± 0.626
1.929ThrGln: 1.929 ± 0.306
3.603ThrArg: 3.603 ± 0.407
3.806ThrSer: 3.806 ± 0.702
5.532ThrThr: 5.532 ± 0.676
5.583ThrVal: 5.583 ± 0.556
1.472ThrTrp: 1.472 ± 0.322
2.081ThrTyr: 2.081 ± 0.366
0.0ThrXaa: 0.0 ± 0.0
Val
8.983ValAla: 8.983 ± 0.609
0.457ValCys: 0.457 ± 0.159
6.851ValAsp: 6.851 ± 0.494
5.38ValGlu: 5.38 ± 0.569
1.929ValPhe: 1.929 ± 0.304
5.278ValGly: 5.278 ± 0.706
1.929ValHis: 1.929 ± 0.318
3.4ValIle: 3.4 ± 0.545
2.944ValLys: 2.944 ± 0.34
5.481ValLeu: 5.481 ± 0.614
2.182ValMet: 2.182 ± 0.38
1.624ValAsn: 1.624 ± 0.33
3.806ValPro: 3.806 ± 0.548
2.791ValGln: 2.791 ± 0.362
4.263ValArg: 4.263 ± 0.535
4.669ValSer: 4.669 ± 0.451
7.207ValThr: 7.207 ± 0.782
7.765ValVal: 7.765 ± 0.723
2.03ValTrp: 2.03 ± 0.374
1.726ValTyr: 1.726 ± 0.31
0.0ValXaa: 0.0 ± 0.0
Trp
1.675TrpAla: 1.675 ± 0.337
0.102TrpCys: 0.102 ± 0.077
1.218TrpAsp: 1.218 ± 0.245
1.32TrpGlu: 1.32 ± 0.254
0.66TrpPhe: 0.66 ± 0.167
1.269TrpGly: 1.269 ± 0.273
0.355TrpHis: 0.355 ± 0.145
0.914TrpIle: 0.914 ± 0.222
0.711TrpLys: 0.711 ± 0.191
1.776TrpLeu: 1.776 ± 0.294
0.609TrpMet: 0.609 ± 0.19
1.066TrpAsn: 1.066 ± 0.283
0.558TrpPro: 0.558 ± 0.19
1.269TrpGln: 1.269 ± 0.223
1.472TrpArg: 1.472 ± 0.269
2.132TrpSer: 2.132 ± 0.285
1.624TrpThr: 1.624 ± 0.287
1.573TrpVal: 1.573 ± 0.271
0.406TrpTrp: 0.406 ± 0.134
0.457TrpTyr: 0.457 ± 0.131
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.284TyrAla: 2.284 ± 0.283
0.203TyrCys: 0.203 ± 0.105
2.03TyrAsp: 2.03 ± 0.4
1.827TyrGlu: 1.827 ± 0.359
0.558TyrPhe: 0.558 ± 0.174
1.776TyrGly: 1.776 ± 0.285
0.355TyrHis: 0.355 ± 0.126
0.863TyrIle: 0.863 ± 0.186
0.558TyrLys: 0.558 ± 0.165
1.624TyrLeu: 1.624 ± 0.272
0.102TyrMet: 0.102 ± 0.066
0.609TyrAsn: 0.609 ± 0.183
0.964TyrPro: 0.964 ± 0.204
1.015TyrGln: 1.015 ± 0.22
1.878TyrArg: 1.878 ± 0.316
1.624TyrSer: 1.624 ± 0.334
1.573TyrThr: 1.573 ± 0.35
2.436TyrVal: 2.436 ± 0.617
0.66TyrTrp: 0.66 ± 0.196
0.406TyrTyr: 0.406 ± 0.137
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 112 proteins (19705 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski