Amino acid dipepetide frequency for Microbacterium phage Mashley

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.921AlaAla: 17.921 ± 1.356
0.407AlaCys: 0.407 ± 0.15
7.84AlaAsp: 7.84 ± 0.613
7.993AlaGlu: 7.993 ± 0.875
2.698AlaPhe: 2.698 ± 0.399
8.757AlaGly: 8.757 ± 1.028
1.935AlaHis: 1.935 ± 0.337
6.058AlaIle: 6.058 ± 0.531
4.175AlaLys: 4.175 ± 0.467
10.284AlaLeu: 10.284 ± 0.906
2.647AlaMet: 2.647 ± 0.309
3.157AlaAsn: 3.157 ± 0.407
6.16AlaPro: 6.16 ± 0.692
3.92AlaGln: 3.92 ± 0.501
7.942AlaArg: 7.942 ± 0.577
5.397AlaSer: 5.397 ± 0.96
8.095AlaThr: 8.095 ± 0.839
8.553AlaVal: 8.553 ± 0.761
1.935AlaTrp: 1.935 ± 0.346
3.106AlaTyr: 3.106 ± 0.353
0.0AlaXaa: 0.0 ± 0.0
Cys
0.764CysAla: 0.764 ± 0.218
0.153CysCys: 0.153 ± 0.088
0.764CysAsp: 0.764 ± 0.217
0.56CysGlu: 0.56 ± 0.184
0.356CysPhe: 0.356 ± 0.192
1.171CysGly: 1.171 ± 0.273
0.153CysHis: 0.153 ± 0.081
0.153CysIle: 0.153 ± 0.085
0.102CysLys: 0.102 ± 0.064
0.458CysLeu: 0.458 ± 0.162
0.0CysMet: 0.0 ± 0.0
0.204CysAsn: 0.204 ± 0.099
0.713CysPro: 0.713 ± 0.173
0.102CysGln: 0.102 ± 0.068
0.611CysArg: 0.611 ± 0.22
0.458CysSer: 0.458 ± 0.175
0.611CysThr: 0.611 ± 0.176
0.662CysVal: 0.662 ± 0.164
0.102CysTrp: 0.102 ± 0.072
0.204CysTyr: 0.204 ± 0.089
0.0CysXaa: 0.0 ± 0.0
Asp
8.044AspAla: 8.044 ± 0.712
0.611AspCys: 0.611 ± 0.129
5.142AspAsp: 5.142 ± 0.677
4.277AspGlu: 4.277 ± 0.509
2.189AspPhe: 2.189 ± 0.33
6.873AspGly: 6.873 ± 0.704
1.426AspHis: 1.426 ± 0.289
2.546AspIle: 2.546 ± 0.442
1.629AspLys: 1.629 ± 0.316
5.295AspLeu: 5.295 ± 0.586
1.426AspMet: 1.426 ± 0.253
1.731AspAsn: 1.731 ± 0.261
4.124AspPro: 4.124 ± 0.399
2.546AspGln: 2.546 ± 0.373
4.327AspArg: 4.327 ± 0.421
2.698AspSer: 2.698 ± 0.379
3.004AspThr: 3.004 ± 0.31
4.378AspVal: 4.378 ± 0.475
1.476AspTrp: 1.476 ± 0.268
1.68AspTyr: 1.68 ± 0.441
0.0AspXaa: 0.0 ± 0.0
Glu
7.484GluAla: 7.484 ± 0.584
0.356GluCys: 0.356 ± 0.137
3.818GluAsp: 3.818 ± 0.448
4.022GluGlu: 4.022 ± 0.511
1.935GluPhe: 1.935 ± 0.275
4.633GluGly: 4.633 ± 0.481
1.578GluHis: 1.578 ± 0.335
3.564GluIle: 3.564 ± 0.474
2.393GluLys: 2.393 ± 0.472
5.498GluLeu: 5.498 ± 0.691
1.629GluMet: 1.629 ± 0.323
2.189GluAsn: 2.189 ± 0.345
4.429GluPro: 4.429 ± 0.778
3.411GluGln: 3.411 ± 0.525
5.448GluArg: 5.448 ± 0.624
2.189GluSer: 2.189 ± 0.326
3.258GluThr: 3.258 ± 0.406
5.702GluVal: 5.702 ± 0.608
1.324GluTrp: 1.324 ± 0.264
1.476GluTyr: 1.476 ± 0.278
0.0GluXaa: 0.0 ± 0.0
Phe
2.8PheAla: 2.8 ± 0.393
0.255PheCys: 0.255 ± 0.114
2.546PheAsp: 2.546 ± 0.371
1.935PheGlu: 1.935 ± 0.283
0.305PhePhe: 0.305 ± 0.131
3.309PheGly: 3.309 ± 0.427
0.56PheHis: 0.56 ± 0.209
1.171PheIle: 1.171 ± 0.248
0.611PheLys: 0.611 ± 0.171
2.036PheLeu: 2.036 ± 0.347
0.509PheMet: 0.509 ± 0.183
0.204PheAsn: 0.204 ± 0.095
1.375PhePro: 1.375 ± 0.332
0.611PheGln: 0.611 ± 0.157
1.833PheArg: 1.833 ± 0.368
1.782PheSer: 1.782 ± 0.314
2.8PheThr: 2.8 ± 0.374
1.375PheVal: 1.375 ± 0.25
0.153PheTrp: 0.153 ± 0.12
1.222PheTyr: 1.222 ± 0.291
0.0PheXaa: 0.0 ± 0.0
Gly
7.891GlyAla: 7.891 ± 0.742
1.12GlyCys: 1.12 ± 0.273
5.448GlyAsp: 5.448 ± 0.59
5.142GlyGlu: 5.142 ± 0.542
3.106GlyPhe: 3.106 ± 0.434
7.688GlyGly: 7.688 ± 0.635
1.375GlyHis: 1.375 ± 0.326
3.207GlyIle: 3.207 ± 0.449
3.564GlyLys: 3.564 ± 0.457
5.906GlyLeu: 5.906 ± 0.575
2.444GlyMet: 2.444 ± 0.375
2.495GlyAsn: 2.495 ± 0.386
3.411GlyPro: 3.411 ± 0.391
2.902GlyGln: 2.902 ± 0.632
6.517GlyArg: 6.517 ± 0.66
5.346GlySer: 5.346 ± 0.649
6.669GlyThr: 6.669 ± 1.093
7.128GlyVal: 7.128 ± 0.637
1.833GlyTrp: 1.833 ± 0.338
2.8GlyTyr: 2.8 ± 0.346
0.0GlyXaa: 0.0 ± 0.0
His
1.884HisAla: 1.884 ± 0.289
0.153HisCys: 0.153 ± 0.089
1.018HisAsp: 1.018 ± 0.246
1.273HisGlu: 1.273 ± 0.26
0.611HisPhe: 0.611 ± 0.139
1.731HisGly: 1.731 ± 0.367
0.865HisHis: 0.865 ± 0.248
0.662HisIle: 0.662 ± 0.25
0.255HisLys: 0.255 ± 0.102
1.578HisLeu: 1.578 ± 0.243
0.356HisMet: 0.356 ± 0.129
0.611HisAsn: 0.611 ± 0.16
1.578HisPro: 1.578 ± 0.292
0.509HisGln: 0.509 ± 0.149
1.527HisArg: 1.527 ± 0.331
0.764HisSer: 0.764 ± 0.217
0.56HisThr: 0.56 ± 0.158
1.375HisVal: 1.375 ± 0.29
0.356HisTrp: 0.356 ± 0.118
0.407HisTyr: 0.407 ± 0.147
0.0HisXaa: 0.0 ± 0.0
Ile
5.906IleAla: 5.906 ± 0.568
0.356IleCys: 0.356 ± 0.16
3.869IleAsp: 3.869 ± 0.407
3.309IleGlu: 3.309 ± 0.465
1.527IlePhe: 1.527 ± 0.312
4.277IleGly: 4.277 ± 0.609
0.815IleHis: 0.815 ± 0.268
2.902IleIle: 2.902 ± 0.381
0.662IleLys: 0.662 ± 0.213
2.647IleLeu: 2.647 ± 0.319
0.764IleMet: 0.764 ± 0.203
1.426IleAsn: 1.426 ± 0.237
2.851IlePro: 2.851 ± 0.504
1.324IleGln: 1.324 ± 0.249
3.462IleArg: 3.462 ± 0.444
2.596IleSer: 2.596 ± 0.405
4.175IleThr: 4.175 ± 0.482
4.022IleVal: 4.022 ± 0.463
0.865IleTrp: 0.865 ± 0.222
0.916IleTyr: 0.916 ± 0.225
0.0IleXaa: 0.0 ± 0.0
Lys
3.717LysAla: 3.717 ± 0.526
0.153LysCys: 0.153 ± 0.119
1.273LysAsp: 1.273 ± 0.169
1.884LysGlu: 1.884 ± 0.363
0.967LysPhe: 0.967 ± 0.171
1.782LysGly: 1.782 ± 0.313
0.509LysHis: 0.509 ± 0.167
1.069LysIle: 1.069 ± 0.253
0.356LysLys: 0.356 ± 0.155
2.596LysLeu: 2.596 ± 0.44
0.713LysMet: 0.713 ± 0.216
1.018LysAsn: 1.018 ± 0.206
1.68LysPro: 1.68 ± 0.279
0.967LysGln: 0.967 ± 0.234
1.833LysArg: 1.833 ± 0.282
1.629LysSer: 1.629 ± 0.274
1.833LysThr: 1.833 ± 0.408
1.884LysVal: 1.884 ± 0.371
0.458LysTrp: 0.458 ± 0.154
0.764LysTyr: 0.764 ± 0.232
0.0LysXaa: 0.0 ± 0.0
Leu
10.59LeuAla: 10.59 ± 0.87
0.305LeuCys: 0.305 ± 0.135
5.295LeuAsp: 5.295 ± 0.538
5.498LeuGlu: 5.498 ± 0.603
1.476LeuPhe: 1.476 ± 0.323
6.568LeuGly: 6.568 ± 0.649
0.916LeuHis: 0.916 ± 0.247
3.258LeuIle: 3.258 ± 0.426
2.036LeuLys: 2.036 ± 0.348
5.498LeuLeu: 5.498 ± 0.614
1.222LeuMet: 1.222 ± 0.257
3.004LeuAsn: 3.004 ± 0.473
3.462LeuPro: 3.462 ± 0.447
1.629LeuGln: 1.629 ± 0.295
5.193LeuArg: 5.193 ± 0.445
4.735LeuSer: 4.735 ± 0.565
5.906LeuThr: 5.906 ± 0.472
5.04LeuVal: 5.04 ± 0.594
1.171LeuTrp: 1.171 ± 0.224
1.375LeuTyr: 1.375 ± 0.26
0.0LeuXaa: 0.0 ± 0.0
Met
2.953MetAla: 2.953 ± 0.504
0.153MetCys: 0.153 ± 0.1
1.018MetAsp: 1.018 ± 0.24
0.713MetGlu: 0.713 ± 0.175
0.56MetPhe: 0.56 ± 0.158
1.629MetGly: 1.629 ± 0.288
0.56MetHis: 0.56 ± 0.155
1.171MetIle: 1.171 ± 0.244
0.815MetLys: 0.815 ± 0.175
2.036MetLeu: 2.036 ± 0.318
0.407MetMet: 0.407 ± 0.177
0.56MetAsn: 0.56 ± 0.161
1.68MetPro: 1.68 ± 0.283
0.967MetGln: 0.967 ± 0.285
1.68MetArg: 1.68 ± 0.264
2.444MetSer: 2.444 ± 0.368
1.527MetThr: 1.527 ± 0.25
1.476MetVal: 1.476 ± 0.245
0.255MetTrp: 0.255 ± 0.106
0.153MetTyr: 0.153 ± 0.096
0.0MetXaa: 0.0 ± 0.0
Asn
3.309AsnAla: 3.309 ± 0.435
0.204AsnCys: 0.204 ± 0.082
1.476AsnAsp: 1.476 ± 0.274
1.629AsnGlu: 1.629 ± 0.279
0.611AsnPhe: 0.611 ± 0.214
3.309AsnGly: 3.309 ± 0.408
0.356AsnHis: 0.356 ± 0.172
1.222AsnIle: 1.222 ± 0.236
0.407AsnLys: 0.407 ± 0.115
2.189AsnLeu: 2.189 ± 0.365
0.56AsnMet: 0.56 ± 0.188
0.713AsnAsn: 0.713 ± 0.214
2.444AsnPro: 2.444 ± 0.556
1.68AsnGln: 1.68 ± 0.289
1.527AsnArg: 1.527 ± 0.306
1.833AsnSer: 1.833 ± 0.323
2.546AsnThr: 2.546 ± 0.382
1.578AsnVal: 1.578 ± 0.247
0.865AsnTrp: 0.865 ± 0.199
0.764AsnTyr: 0.764 ± 0.199
0.0AsnXaa: 0.0 ± 0.0
Pro
7.178ProAla: 7.178 ± 0.962
0.611ProCys: 0.611 ± 0.2
3.717ProAsp: 3.717 ± 0.541
5.091ProGlu: 5.091 ± 0.649
1.069ProPhe: 1.069 ± 0.201
4.786ProGly: 4.786 ± 0.712
0.865ProHis: 0.865 ± 0.172
2.393ProIle: 2.393 ± 0.373
1.273ProLys: 1.273 ± 0.294
3.36ProLeu: 3.36 ± 0.386
0.916ProMet: 0.916 ± 0.186
1.986ProAsn: 1.986 ± 0.381
2.902ProPro: 2.902 ± 0.448
2.495ProGln: 2.495 ± 0.935
3.666ProArg: 3.666 ± 0.537
3.717ProSer: 3.717 ± 0.397
4.48ProThr: 4.48 ± 0.677
4.48ProVal: 4.48 ± 0.619
1.12ProTrp: 1.12 ± 0.203
1.476ProTyr: 1.476 ± 0.3
0.0ProXaa: 0.0 ± 0.0
Gln
4.684GlnAla: 4.684 ± 0.668
0.356GlnCys: 0.356 ± 0.149
1.069GlnAsp: 1.069 ± 0.189
2.036GlnGlu: 2.036 ± 0.358
1.018GlnPhe: 1.018 ± 0.225
2.749GlnGly: 2.749 ± 0.878
0.916GlnHis: 0.916 ± 0.197
2.087GlnIle: 2.087 ± 0.352
0.764GlnLys: 0.764 ± 0.188
2.444GlnLeu: 2.444 ± 0.288
1.222GlnMet: 1.222 ± 0.346
0.815GlnAsn: 0.815 ± 0.169
2.444GlnPro: 2.444 ± 0.837
3.106GlnGln: 3.106 ± 1.884
2.953GlnArg: 2.953 ± 0.363
1.782GlnSer: 1.782 ± 0.375
1.527GlnThr: 1.527 ± 0.202
2.546GlnVal: 2.546 ± 0.415
1.018GlnTrp: 1.018 ± 0.205
0.865GlnTyr: 0.865 ± 0.243
0.0GlnXaa: 0.0 ± 0.0
Arg
8.706ArgAla: 8.706 ± 0.641
0.967ArgCys: 0.967 ± 0.254
4.887ArgAsp: 4.887 ± 0.633
4.735ArgGlu: 4.735 ± 0.51
1.935ArgPhe: 1.935 ± 0.313
5.142ArgGly: 5.142 ± 0.608
1.171ArgHis: 1.171 ± 0.243
3.666ArgIle: 3.666 ± 0.512
2.546ArgLys: 2.546 ± 0.362
4.378ArgLeu: 4.378 ± 0.451
2.546ArgMet: 2.546 ± 0.343
1.986ArgAsn: 1.986 ± 0.296
2.189ArgPro: 2.189 ± 0.365
2.291ArgGln: 2.291 ± 0.334
6.72ArgArg: 6.72 ± 0.666
4.124ArgSer: 4.124 ± 0.383
3.971ArgThr: 3.971 ± 0.377
5.04ArgVal: 5.04 ± 0.521
1.222ArgTrp: 1.222 ± 0.251
2.087ArgTyr: 2.087 ± 0.3
0.0ArgXaa: 0.0 ± 0.0
Ser
6.517SerAla: 6.517 ± 1.036
0.356SerCys: 0.356 ± 0.145
2.953SerAsp: 2.953 ± 0.296
3.615SerGlu: 3.615 ± 0.485
1.324SerPhe: 1.324 ± 0.219
4.735SerGly: 4.735 ± 0.52
0.916SerHis: 0.916 ± 0.259
2.495SerIle: 2.495 ± 0.307
1.069SerLys: 1.069 ± 0.262
4.226SerLeu: 4.226 ± 0.51
1.629SerMet: 1.629 ± 0.26
1.426SerAsn: 1.426 ± 0.268
3.055SerPro: 3.055 ± 0.473
1.578SerGln: 1.578 ± 0.233
3.717SerArg: 3.717 ± 0.473
3.106SerSer: 3.106 ± 0.503
5.295SerThr: 5.295 ± 0.537
3.564SerVal: 3.564 ± 0.41
0.764SerTrp: 0.764 ± 0.232
1.935SerTyr: 1.935 ± 0.435
0.0SerXaa: 0.0 ± 0.0
Thr
7.077ThrAla: 7.077 ± 0.764
0.713ThrCys: 0.713 ± 0.221
4.073ThrAsp: 4.073 ± 0.634
4.938ThrGlu: 4.938 ± 0.485
2.189ThrPhe: 2.189 ± 0.273
6.058ThrGly: 6.058 ± 0.663
1.069ThrHis: 1.069 ± 0.263
4.022ThrIle: 4.022 ± 0.654
1.782ThrLys: 1.782 ± 0.276
6.008ThrLeu: 6.008 ± 0.541
1.527ThrMet: 1.527 ± 0.266
1.68ThrAsn: 1.68 ± 0.313
6.058ThrPro: 6.058 ± 0.732
1.986ThrGln: 1.986 ± 0.274
3.818ThrArg: 3.818 ± 0.457
3.004ThrSer: 3.004 ± 0.539
4.887ThrThr: 4.887 ± 0.676
6.211ThrVal: 6.211 ± 0.684
1.578ThrTrp: 1.578 ± 0.23
1.476ThrTyr: 1.476 ± 0.257
0.0ThrXaa: 0.0 ± 0.0
Val
7.128ValAla: 7.128 ± 0.625
0.611ValCys: 0.611 ± 0.187
5.957ValAsp: 5.957 ± 0.491
4.684ValGlu: 4.684 ± 0.489
2.087ValPhe: 2.087 ± 0.367
6.72ValGly: 6.72 ± 0.633
1.375ValHis: 1.375 ± 0.248
5.04ValIle: 5.04 ± 0.459
1.731ValLys: 1.731 ± 0.355
4.887ValLeu: 4.887 ± 0.504
1.68ValMet: 1.68 ± 0.31
2.546ValAsn: 2.546 ± 0.307
4.582ValPro: 4.582 ± 0.81
2.647ValGln: 2.647 ± 0.494
4.327ValArg: 4.327 ± 0.512
4.022ValSer: 4.022 ± 0.435
5.906ValThr: 5.906 ± 0.639
5.448ValVal: 5.448 ± 0.534
1.629ValTrp: 1.629 ± 0.232
1.578ValTyr: 1.578 ± 0.287
0.0ValXaa: 0.0 ± 0.0
Trp
2.036TrpAla: 2.036 ± 0.303
0.204TrpCys: 0.204 ± 0.099
1.171TrpAsp: 1.171 ± 0.183
1.222TrpGlu: 1.222 ± 0.29
0.815TrpPhe: 0.815 ± 0.239
1.171TrpGly: 1.171 ± 0.245
0.407TrpHis: 0.407 ± 0.143
0.815TrpIle: 0.815 ± 0.213
0.509TrpLys: 0.509 ± 0.178
1.527TrpLeu: 1.527 ± 0.297
0.255TrpMet: 0.255 ± 0.106
0.56TrpAsn: 0.56 ± 0.15
0.916TrpPro: 0.916 ± 0.221
0.764TrpGln: 0.764 ± 0.216
1.12TrpArg: 1.12 ± 0.298
1.222TrpSer: 1.222 ± 0.277
1.935TrpThr: 1.935 ± 0.296
1.527TrpVal: 1.527 ± 0.245
0.713TrpTrp: 0.713 ± 0.207
0.407TrpTyr: 0.407 ± 0.137
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.393TyrAla: 2.393 ± 0.332
0.255TyrCys: 0.255 ± 0.095
2.087TyrAsp: 2.087 ± 0.392
1.782TyrGlu: 1.782 ± 0.303
0.611TyrPhe: 0.611 ± 0.203
2.902TyrGly: 2.902 ± 0.394
0.305TyrHis: 0.305 ± 0.116
1.171TyrIle: 1.171 ± 0.232
0.407TyrLys: 0.407 ± 0.163
1.324TyrLeu: 1.324 ± 0.262
0.255TyrMet: 0.255 ± 0.116
0.916TyrAsn: 0.916 ± 0.229
1.527TyrPro: 1.527 ± 0.265
0.967TyrGln: 0.967 ± 0.227
2.087TyrArg: 2.087 ± 0.34
1.476TyrSer: 1.476 ± 0.193
1.171TyrThr: 1.171 ± 0.21
2.596TyrVal: 2.596 ± 0.372
0.458TyrTrp: 0.458 ± 0.158
0.305TyrTyr: 0.305 ± 0.097
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 103 proteins (19643 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski