Amino acid dipeptide frequency for Microbacterium phage Alleb

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
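As an illustration, dipeptide frequencies can be counted with a sliding window of length two. The sketch below is a minimal example and not the database's own code: the function name `dipeptide_frequencies` is hypothetical, sequences use one-letter codes (the table uses three-letter codes), and it assumes that pairs are counted within each protein only, never across protein boundaries.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} pooled over all given protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; assumption: a pair never spans
        # the end of one protein and the start of the next.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input: "MAAG" yields MA, AA, AG and "MAG" yields MA, AG (5 pairs in total).
toy = ["MAAG", "MAG"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, freqs[pair])
```

Multiplying the returned fractions by 1000 gives values on the same per mille scale as the table.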

There are 400 possible dipeptides (441 if we consider Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
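The uniform expectation and the red/blue marking can be sketched as follows. This is only an illustration: the helper names are hypothetical, and the rule used here (a plain comparison with the uniform expectation, ignoring the error estimate) is an assumption, since the page does not state the exact threshold it applies.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def classify(freq_permille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(alphabet_size)
    if freq_permille > expected:
        return "red"   # overrepresented
    if freq_permille < expected:
        return "blue"  # underrepresented
    return "none"      # exactly at the expectation


print(expected_permille())   # 2.5 (20 amino acids, 400 dipeptides)
print(classify(11.374))      # AlaAla from the table: red
print(classify(0.053))       # CysCys from the table: blue
```

Passing `alphabet_size=21` gives the expectation for the 441-dipeptide case that includes Xaa.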

All values are presented in per mille (‰); to obtain fractions, multiply them by 10^-3 (for example, 11.374‰ = 0.011374).

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency in ‰ ± error.

| 1st / 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 11.374 ± 0.945 | 0.423 ± 0.151 | 4.761 ± 0.499 | 8.2 ± 0.887 | 3.174 ± 0.399 | 7.671 ± 0.943 | 1.323 ± 0.348 | 4.814 ± 0.519 | 4.55 ± 0.53 | 9.364 ± 0.859 | 3.386 ± 0.486 | 4.761 ± 0.503 | 3.703 ± 0.341 | 3.915 ± 0.445 | 7.671 ± 0.672 | 5.502 ± 0.465 | 6.295 ± 0.582 | 5.925 ± 0.622 | 1.746 ± 0.293 | 2.698 ± 0.331 | 0.0 ± 0.0 |
| Cys | 0.476 ± 0.151 | 0.053 ± 0.05 | 0.37 ± 0.136 | 0.265 ± 0.142 | 0.212 ± 0.094 | 0.423 ± 0.126 | 0.053 ± 0.05 | 0.317 ± 0.139 | 0.476 ± 0.174 | 0.053 ± 0.057 | 0.0 ± 0.0 | 0.053 ± 0.045 | 0.159 ± 0.121 | 0.159 ± 0.083 | 0.265 ± 0.117 | 0.159 ± 0.084 | 0.212 ± 0.118 | 0.159 ± 0.092 | 0.106 ± 0.076 | 0.159 ± 0.102 | 0.0 ± 0.0 |
| Asp | 6.295 ± 0.695 | 0.317 ± 0.137 | 4.444 ± 0.453 | 5.502 ± 0.707 | 1.957 ± 0.328 | 5.343 ± 0.492 | 0.899 ± 0.227 | 2.857 ± 0.401 | 2.751 ± 0.509 | 5.079 ± 0.466 | 1.481 ± 0.28 | 1.693 ± 0.309 | 3.703 ± 0.453 | 1.323 ± 0.253 | 3.703 ± 0.584 | 4.391 ± 0.434 | 3.174 ± 0.479 | 4.391 ± 0.443 | 1.481 ± 0.266 | 2.222 ± 0.355 | 0.0 ± 0.0 |
| Glu | 7.195 ± 0.692 | 0.529 ± 0.193 | 4.126 ± 0.516 | 5.184 ± 0.655 | 2.804 ± 0.343 | 5.131 ± 0.533 | 1.746 ± 0.334 | 4.708 ± 0.593 | 4.232 ± 0.588 | 4.708 ± 0.544 | 1.746 ± 0.362 | 2.592 ± 0.409 | 2.539 ± 0.382 | 3.015 ± 0.369 | 5.502 ± 0.647 | 3.968 ± 0.567 | 4.232 ± 0.508 | 5.925 ± 0.584 | 1.375 ± 0.284 | 2.433 ± 0.351 | 0.0 ± 0.0 |
| Phe | 3.121 ± 0.37 | 0.106 ± 0.08 | 2.592 ± 0.314 | 2.645 ± 0.424 | 0.794 ± 0.2 | 2.804 ± 0.404 | 0.476 ± 0.135 | 1.746 ± 0.287 | 1.217 ± 0.197 | 1.693 ± 0.286 | 1.217 ± 0.231 | 1.323 ± 0.247 | 1.375 ± 0.269 | 1.111 ± 0.277 | 2.116 ± 0.327 | 2.222 ± 0.447 | 2.645 ± 0.319 | 2.751 ± 0.49 | 0.741 ± 0.179 | 0.952 ± 0.244 | 0.0 ± 0.0 |
| Gly | 6.877 ± 0.805 | 0.423 ± 0.137 | 5.925 ± 1.075 | 6.56 ± 0.739 | 4.179 ± 0.539 | 8.306 ± 0.954 | 1.164 ± 0.22 | 4.021 ± 0.523 | 2.804 ± 0.398 | 6.084 ± 0.756 | 2.222 ± 0.387 | 2.645 ± 0.363 | 3.333 ± 0.749 | 3.492 ± 0.521 | 6.719 ± 0.668 | 4.92 ± 0.468 | 5.872 ± 0.735 | 6.613 ± 0.601 | 1.534 ± 0.388 | 2.751 ± 0.38 | 0.0 ± 0.0 |
| His | 1.323 ± 0.295 | 0.053 ± 0.057 | 1.058 ± 0.289 | 1.111 ± 0.259 | 0.635 ± 0.173 | 1.058 ± 0.264 | 0.529 ± 0.175 | 1.111 ± 0.26 | 0.688 ± 0.237 | 1.058 ± 0.216 | 0.423 ± 0.146 | 0.582 ± 0.164 | 1.005 ± 0.298 | 0.688 ± 0.185 | 1.111 ± 0.235 | 0.952 ± 0.248 | 1.534 ± 0.307 | 1.164 ± 0.236 | 0.423 ± 0.133 | 0.529 ± 0.178 | 0.0 ± 0.0 |
| Ile | 5.819 ± 0.608 | 0.159 ± 0.09 | 3.968 ± 0.453 | 5.502 ± 0.49 | 1.005 ± 0.22 | 3.121 ± 0.438 | 0.688 ± 0.156 | 2.381 ± 0.366 | 2.433 ± 0.361 | 2.804 ± 0.444 | 0.794 ± 0.209 | 2.01 ± 0.314 | 2.539 ± 0.352 | 2.275 ± 0.373 | 3.756 ± 0.454 | 2.645 ± 0.324 | 3.121 ± 0.405 | 3.386 ± 0.41 | 0.794 ± 0.187 | 1.746 ± 0.291 | 0.0 ± 0.0 |
| Lys | 5.29 ± 0.6 | 0.0 ± 0.0 | 1.957 ± 0.436 | 2.539 ± 0.437 | 1.693 ± 0.321 | 3.544 ± 0.769 | 0.899 ± 0.208 | 2.328 ± 0.396 | 3.333 ± 0.57 | 3.28 ± 0.6 | 0.952 ± 0.256 | 1.111 ± 0.268 | 1.852 ± 0.435 | 1.481 ± 0.28 | 2.962 ± 0.487 | 2.328 ± 0.406 | 2.539 ± 0.459 | 2.857 ± 0.388 | 0.899 ± 0.242 | 1.428 ± 0.32 | 0.0 ± 0.0 |
| Leu | 7.3 ± 0.684 | 0.265 ± 0.12 | 5.237 ± 0.636 | 4.602 ± 0.494 | 2.063 ± 0.292 | 5.925 ± 0.799 | 1.428 ± 0.322 | 3.227 ± 0.507 | 2.592 ± 0.451 | 4.073 ± 0.542 | 1.799 ± 0.353 | 2.804 ± 0.568 | 3.862 ± 0.427 | 2.116 ± 0.5 | 5.396 ± 0.486 | 4.761 ± 0.389 | 4.232 ± 0.541 | 4.232 ± 0.44 | 1.481 ± 0.277 | 1.904 ± 0.329 | 0.0 ± 0.0 |
| Met | 2.539 ± 0.389 | 0.159 ± 0.081 | 1.428 ± 0.364 | 1.217 ± 0.234 | 0.899 ± 0.202 | 1.852 ± 0.353 | 0.529 ± 0.165 | 0.794 ± 0.286 | 1.005 ± 0.237 | 1.164 ± 0.279 | 0.423 ± 0.171 | 0.952 ± 0.224 | 1.323 ± 0.273 | 0.635 ± 0.18 | 1.957 ± 0.306 | 2.645 ± 0.424 | 2.063 ± 0.36 | 1.428 ± 0.348 | 0.529 ± 0.194 | 0.529 ± 0.153 | 0.0 ± 0.0 |
| Asn | 3.756 ± 0.326 | 0.0 ± 0.0 | 1.375 ± 0.252 | 2.328 ± 0.353 | 0.952 ± 0.184 | 5.026 ± 0.577 | 0.741 ± 0.277 | 1.375 ± 0.285 | 0.635 ± 0.23 | 2.433 ± 0.301 | 1.005 ± 0.255 | 1.852 ± 0.336 | 2.539 ± 0.346 | 1.164 ± 0.369 | 2.645 ± 0.469 | 2.381 ± 0.423 | 1.852 ± 0.327 | 3.492 ± 0.427 | 0.688 ± 0.14 | 1.428 ± 0.3 | 0.0 ± 0.0 |
| Pro | 3.915 ± 0.583 | 0.106 ± 0.1 | 3.439 ± 0.44 | 3.915 ± 0.577 | 1.481 ± 0.296 | 4.338 ± 0.565 | 0.899 ± 0.229 | 2.381 ± 0.351 | 2.645 ± 0.442 | 2.91 ± 0.376 | 0.846 ± 0.201 | 1.957 ± 0.39 | 1.852 ± 0.334 | 1.27 ± 0.307 | 1.904 ± 0.352 | 2.645 ± 0.444 | 2.962 ± 0.474 | 3.915 ± 0.568 | 1.005 ± 0.261 | 1.323 ± 0.217 | 0.0 ± 0.0 |
| Gln | 3.597 ± 0.491 | 0.106 ± 0.07 | 1.852 ± 0.332 | 2.169 ± 0.317 | 1.111 ± 0.271 | 3.227 ± 0.601 | 0.423 ± 0.152 | 1.852 ± 0.328 | 1.693 ± 0.303 | 2.116 ± 0.43 | 0.635 ± 0.179 | 1.375 ± 0.288 | 1.217 ± 0.256 | 1.058 ± 0.289 | 2.328 ± 0.467 | 1.746 ± 0.328 | 1.587 ± 0.329 | 3.386 ± 0.356 | 0.37 ± 0.153 | 0.846 ± 0.188 | 0.0 ± 0.0 |
| Arg | 7.777 ± 0.778 | 0.37 ± 0.147 | 3.968 ± 0.506 | 5.131 ± 0.595 | 2.539 ± 0.307 | 5.978 ± 0.657 | 1.27 ± 0.315 | 4.391 ± 0.525 | 2.804 ± 0.506 | 5.502 ± 0.624 | 1.693 ± 0.332 | 2.857 ± 0.411 | 3.28 ± 0.528 | 1.746 ± 0.314 | 6.084 ± 0.721 | 2.698 ± 0.384 | 3.439 ± 0.365 | 5.449 ± 0.601 | 1.64 ± 0.3 | 2.328 ± 0.357 | 0.0 ± 0.0 |
| Ser | 5.026 ± 0.581 | 0.106 ± 0.06 | 4.338 ± 0.567 | 3.439 ± 0.452 | 2.063 ± 0.304 | 5.713 ± 0.715 | 1.217 ± 0.225 | 3.333 ± 0.395 | 3.068 ± 0.449 | 4.338 ± 0.525 | 1.534 ± 0.289 | 2.063 ± 0.404 | 2.592 ± 0.399 | 1.375 ± 0.33 | 3.862 ± 0.487 | 3.386 ± 0.496 | 4.073 ± 0.519 | 4.391 ± 0.529 | 1.217 ± 0.26 | 1.587 ± 0.34 | 0.0 ± 0.0 |
| Thr | 6.401 ± 0.716 | 0.423 ± 0.175 | 3.65 ± 0.507 | 3.968 ± 0.443 | 3.174 ± 0.428 | 6.93 ± 0.847 | 1.217 ± 0.301 | 2.751 ± 0.519 | 2.01 ± 0.309 | 4.655 ± 0.562 | 1.058 ± 0.25 | 2.328 ± 0.305 | 3.174 ± 0.406 | 1.957 ± 0.309 | 3.174 ± 0.391 | 3.121 ± 0.488 | 4.391 ± 0.498 | 5.079 ± 0.589 | 0.952 ± 0.174 | 2.381 ± 0.381 | 0.0 ± 0.0 |
| Val | 7.618 ± 0.626 | 0.317 ± 0.149 | 4.92 ± 0.425 | 5.713 ± 0.658 | 1.587 ± 0.339 | 6.084 ± 0.542 | 1.005 ± 0.181 | 4.073 ± 0.47 | 3.333 ± 0.651 | 4.973 ± 0.496 | 1.587 ± 0.255 | 2.698 ± 0.339 | 3.439 ± 0.49 | 2.275 ± 0.316 | 5.819 ± 0.614 | 4.92 ± 0.733 | 5.237 ± 0.541 | 6.242 ± 0.576 | 1.27 ± 0.253 | 1.746 ± 0.298 | 0.0 ± 0.0 |
| Trp | 1.852 ± 0.307 | 0.0 ± 0.0 | 1.428 ± 0.307 | 1.217 ± 0.228 | 0.635 ± 0.189 | 1.904 ± 0.269 | 0.212 ± 0.088 | 1.217 ± 0.212 | 0.317 ± 0.124 | 0.899 ± 0.203 | 0.476 ± 0.155 | 0.846 ± 0.205 | 0.741 ± 0.179 | 0.476 ± 0.183 | 1.799 ± 0.311 | 1.27 ± 0.26 | 1.217 ± 0.259 | 1.746 ± 0.308 | 0.582 ± 0.145 | 0.529 ± 0.154 | 0.0 ± 0.0 |
| Tyr | 3.703 ± 0.411 | 0.212 ± 0.114 | 2.169 ± 0.377 | 2.486 ± 0.319 | 0.688 ± 0.203 | 2.063 ± 0.362 | 0.317 ± 0.121 | 1.323 ± 0.229 | 0.688 ± 0.169 | 2.116 ± 0.348 | 0.741 ± 0.202 | 1.164 ± 0.266 | 1.481 ± 0.278 | 1.164 ± 0.228 | 2.169 ± 0.379 | 2.063 ± 0.32 | 2.116 ± 0.327 | 2.328 ± 0.385 | 0.529 ± 0.171 | 0.952 ± 0.23 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 109 proteins (18904 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
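A protein-level bootstrap of this kind can be sketched as below. The source states only that the error comes from bootstrapping (×100) at the protein level; everything else here is an assumption made for illustration: whole proteins are resampled with replacement, each replicate has as many proteins as the original set, the reported error is the standard deviation across replicates, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, pooled over the proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (assumed scheme).
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(resample, pair))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not taken from the phage proteome.
toy = ["MAAGK", "MKKG", "MAG", "MAAAL"]
print(dipeptide_permille(toy, "AA"), "+/-", bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins.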

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski