Amino acid dipepetide frequency for Microbacterium phage Fede

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.691AlaAla: 10.691 ± 1.114
0.232AlaCys: 0.232 ± 0.116
6.682AlaAsp: 6.682 ± 0.696
6.275AlaGlu: 6.275 ± 0.657
2.789AlaPhe: 2.789 ± 0.446
6.798AlaGly: 6.798 ± 0.706
1.336AlaHis: 1.336 ± 0.251
3.66AlaIle: 3.66 ± 0.361
3.312AlaLys: 3.312 ± 0.461
8.192AlaLeu: 8.192 ± 0.861
2.15AlaMet: 2.15 ± 0.545
3.951AlaAsn: 3.951 ± 0.487
5.055AlaPro: 5.055 ± 0.743
5.52AlaGln: 5.52 ± 0.923
5.868AlaArg: 5.868 ± 0.529
5.985AlaSer: 5.985 ± 1.003
6.275AlaThr: 6.275 ± 0.783
6.507AlaVal: 6.507 ± 0.534
1.743AlaTrp: 1.743 ± 0.37
2.557AlaTyr: 2.557 ± 0.437
0.0AlaXaa: 0.0 ± 0.0
Cys
0.116CysAla: 0.116 ± 0.084
0.116CysCys: 0.116 ± 0.12
0.232CysAsp: 0.232 ± 0.125
0.349CysGlu: 0.349 ± 0.149
0.349CysPhe: 0.349 ± 0.19
0.232CysGly: 0.232 ± 0.122
0.291CysHis: 0.291 ± 0.184
0.174CysIle: 0.174 ± 0.133
0.349CysLys: 0.349 ± 0.206
0.349CysLeu: 0.349 ± 0.148
0.116CysMet: 0.116 ± 0.085
0.116CysAsn: 0.116 ± 0.088
0.465CysPro: 0.465 ± 0.234
0.116CysGln: 0.116 ± 0.12
0.116CysArg: 0.116 ± 0.082
0.465CysSer: 0.465 ± 0.239
0.116CysThr: 0.116 ± 0.091
0.465CysVal: 0.465 ± 0.233
0.0CysTrp: 0.0 ± 0.0
0.174CysTyr: 0.174 ± 0.115
0.0CysXaa: 0.0 ± 0.0
Asp
7.437AspAla: 7.437 ± 0.646
0.349AspCys: 0.349 ± 0.159
4.881AspAsp: 4.881 ± 0.727
5.287AspGlu: 5.287 ± 0.689
1.801AspPhe: 1.801 ± 0.31
6.682AspGly: 6.682 ± 0.478
0.813AspHis: 0.813 ± 0.166
4.183AspIle: 4.183 ± 0.47
2.557AspLys: 2.557 ± 0.429
5.229AspLeu: 5.229 ± 0.531
1.743AspMet: 1.743 ± 0.343
2.731AspAsn: 2.731 ± 0.426
2.905AspPro: 2.905 ± 0.402
3.37AspGln: 3.37 ± 0.444
3.777AspArg: 3.777 ± 0.619
3.079AspSer: 3.079 ± 0.398
4.183AspThr: 4.183 ± 0.367
3.196AspVal: 3.196 ± 0.493
1.569AspTrp: 1.569 ± 0.251
1.859AspTyr: 1.859 ± 0.501
0.0AspXaa: 0.0 ± 0.0
Glu
5.636GluAla: 5.636 ± 0.734
0.116GluCys: 0.116 ± 0.091
3.835GluAsp: 3.835 ± 0.54
4.125GluGlu: 4.125 ± 0.675
2.557GluPhe: 2.557 ± 0.499
3.602GluGly: 3.602 ± 0.413
0.93GluHis: 0.93 ± 0.211
3.312GluIle: 3.312 ± 0.32
2.847GluLys: 2.847 ± 0.415
5.345GluLeu: 5.345 ± 0.552
2.15GluMet: 2.15 ± 0.383
2.905GluAsn: 2.905 ± 0.507
1.801GluPro: 1.801 ± 0.244
3.486GluGln: 3.486 ± 0.58
3.777GluArg: 3.777 ± 0.625
3.254GluSer: 3.254 ± 0.324
3.37GluThr: 3.37 ± 0.365
4.183GluVal: 4.183 ± 0.409
2.266GluTrp: 2.266 ± 0.4
2.208GluTyr: 2.208 ± 0.516
0.0GluXaa: 0.0 ± 0.0
Phe
2.557PheAla: 2.557 ± 0.425
0.407PheCys: 0.407 ± 0.168
3.37PheAsp: 3.37 ± 0.369
1.917PheGlu: 1.917 ± 0.469
1.569PhePhe: 1.569 ± 0.318
2.731PheGly: 2.731 ± 0.344
0.465PheHis: 0.465 ± 0.174
1.685PheIle: 1.685 ± 0.278
1.453PheLys: 1.453 ± 0.286
1.801PheLeu: 1.801 ± 0.329
1.22PheMet: 1.22 ± 0.397
1.801PheAsn: 1.801 ± 0.303
1.162PhePro: 1.162 ± 0.352
0.988PheGln: 0.988 ± 0.303
2.092PheArg: 2.092 ± 0.428
2.034PheSer: 2.034 ± 0.393
2.615PheThr: 2.615 ± 0.483
2.034PheVal: 2.034 ± 0.415
0.465PheTrp: 0.465 ± 0.183
1.046PheTyr: 1.046 ± 0.188
0.0PheXaa: 0.0 ± 0.0
Gly
7.321GlyAla: 7.321 ± 0.843
0.174GlyCys: 0.174 ± 0.113
4.764GlyAsp: 4.764 ± 0.51
4.706GlyGlu: 4.706 ± 0.531
2.557GlyPhe: 2.557 ± 0.32
7.67GlyGly: 7.67 ± 1.121
0.988GlyHis: 0.988 ± 0.255
3.835GlyIle: 3.835 ± 0.397
2.789GlyLys: 2.789 ± 0.553
6.914GlyLeu: 6.914 ± 1.058
1.975GlyMet: 1.975 ± 0.452
3.486GlyAsn: 3.486 ± 0.364
2.382GlyPro: 2.382 ± 0.246
3.079GlyGln: 3.079 ± 0.299
4.241GlyArg: 4.241 ± 0.668
5.055GlySer: 5.055 ± 0.521
5.81GlyThr: 5.81 ± 0.728
5.287GlyVal: 5.287 ± 0.613
1.511GlyTrp: 1.511 ± 0.235
2.789GlyTyr: 2.789 ± 0.335
0.0GlyXaa: 0.0 ± 0.0
His
1.394HisAla: 1.394 ± 0.238
0.291HisCys: 0.291 ± 0.144
1.336HisAsp: 1.336 ± 0.337
0.988HisGlu: 0.988 ± 0.236
0.465HisPhe: 0.465 ± 0.192
1.453HisGly: 1.453 ± 0.326
0.407HisHis: 0.407 ± 0.211
0.697HisIle: 0.697 ± 0.274
1.278HisLys: 1.278 ± 0.456
1.104HisLeu: 1.104 ± 0.241
0.349HisMet: 0.349 ± 0.114
0.872HisAsn: 0.872 ± 0.197
1.278HisPro: 1.278 ± 0.233
0.291HisGln: 0.291 ± 0.112
1.104HisArg: 1.104 ± 0.232
0.755HisSer: 0.755 ± 0.231
0.755HisThr: 0.755 ± 0.186
0.639HisVal: 0.639 ± 0.298
0.291HisTrp: 0.291 ± 0.114
0.174HisTyr: 0.174 ± 0.111
0.0HisXaa: 0.0 ± 0.0
Ile
5.229IleAla: 5.229 ± 0.476
0.174IleCys: 0.174 ± 0.145
3.602IleAsp: 3.602 ± 0.376
3.66IleGlu: 3.66 ± 0.673
1.511IlePhe: 1.511 ± 0.379
3.37IleGly: 3.37 ± 0.37
0.813IleHis: 0.813 ± 0.188
2.382IleIle: 2.382 ± 0.38
3.138IleLys: 3.138 ± 0.515
3.602IleLeu: 3.602 ± 0.542
0.872IleMet: 0.872 ± 0.201
2.789IleAsn: 2.789 ± 0.297
2.034IlePro: 2.034 ± 0.367
2.324IleGln: 2.324 ± 0.444
2.324IleArg: 2.324 ± 0.299
2.557IleSer: 2.557 ± 0.372
3.602IleThr: 3.602 ± 0.512
3.602IleVal: 3.602 ± 0.468
0.523IleTrp: 0.523 ± 0.17
1.569IleTyr: 1.569 ± 0.294
0.0IleXaa: 0.0 ± 0.0
Lys
4.416LysAla: 4.416 ± 0.556
0.116LysCys: 0.116 ± 0.103
3.138LysAsp: 3.138 ± 0.43
2.266LysGlu: 2.266 ± 0.584
1.22LysPhe: 1.22 ± 0.318
2.789LysGly: 2.789 ± 0.397
0.755LysHis: 0.755 ± 0.228
2.557LysIle: 2.557 ± 0.539
2.034LysLys: 2.034 ± 0.452
3.428LysLeu: 3.428 ± 0.619
1.162LysMet: 1.162 ± 0.274
1.627LysAsn: 1.627 ± 0.285
2.092LysPro: 2.092 ± 0.53
1.859LysGln: 1.859 ± 0.398
1.743LysArg: 1.743 ± 0.474
2.557LysSer: 2.557 ± 0.454
2.905LysThr: 2.905 ± 0.591
2.963LysVal: 2.963 ± 0.407
0.813LysTrp: 0.813 ± 0.294
1.685LysTyr: 1.685 ± 0.41
0.0LysXaa: 0.0 ± 0.0
Leu
6.275LeuAla: 6.275 ± 0.562
0.232LeuCys: 0.232 ± 0.145
5.404LeuAsp: 5.404 ± 0.589
4.822LeuGlu: 4.822 ± 0.572
2.498LeuPhe: 2.498 ± 0.386
5.578LeuGly: 5.578 ± 0.41
1.22LeuHis: 1.22 ± 0.192
4.067LeuIle: 4.067 ± 0.512
3.37LeuLys: 3.37 ± 0.519
5.81LeuLeu: 5.81 ± 0.603
2.266LeuMet: 2.266 ± 0.386
4.241LeuAsn: 4.241 ± 0.409
3.951LeuPro: 3.951 ± 0.516
4.532LeuGln: 4.532 ± 0.659
5.171LeuArg: 5.171 ± 0.562
5.868LeuSer: 5.868 ± 0.907
5.868LeuThr: 5.868 ± 0.762
6.101LeuVal: 6.101 ± 0.588
0.988LeuTrp: 0.988 ± 0.254
2.44LeuTyr: 2.44 ± 0.45
0.0LeuXaa: 0.0 ± 0.0
Met
2.615MetAla: 2.615 ± 0.314
0.174MetCys: 0.174 ± 0.104
1.801MetAsp: 1.801 ± 0.271
1.453MetGlu: 1.453 ± 0.28
1.162MetPhe: 1.162 ± 0.215
1.511MetGly: 1.511 ± 0.236
0.407MetHis: 0.407 ± 0.187
0.93MetIle: 0.93 ± 0.203
1.278MetLys: 1.278 ± 0.396
2.208MetLeu: 2.208 ± 0.435
0.581MetMet: 0.581 ± 0.253
0.755MetAsn: 0.755 ± 0.23
1.104MetPro: 1.104 ± 0.261
1.162MetGln: 1.162 ± 0.297
1.336MetArg: 1.336 ± 0.332
2.382MetSer: 2.382 ± 0.334
1.627MetThr: 1.627 ± 0.356
1.743MetVal: 1.743 ± 0.315
0.174MetTrp: 0.174 ± 0.089
0.581MetTyr: 0.581 ± 0.153
0.0MetXaa: 0.0 ± 0.0
Asn
4.183AsnAla: 4.183 ± 0.403
0.232AsnCys: 0.232 ± 0.132
2.382AsnAsp: 2.382 ± 0.603
2.44AsnGlu: 2.44 ± 0.455
1.22AsnPhe: 1.22 ± 0.147
4.358AsnGly: 4.358 ± 0.584
0.639AsnHis: 0.639 ± 0.323
2.673AsnIle: 2.673 ± 0.291
1.569AsnLys: 1.569 ± 0.27
3.021AsnLeu: 3.021 ± 0.415
0.813AsnMet: 0.813 ± 0.247
2.498AsnAsn: 2.498 ± 0.413
3.312AsnPro: 3.312 ± 0.597
1.743AsnGln: 1.743 ± 0.311
2.498AsnArg: 2.498 ± 0.379
3.196AsnSer: 3.196 ± 0.543
3.835AsnThr: 3.835 ± 0.454
3.777AsnVal: 3.777 ± 0.487
0.465AsnTrp: 0.465 ± 0.162
1.743AsnTyr: 1.743 ± 0.269
0.0AsnXaa: 0.0 ± 0.0
Pro
4.067ProAla: 4.067 ± 0.645
0.291ProCys: 0.291 ± 0.191
4.125ProAsp: 4.125 ± 0.597
3.66ProGlu: 3.66 ± 0.495
1.453ProPhe: 1.453 ± 0.288
3.719ProGly: 3.719 ± 0.419
0.697ProHis: 0.697 ± 0.152
2.324ProIle: 2.324 ± 0.433
2.208ProLys: 2.208 ± 0.351
2.847ProLeu: 2.847 ± 0.289
1.162ProMet: 1.162 ± 0.283
2.382ProAsn: 2.382 ± 0.286
2.15ProPro: 2.15 ± 0.638
1.859ProGln: 1.859 ± 0.389
2.324ProArg: 2.324 ± 0.467
3.777ProSer: 3.777 ± 0.452
3.312ProThr: 3.312 ± 0.54
3.021ProVal: 3.021 ± 0.395
1.104ProTrp: 1.104 ± 0.281
1.511ProTyr: 1.511 ± 0.213
0.0ProXaa: 0.0 ± 0.0
Gln
5.462GlnAla: 5.462 ± 0.578
0.291GlnCys: 0.291 ± 0.162
2.324GlnAsp: 2.324 ± 0.343
2.44GlnGlu: 2.44 ± 0.327
1.627GlnPhe: 1.627 ± 0.334
3.079GlnGly: 3.079 ± 0.553
0.872GlnHis: 0.872 ± 0.22
2.498GlnIle: 2.498 ± 0.457
1.685GlnLys: 1.685 ± 0.333
4.358GlnLeu: 4.358 ± 0.525
1.511GlnMet: 1.511 ± 0.354
2.092GlnAsn: 2.092 ± 0.275
2.498GlnPro: 2.498 ± 0.37
2.615GlnGln: 2.615 ± 0.744
2.905GlnArg: 2.905 ± 0.52
2.382GlnSer: 2.382 ± 0.397
4.241GlnThr: 4.241 ± 0.546
2.382GlnVal: 2.382 ± 0.293
0.813GlnTrp: 0.813 ± 0.255
1.569GlnTyr: 1.569 ± 0.292
0.0GlnXaa: 0.0 ± 0.0
Arg
5.578ArgAla: 5.578 ± 1.124
0.116ArgCys: 0.116 ± 0.096
4.125ArgAsp: 4.125 ± 0.602
3.835ArgGlu: 3.835 ± 0.447
1.859ArgPhe: 1.859 ± 0.399
3.893ArgGly: 3.893 ± 0.492
0.755ArgHis: 0.755 ± 0.261
3.312ArgIle: 3.312 ± 0.415
2.44ArgLys: 2.44 ± 0.471
4.706ArgLeu: 4.706 ± 0.544
2.034ArgMet: 2.034 ± 0.423
2.847ArgAsn: 2.847 ± 0.397
2.266ArgPro: 2.266 ± 0.459
2.498ArgGln: 2.498 ± 0.349
4.125ArgArg: 4.125 ± 0.85
3.021ArgSer: 3.021 ± 0.526
3.544ArgThr: 3.544 ± 0.644
3.66ArgVal: 3.66 ± 0.533
0.813ArgTrp: 0.813 ± 0.192
1.569ArgTyr: 1.569 ± 0.278
0.0ArgXaa: 0.0 ± 0.0
Ser
6.566SerAla: 6.566 ± 0.703
0.465SerCys: 0.465 ± 0.189
2.963SerAsp: 2.963 ± 0.489
3.37SerGlu: 3.37 ± 0.476
2.731SerPhe: 2.731 ± 0.473
5.694SerGly: 5.694 ± 0.642
1.104SerHis: 1.104 ± 0.265
3.196SerIle: 3.196 ± 0.498
2.092SerLys: 2.092 ± 0.231
5.171SerLeu: 5.171 ± 0.559
1.743SerMet: 1.743 ± 0.286
2.905SerAsn: 2.905 ± 0.484
3.138SerPro: 3.138 ± 0.415
2.789SerGln: 2.789 ± 0.409
3.079SerArg: 3.079 ± 0.468
4.532SerSer: 4.532 ± 0.613
4.416SerThr: 4.416 ± 0.582
3.951SerVal: 3.951 ± 0.684
0.581SerTrp: 0.581 ± 0.189
1.453SerTyr: 1.453 ± 0.307
0.0SerXaa: 0.0 ± 0.0
Thr
6.74ThrAla: 6.74 ± 0.707
0.349ThrCys: 0.349 ± 0.139
4.3ThrAsp: 4.3 ± 0.534
3.893ThrGlu: 3.893 ± 0.406
2.266ThrPhe: 2.266 ± 0.3
6.217ThrGly: 6.217 ± 0.702
0.988ThrHis: 0.988 ± 0.242
2.731ThrIle: 2.731 ± 0.403
2.15ThrLys: 2.15 ± 0.498
6.217ThrLeu: 6.217 ± 0.713
0.755ThrMet: 0.755 ± 0.203
3.602ThrAsn: 3.602 ± 0.479
4.183ThrPro: 4.183 ± 0.47
3.37ThrGln: 3.37 ± 0.518
4.358ThrArg: 4.358 ± 0.476
4.125ThrSer: 4.125 ± 0.591
4.997ThrThr: 4.997 ± 0.673
5.055ThrVal: 5.055 ± 0.764
0.813ThrTrp: 0.813 ± 0.199
2.324ThrTyr: 2.324 ± 0.297
0.0ThrXaa: 0.0 ± 0.0
Val
5.868ValAla: 5.868 ± 0.629
0.232ValCys: 0.232 ± 0.145
4.881ValAsp: 4.881 ± 0.51
3.544ValGlu: 3.544 ± 0.495
2.382ValPhe: 2.382 ± 0.378
4.706ValGly: 4.706 ± 0.51
1.569ValHis: 1.569 ± 0.378
3.138ValIle: 3.138 ± 0.394
2.731ValLys: 2.731 ± 0.339
6.043ValLeu: 6.043 ± 0.47
1.22ValMet: 1.22 ± 0.248
2.557ValAsn: 2.557 ± 0.441
4.764ValPro: 4.764 ± 0.583
3.777ValGln: 3.777 ± 0.409
3.951ValArg: 3.951 ± 0.379
3.486ValSer: 3.486 ± 0.401
4.358ValThr: 4.358 ± 0.505
4.241ValVal: 4.241 ± 0.45
1.627ValTrp: 1.627 ± 0.347
1.859ValTyr: 1.859 ± 0.343
0.0ValXaa: 0.0 ± 0.0
Trp
1.22TrpAla: 1.22 ± 0.31
0.116TrpCys: 0.116 ± 0.111
1.394TrpAsp: 1.394 ± 0.325
1.046TrpGlu: 1.046 ± 0.399
0.581TrpPhe: 0.581 ± 0.219
1.22TrpGly: 1.22 ± 0.298
0.349TrpHis: 0.349 ± 0.149
0.813TrpIle: 0.813 ± 0.265
0.988TrpLys: 0.988 ± 0.289
1.453TrpLeu: 1.453 ± 0.281
0.407TrpMet: 0.407 ± 0.171
1.046TrpAsn: 1.046 ± 0.236
0.465TrpPro: 0.465 ± 0.165
0.755TrpGln: 0.755 ± 0.182
0.755TrpArg: 0.755 ± 0.226
1.104TrpSer: 1.104 ± 0.412
1.743TrpThr: 1.743 ± 0.272
0.988TrpVal: 0.988 ± 0.255
0.465TrpTrp: 0.465 ± 0.23
0.697TrpTyr: 0.697 ± 0.225
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.208TyrAla: 2.208 ± 0.339
0.232TyrCys: 0.232 ± 0.134
2.092TyrAsp: 2.092 ± 0.295
1.627TyrGlu: 1.627 ± 0.275
0.813TyrPhe: 0.813 ± 0.266
2.034TyrGly: 2.034 ± 0.231
0.523TyrHis: 0.523 ± 0.211
1.569TyrIle: 1.569 ± 0.235
1.859TyrLys: 1.859 ± 0.315
2.905TyrLeu: 2.905 ± 0.377
0.697TyrMet: 0.697 ± 0.17
1.394TyrAsn: 1.394 ± 0.273
0.988TyrPro: 0.988 ± 0.284
1.511TyrGln: 1.511 ± 0.278
1.453TyrArg: 1.453 ± 0.371
2.324TyrSer: 2.324 ± 0.476
1.859TyrThr: 1.859 ± 0.531
3.138TyrVal: 3.138 ± 0.643
0.581TyrTrp: 0.581 ± 0.196
1.278TyrTyr: 1.278 ± 0.337
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (17212 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski