Amino acid dipeptide frequency for Microbacterium phage Stromboli

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
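As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_permille`, the toy sequences, the mapping of non-standard residues to `X` (Xaa), and the choice to divide by the total number of pairs are all illustrative assumptions; the exact normalisation used by the database is not stated on this page.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumptions: residues outside the 20 standard ones are mapped to 'X'
    (Xaa), and pairs are never counted across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_permille(["MAAG", "AAK"])
```

The table on this page uses three-letter codes (AlaAla, AlaCys, ...); the sketch uses one-letter codes only for brevity.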

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
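The comparison against the uniform expectation can be sketched as follows. The threshold of 1000/400 = 2.5‰ follows from the 400 standard dipeptides; whether the page's colouring uses exactly this cut-off, or also takes the ± error into account, is an assumption. The `classify` helper is hypothetical, and the example values are copied from the table.

```python
# 2.5 per mille if all 400 standard dipeptides were equally likely.
EXPECTED_PERMILLE = 1000.0 / 400

def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide as over- or under-represented (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"

# Values in per mille, taken from the table for this proteome.
examples = {"AlaAla": 14.07, "AlaLeu": 10.978, "CysCys": 0.0, "TrpTrp": 0.155}
labels = {name: classify(value) for name, value in examples.items()}

# Converting a per mille value to a plain fraction: multiply by 10**-3.
ala_ala_fraction = examples["AlaAla"] * 1e-3
```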

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.07 ± 1.435
AlaCys: 0.387 ± 0.188
AlaAsp: 6.649 ± 0.774
AlaGlu: 7.576 ± 0.751
AlaPhe: 3.711 ± 0.519
AlaGly: 12.06 ± 0.919
AlaHis: 1.933 ± 0.373
AlaIle: 4.948 ± 0.787
AlaLys: 5.102 ± 0.73
AlaLeu: 10.978 ± 1.029
AlaMet: 2.551 ± 0.41
AlaAsn: 3.324 ± 0.587
AlaPro: 5.257 ± 0.582
AlaGln: 5.025 ± 0.643
AlaArg: 7.344 ± 0.691
AlaSer: 5.798 ± 0.816
AlaThr: 6.417 ± 0.773
AlaVal: 9.818 ± 0.955
AlaTrp: 2.86 ± 0.405
AlaTyr: 2.087 ± 0.411
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.232 ± 0.159
CysCys: 0.0 ± 0.0
CysAsp: 0.309 ± 0.19
CysGlu: 0.0 ± 0.0
CysPhe: 0.077 ± 0.085
CysGly: 0.773 ± 0.271
CysHis: 0.077 ± 0.069
CysIle: 0.0 ± 0.0
CysLys: 0.232 ± 0.139
CysLeu: 0.232 ± 0.124
CysMet: 0.077 ± 0.096
CysAsn: 0.232 ± 0.155
CysPro: 0.85 ± 0.249
CysGln: 0.077 ± 0.086
CysArg: 0.155 ± 0.095
CysSer: 0.309 ± 0.157
CysThr: 0.077 ± 0.072
CysVal: 0.464 ± 0.215
CysTrp: 0.0 ± 0.0
CysTyr: 0.155 ± 0.119
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.107 ± 0.665
AspCys: 0.155 ± 0.144
AspAsp: 3.556 ± 0.735
AspGlu: 3.865 ± 0.654
AspPhe: 2.319 ± 0.383
AspGly: 6.339 ± 0.843
AspHis: 1.546 ± 0.42
AspIle: 2.242 ± 0.401
AspLys: 1.701 ± 0.334
AspLeu: 6.03 ± 0.861
AspMet: 1.624 ± 0.463
AspAsn: 1.546 ± 0.331
AspPro: 4.716 ± 0.603
AspGln: 1.855 ± 0.361
AspArg: 3.402 ± 0.706
AspSer: 4.02 ± 0.552
AspThr: 2.474 ± 0.482
AspVal: 4.948 ± 0.593
AspTrp: 1.546 ± 0.321
AspTyr: 2.474 ± 0.366
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.504 ± 0.875
GluCys: 0.232 ± 0.147
GluAsp: 3.788 ± 0.447
GluGlu: 5.102 ± 0.854
GluPhe: 2.551 ± 0.384
GluGly: 4.329 ± 0.527
GluHis: 1.546 ± 0.465
GluIle: 1.392 ± 0.317
GluLys: 2.706 ± 0.497
GluLeu: 7.576 ± 0.573
GluMet: 2.01 ± 0.42
GluAsn: 1.469 ± 0.428
GluPro: 3.17 ± 0.483
GluGln: 2.242 ± 0.4
GluArg: 3.015 ± 0.478
GluSer: 2.397 ± 0.503
GluThr: 3.479 ± 0.45
GluVal: 5.644 ± 0.828
GluTrp: 1.237 ± 0.359
GluTyr: 1.546 ± 0.312
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.402 ± 0.544
PheCys: 0.155 ± 0.11
PheAsp: 1.701 ± 0.313
PheGlu: 2.165 ± 0.319
PhePhe: 0.618 ± 0.227
PheGly: 2.242 ± 0.398
PheHis: 0.309 ± 0.158
PheIle: 1.237 ± 0.276
PheLys: 1.546 ± 0.391
PheLeu: 3.092 ± 0.556
PheMet: 1.005 ± 0.307
PheAsn: 1.16 ± 0.28
PhePro: 1.237 ± 0.3
PheGln: 1.314 ± 0.319
PheArg: 3.479 ± 0.642
PheSer: 2.165 ± 0.394
PheThr: 2.551 ± 0.422
PheVal: 2.087 ± 0.453
PheTrp: 0.309 ± 0.179
PheTyr: 0.541 ± 0.201
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.432 ± 1.066
GlyCys: 0.696 ± 0.288
GlyAsp: 6.881 ± 0.605
GlyGlu: 4.716 ± 0.598
GlyPhe: 2.938 ± 0.573
GlyGly: 7.112 ± 0.823
GlyHis: 1.392 ± 0.402
GlyIle: 2.938 ± 0.444
GlyLys: 3.865 ± 0.536
GlyLeu: 6.881 ± 0.685
GlyMet: 1.855 ± 0.407
GlyAsn: 2.319 ± 0.613
GlyPro: 3.556 ± 0.7
GlyGln: 3.634 ± 0.753
GlyArg: 4.793 ± 0.661
GlySer: 4.175 ± 0.558
GlyThr: 6.185 ± 0.942
GlyVal: 7.808 ± 0.901
GlyTrp: 1.778 ± 0.332
GlyTyr: 2.938 ± 0.37
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.01 ± 0.348
HisCys: 0.077 ± 0.087
HisAsp: 0.696 ± 0.277
HisGlu: 0.541 ± 0.245
HisPhe: 0.618 ± 0.208
HisGly: 1.237 ± 0.307
HisHis: 0.464 ± 0.179
HisIle: 0.618 ± 0.218
HisLys: 0.773 ± 0.255
HisLeu: 2.087 ± 0.388
HisMet: 0.387 ± 0.173
HisAsn: 0.464 ± 0.169
HisPro: 1.237 ± 0.257
HisGln: 0.387 ± 0.151
HisArg: 1.082 ± 0.232
HisSer: 0.928 ± 0.355
HisThr: 0.696 ± 0.243
HisVal: 1.624 ± 0.379
HisTrp: 0.309 ± 0.144
HisTyr: 0.928 ± 0.253
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.561 ± 0.501
IleCys: 0.0 ± 0.0
IleAsp: 2.783 ± 0.499
IleGlu: 2.706 ± 0.43
IlePhe: 0.773 ± 0.224
IleGly: 2.706 ± 0.5
IleHis: 1.082 ± 0.313
IleIle: 1.624 ± 0.374
IleLys: 2.397 ± 0.512
IleLeu: 3.015 ± 0.489
IleMet: 0.541 ± 0.19
IleAsn: 1.237 ± 0.315
IlePro: 1.778 ± 0.367
IleGln: 0.85 ± 0.231
IleArg: 2.629 ± 0.475
IleSer: 1.855 ± 0.321
IleThr: 3.247 ± 0.52
IleVal: 4.02 ± 0.629
IleTrp: 0.464 ± 0.235
IleTyr: 1.082 ± 0.262
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.18 ± 0.976
LysCys: 0.464 ± 0.197
LysAsp: 3.015 ± 0.489
LysGlu: 2.01 ± 0.411
LysPhe: 1.082 ± 0.368
LysGly: 3.402 ± 0.424
LysHis: 0.618 ± 0.199
LysIle: 1.701 ± 0.385
LysLys: 2.087 ± 0.468
LysLeu: 4.02 ± 0.628
LysMet: 0.928 ± 0.195
LysAsn: 1.16 ± 0.264
LysPro: 2.86 ± 0.457
LysGln: 1.546 ± 0.343
LysArg: 3.324 ± 0.599
LysSer: 2.165 ± 0.485
LysThr: 2.783 ± 0.462
LysVal: 3.402 ± 0.374
LysTrp: 0.309 ± 0.199
LysTyr: 0.696 ± 0.238
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.205 ± 1.321
LeuCys: 0.387 ± 0.168
LeuAsp: 6.107 ± 0.759
LeuGlu: 4.948 ± 0.656
LeuPhe: 2.474 ± 0.364
LeuGly: 8.427 ± 0.893
LeuHis: 1.314 ± 0.294
LeuIle: 3.247 ± 0.47
LeuLys: 3.943 ± 0.716
LeuLeu: 6.262 ± 0.744
LeuMet: 2.087 ± 0.324
LeuAsn: 2.551 ± 0.564
LeuPro: 4.871 ± 0.741
LeuGln: 3.17 ± 0.668
LeuArg: 5.566 ± 0.903
LeuSer: 5.257 ± 0.65
LeuThr: 5.18 ± 0.685
LeuVal: 7.035 ± 0.895
LeuTrp: 1.237 ± 0.329
LeuTyr: 1.855 ± 0.322
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.556 ± 0.449
MetCys: 0.077 ± 0.077
MetAsp: 1.392 ± 0.327
MetGlu: 1.314 ± 0.297
MetPhe: 0.155 ± 0.108
MetGly: 1.392 ± 0.335
MetHis: 0.077 ± 0.08
MetIle: 0.773 ± 0.19
MetLys: 1.16 ± 0.321
MetLeu: 1.546 ± 0.349
MetMet: 0.232 ± 0.152
MetAsn: 1.237 ± 0.285
MetPro: 0.85 ± 0.243
MetGln: 0.618 ± 0.251
MetArg: 1.237 ± 0.331
MetSer: 1.392 ± 0.323
MetThr: 2.165 ± 0.408
MetVal: 1.933 ± 0.498
MetTrp: 0.387 ± 0.193
MetTyr: 0.464 ± 0.21
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.706 ± 0.518
AsnCys: 0.155 ± 0.096
AsnAsp: 1.237 ± 0.269
AsnGlu: 1.855 ± 0.415
AsnPhe: 0.618 ± 0.231
AsnGly: 3.402 ± 0.637
AsnHis: 0.309 ± 0.156
AsnIle: 0.928 ± 0.274
AsnLys: 1.314 ± 0.354
AsnLeu: 2.474 ± 0.427
AsnMet: 0.541 ± 0.239
AsnAsn: 1.005 ± 0.26
AsnPro: 2.242 ± 0.418
AsnGln: 1.082 ± 0.316
AsnArg: 1.624 ± 0.383
AsnSer: 2.397 ± 0.529
AsnThr: 1.778 ± 0.529
AsnVal: 1.933 ± 0.423
AsnTrp: 0.618 ± 0.265
AsnTyr: 1.005 ± 0.287
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.499 ± 0.645
ProCys: 0.309 ± 0.164
ProAsp: 3.634 ± 0.675
ProGlu: 5.18 ± 0.708
ProPhe: 1.933 ± 0.444
ProGly: 5.102 ± 0.658
ProHis: 0.928 ± 0.287
ProIle: 2.629 ± 0.418
ProLys: 2.01 ± 0.448
ProLeu: 3.865 ± 0.642
ProMet: 0.464 ± 0.176
ProAsn: 1.855 ± 0.361
ProPro: 1.392 ± 0.356
ProGln: 1.701 ± 0.385
ProArg: 2.319 ± 0.5
ProSer: 2.629 ± 0.419
ProThr: 3.634 ± 0.479
ProVal: 2.938 ± 0.488
ProTrp: 1.469 ± 0.339
ProTyr: 0.618 ± 0.228
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.793 ± 0.565
GlnCys: 0.0 ± 0.0
GlnAsp: 2.165 ± 0.402
GlnGlu: 2.165 ± 0.352
GlnPhe: 1.469 ± 0.372
GlnGly: 2.706 ± 0.422
GlnHis: 0.464 ± 0.191
GlnIle: 1.082 ± 0.365
GlnLys: 1.933 ± 0.351
GlnLeu: 3.402 ± 0.4
GlnMet: 0.618 ± 0.195
GlnAsn: 1.237 ± 0.328
GlnPro: 1.237 ± 0.347
GlnGln: 1.005 ± 0.203
GlnArg: 2.165 ± 0.391
GlnSer: 1.778 ± 0.401
GlnThr: 1.701 ± 0.3
GlnVal: 3.015 ± 0.546
GlnTrp: 0.696 ± 0.253
GlnTyr: 0.773 ± 0.263
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.803 ± 0.894
ArgCys: 0.464 ± 0.219
ArgAsp: 3.634 ± 0.469
ArgGlu: 4.252 ± 0.707
ArgPhe: 1.933 ± 0.394
ArgGly: 4.871 ± 0.689
ArgHis: 1.16 ± 0.292
ArgIle: 2.551 ± 0.447
ArgLys: 2.242 ± 0.388
ArgLeu: 5.644 ± 0.586
ArgMet: 1.237 ± 0.374
ArgAsn: 1.469 ± 0.376
ArgPro: 3.711 ± 0.502
ArgGln: 2.242 ± 0.395
ArgArg: 5.489 ± 1.018
ArgSer: 3.788 ± 0.662
ArgThr: 3.324 ± 0.517
ArgVal: 5.798 ± 0.689
ArgTrp: 1.392 ± 0.317
ArgTyr: 1.392 ± 0.265
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.262 ± 0.637
SerCys: 0.077 ± 0.076
SerAsp: 3.865 ± 0.469
SerGlu: 2.706 ± 0.408
SerPhe: 2.01 ± 0.454
SerGly: 5.102 ± 0.638
SerHis: 0.696 ± 0.28
SerIle: 3.015 ± 0.458
SerLys: 2.397 ± 0.348
SerLeu: 4.639 ± 0.648
SerMet: 1.778 ± 0.411
SerAsn: 1.933 ± 0.491
SerPro: 2.629 ± 0.445
SerGln: 1.392 ± 0.36
SerArg: 3.324 ± 0.429
SerSer: 2.86 ± 0.561
SerThr: 3.711 ± 0.438
SerVal: 3.711 ± 0.614
SerTrp: 1.469 ± 0.319
SerTyr: 1.314 ± 0.429
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.881 ± 1.059
ThrCys: 0.387 ± 0.247
ThrAsp: 3.324 ± 0.556
ThrGlu: 4.02 ± 0.578
ThrPhe: 2.938 ± 0.601
ThrGly: 5.18 ± 0.706
ThrHis: 0.618 ± 0.212
ThrIle: 3.247 ± 0.565
ThrLys: 2.629 ± 0.431
ThrLeu: 4.639 ± 0.603
ThrMet: 0.618 ± 0.236
ThrAsn: 1.469 ± 0.349
ThrPro: 4.097 ± 0.447
ThrGln: 1.16 ± 0.392
ThrArg: 3.788 ± 0.51
ThrSer: 4.175 ± 0.51
ThrThr: 4.561 ± 0.848
ThrVal: 5.953 ± 0.645
ThrTrp: 1.778 ± 0.294
ThrTyr: 1.778 ± 0.37
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.282 ± 0.917
ValCys: 0.077 ± 0.098
ValAsp: 5.18 ± 0.671
ValGlu: 5.876 ± 0.68
ValPhe: 2.783 ± 0.447
ValGly: 5.798 ± 0.711
ValHis: 1.701 ± 0.414
ValIle: 3.556 ± 0.436
ValLys: 3.17 ± 0.509
ValLeu: 6.649 ± 0.722
ValMet: 1.546 ± 0.301
ValAsn: 2.242 ± 0.443
ValPro: 4.329 ± 0.593
ValGln: 3.015 ± 0.487
ValArg: 5.644 ± 0.684
ValSer: 3.788 ± 0.508
ValThr: 5.953 ± 0.865
ValVal: 7.112 ± 0.706
ValTrp: 1.855 ± 0.51
ValTyr: 2.629 ± 0.484
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.474 ± 0.532
TrpCys: 0.0 ± 0.0
TrpAsp: 0.773 ± 0.271
TrpGlu: 1.546 ± 0.368
TrpPhe: 0.618 ± 0.2
TrpGly: 1.469 ± 0.299
TrpHis: 0.232 ± 0.15
TrpIle: 0.773 ± 0.26
TrpLys: 0.928 ± 0.266
TrpLeu: 1.855 ± 0.401
TrpMet: 0.928 ± 0.252
TrpAsn: 0.696 ± 0.26
TrpPro: 0.464 ± 0.222
TrpGln: 1.16 ± 0.486
TrpArg: 1.005 ± 0.374
TrpSer: 1.469 ± 0.406
TrpThr: 1.469 ± 0.327
TrpVal: 2.01 ± 0.366
TrpTrp: 0.155 ± 0.128
TrpTyr: 0.309 ± 0.142
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.938 ± 0.45
TyrCys: 0.232 ± 0.155
TyrAsp: 1.778 ± 0.41
TyrGlu: 1.392 ± 0.333
TyrPhe: 0.696 ± 0.228
TyrGly: 2.242 ± 0.35
TyrHis: 0.696 ± 0.217
TyrIle: 0.85 ± 0.259
TyrLys: 0.618 ± 0.199
TyrLeu: 1.237 ± 0.278
TyrMet: 0.85 ± 0.249
TyrAsn: 0.618 ± 0.223
TyrPro: 1.624 ± 0.405
TyrGln: 0.928 ± 0.225
TyrArg: 2.087 ± 0.49
TyrSer: 1.546 ± 0.459
TyrThr: 1.855 ± 0.343
TyrVal: 1.933 ± 0.586
TyrTrp: 0.464 ± 0.198
TyrTyr: 0.464 ± 0.175
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (12936 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
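A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are assumptions made for illustration; the page does not specify these details.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille of all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (same number as the input).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy input (hypothetical sequences, one-letter codes).
toy = ["MAAG", "AAK", "MKKL"]
aa_error = bootstrap_error(toy, "AA")
```

Resampling at the protein level, rather than at the residue level, keeps each protein's internal composition intact, so the error reflects variation between proteins.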

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski