Amino acid dipepetide frequency for Paenibacillus phage Shelly

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.641AlaAla: 5.641 ± 0.978
0.806AlaCys: 0.806 ± 0.199
4.11AlaAsp: 4.11 ± 0.667
6.205AlaGlu: 6.205 ± 0.78
1.853AlaPhe: 1.853 ± 0.316
4.674AlaGly: 4.674 ± 0.684
1.128AlaHis: 1.128 ± 0.272
4.835AlaIle: 4.835 ± 0.913
5.399AlaLys: 5.399 ± 0.744
5.48AlaLeu: 5.48 ± 0.733
1.37AlaMet: 1.37 ± 0.314
2.901AlaAsn: 2.901 ± 0.426
1.451AlaPro: 1.451 ± 0.294
2.659AlaGln: 2.659 ± 0.446
2.337AlaArg: 2.337 ± 0.44
4.513AlaSer: 4.513 ± 0.733
3.949AlaThr: 3.949 ± 0.757
4.11AlaVal: 4.11 ± 0.592
1.209AlaTrp: 1.209 ± 0.267
2.418AlaTyr: 2.418 ± 0.362
0.0AlaXaa: 0.0 ± 0.0
Cys
0.403CysAla: 0.403 ± 0.206
0.081CysCys: 0.081 ± 0.083
0.484CysAsp: 0.484 ± 0.183
0.403CysGlu: 0.403 ± 0.186
0.161CysPhe: 0.161 ± 0.111
0.564CysGly: 0.564 ± 0.205
0.081CysHis: 0.081 ± 0.078
0.886CysIle: 0.886 ± 0.252
1.209CysLys: 1.209 ± 0.398
0.967CysLeu: 0.967 ± 0.264
0.242CysMet: 0.242 ± 0.127
0.161CysAsn: 0.161 ± 0.125
0.484CysPro: 0.484 ± 0.242
0.403CysGln: 0.403 ± 0.176
0.403CysArg: 0.403 ± 0.176
0.806CysSer: 0.806 ± 0.236
0.242CysThr: 0.242 ± 0.121
0.322CysVal: 0.322 ± 0.165
0.081CysTrp: 0.081 ± 0.082
0.242CysTyr: 0.242 ± 0.148
0.0CysXaa: 0.0 ± 0.0
Asp
3.949AspAla: 3.949 ± 0.563
0.242AspCys: 0.242 ± 0.157
4.271AspAsp: 4.271 ± 0.748
3.868AspGlu: 3.868 ± 0.648
3.304AspPhe: 3.304 ± 0.579
3.707AspGly: 3.707 ± 0.515
0.725AspHis: 0.725 ± 0.23
3.223AspIle: 3.223 ± 0.503
3.788AspLys: 3.788 ± 0.591
5.319AspLeu: 5.319 ± 0.585
1.934AspMet: 1.934 ± 0.443
2.015AspAsn: 2.015 ± 0.404
1.773AspPro: 1.773 ± 0.397
2.256AspGln: 2.256 ± 0.447
2.901AspArg: 2.901 ± 0.425
3.062AspSer: 3.062 ± 0.464
2.982AspThr: 2.982 ± 0.534
4.11AspVal: 4.11 ± 0.567
1.048AspTrp: 1.048 ± 0.304
2.095AspTyr: 2.095 ± 0.339
0.0AspXaa: 0.0 ± 0.0
Glu
5.319GluAla: 5.319 ± 0.696
0.322GluCys: 0.322 ± 0.147
4.271GluAsp: 4.271 ± 0.652
6.205GluGlu: 6.205 ± 0.881
2.498GluPhe: 2.498 ± 0.348
4.11GluGly: 4.11 ± 0.533
1.289GluHis: 1.289 ± 0.394
6.608GluIle: 6.608 ± 0.938
8.462GluLys: 8.462 ± 0.926
7.172GluLeu: 7.172 ± 0.801
4.513GluMet: 4.513 ± 0.596
3.385GluAsn: 3.385 ± 0.624
2.015GluPro: 2.015 ± 0.344
4.191GluGln: 4.191 ± 0.676
3.788GluArg: 3.788 ± 0.599
4.11GluSer: 4.11 ± 0.499
4.755GluThr: 4.755 ± 0.686
5.963GluVal: 5.963 ± 0.666
1.048GluTrp: 1.048 ± 0.316
3.062GluTyr: 3.062 ± 0.541
0.0GluXaa: 0.0 ± 0.0
Phe
1.692PheAla: 1.692 ± 0.372
0.645PheCys: 0.645 ± 0.21
2.015PheAsp: 2.015 ± 0.417
3.465PheGlu: 3.465 ± 0.57
0.725PhePhe: 0.725 ± 0.284
1.934PheGly: 1.934 ± 0.371
0.967PheHis: 0.967 ± 0.298
2.982PheIle: 2.982 ± 0.474
2.659PheLys: 2.659 ± 0.52
2.176PheLeu: 2.176 ± 0.418
0.806PheMet: 0.806 ± 0.284
2.095PheAsn: 2.095 ± 0.413
0.725PhePro: 0.725 ± 0.264
1.451PheGln: 1.451 ± 0.334
0.967PheArg: 0.967 ± 0.296
2.74PheSer: 2.74 ± 0.562
1.612PheThr: 1.612 ± 0.386
2.579PheVal: 2.579 ± 0.44
0.645PheTrp: 0.645 ± 0.257
1.209PheTyr: 1.209 ± 0.253
0.0PheXaa: 0.0 ± 0.0
Gly
3.223GlyAla: 3.223 ± 0.542
0.725GlyCys: 0.725 ± 0.23
2.821GlyAsp: 2.821 ± 0.538
4.916GlyGlu: 4.916 ± 0.653
1.934GlyPhe: 1.934 ± 0.398
3.546GlyGly: 3.546 ± 0.57
0.806GlyHis: 0.806 ± 0.247
5.56GlyIle: 5.56 ± 0.997
6.608GlyLys: 6.608 ± 0.595
4.835GlyLeu: 4.835 ± 0.902
1.773GlyMet: 1.773 ± 0.619
2.982GlyAsn: 2.982 ± 0.534
0.967GlyPro: 0.967 ± 0.259
2.74GlyGln: 2.74 ± 0.478
3.626GlyArg: 3.626 ± 0.518
3.626GlySer: 3.626 ± 0.732
2.74GlyThr: 2.74 ± 0.558
3.304GlyVal: 3.304 ± 0.475
1.209GlyTrp: 1.209 ± 0.266
2.579GlyTyr: 2.579 ± 0.453
0.0GlyXaa: 0.0 ± 0.0
His
1.37HisAla: 1.37 ± 0.36
0.403HisCys: 0.403 ± 0.209
0.484HisAsp: 0.484 ± 0.24
1.209HisGlu: 1.209 ± 0.304
0.645HisPhe: 0.645 ± 0.227
0.484HisGly: 0.484 ± 0.186
0.484HisHis: 0.484 ± 0.179
1.853HisIle: 1.853 ± 0.401
1.209HisLys: 1.209 ± 0.277
1.853HisLeu: 1.853 ± 0.358
0.564HisMet: 0.564 ± 0.217
0.967HisAsn: 0.967 ± 0.229
0.806HisPro: 0.806 ± 0.259
0.484HisGln: 0.484 ± 0.156
1.048HisArg: 1.048 ± 0.268
1.128HisSer: 1.128 ± 0.372
0.403HisThr: 0.403 ± 0.192
1.048HisVal: 1.048 ± 0.287
0.403HisTrp: 0.403 ± 0.171
0.484HisTyr: 0.484 ± 0.189
0.0HisXaa: 0.0 ± 0.0
Ile
4.191IleAla: 4.191 ± 0.626
0.645IleCys: 0.645 ± 0.221
4.835IleAsp: 4.835 ± 0.694
6.689IleGlu: 6.689 ± 0.678
3.062IlePhe: 3.062 ± 0.497
3.707IleGly: 3.707 ± 0.635
1.37IleHis: 1.37 ± 0.292
3.143IleIle: 3.143 ± 0.798
7.253IleLys: 7.253 ± 1.025
5.319IleLeu: 5.319 ± 0.691
1.692IleMet: 1.692 ± 0.343
3.304IleAsn: 3.304 ± 0.439
2.74IlePro: 2.74 ± 0.349
3.223IleGln: 3.223 ± 0.525
3.868IleArg: 3.868 ± 0.614
3.788IleSer: 3.788 ± 0.607
3.788IleThr: 3.788 ± 0.417
4.755IleVal: 4.755 ± 0.707
0.564IleTrp: 0.564 ± 0.249
2.579IleTyr: 2.579 ± 0.433
0.0IleXaa: 0.0 ± 0.0
Lys
6.608LysAla: 6.608 ± 0.692
0.403LysCys: 0.403 ± 0.182
4.271LysAsp: 4.271 ± 0.437
9.348LysGlu: 9.348 ± 1.141
1.934LysPhe: 1.934 ± 0.27
5.319LysGly: 5.319 ± 0.567
1.209LysHis: 1.209 ± 0.237
5.48LysIle: 5.48 ± 0.507
8.462LysLys: 8.462 ± 1.074
6.85LysLeu: 6.85 ± 0.743
3.223LysMet: 3.223 ± 0.518
4.191LysAsn: 4.191 ± 0.695
2.579LysPro: 2.579 ± 0.535
4.432LysGln: 4.432 ± 0.605
5.883LysArg: 5.883 ± 0.703
5.56LysSer: 5.56 ± 0.63
5.238LysThr: 5.238 ± 0.668
4.513LysVal: 4.513 ± 0.629
1.209LysTrp: 1.209 ± 0.256
3.062LysTyr: 3.062 ± 0.571
0.0LysXaa: 0.0 ± 0.0
Leu
6.528LeuAla: 6.528 ± 0.796
0.725LeuCys: 0.725 ± 0.274
4.674LeuAsp: 4.674 ± 0.528
6.769LeuGlu: 6.769 ± 0.705
3.062LeuPhe: 3.062 ± 0.541
4.593LeuGly: 4.593 ± 0.641
1.773LeuHis: 1.773 ± 0.355
5.158LeuIle: 5.158 ± 0.485
6.608LeuLys: 6.608 ± 0.738
4.916LeuLeu: 4.916 ± 0.76
1.531LeuMet: 1.531 ± 0.361
4.191LeuAsn: 4.191 ± 0.571
2.821LeuPro: 2.821 ± 0.458
4.191LeuGln: 4.191 ± 0.469
3.626LeuArg: 3.626 ± 0.578
6.366LeuSer: 6.366 ± 0.673
5.158LeuThr: 5.158 ± 0.646
3.465LeuVal: 3.465 ± 0.464
1.128LeuTrp: 1.128 ± 0.379
2.821LeuTyr: 2.821 ± 0.569
0.0LeuXaa: 0.0 ± 0.0
Met
2.015MetAla: 2.015 ± 0.372
0.081MetCys: 0.081 ± 0.078
1.37MetAsp: 1.37 ± 0.284
2.498MetGlu: 2.498 ± 0.379
0.806MetPhe: 0.806 ± 0.203
1.531MetGly: 1.531 ± 0.31
0.242MetHis: 0.242 ± 0.131
2.498MetIle: 2.498 ± 0.471
3.788MetLys: 3.788 ± 0.552
2.579MetLeu: 2.579 ± 0.535
0.484MetMet: 0.484 ± 0.216
1.773MetAsn: 1.773 ± 0.42
1.289MetPro: 1.289 ± 0.331
1.048MetGln: 1.048 ± 0.251
0.886MetArg: 0.886 ± 0.302
2.579MetSer: 2.579 ± 0.374
1.612MetThr: 1.612 ± 0.408
1.048MetVal: 1.048 ± 0.296
0.322MetTrp: 0.322 ± 0.145
0.886MetTyr: 0.886 ± 0.274
0.0MetXaa: 0.0 ± 0.0
Asn
2.579AsnAla: 2.579 ± 0.459
0.484AsnCys: 0.484 ± 0.183
1.934AsnAsp: 1.934 ± 0.406
4.352AsnGlu: 4.352 ± 0.704
1.209AsnPhe: 1.209 ± 0.283
3.868AsnGly: 3.868 ± 0.608
0.725AsnHis: 0.725 ± 0.181
2.982AsnIle: 2.982 ± 0.55
3.626AsnLys: 3.626 ± 0.46
3.788AsnLeu: 3.788 ± 0.545
1.289AsnMet: 1.289 ± 0.477
2.015AsnAsn: 2.015 ± 0.491
2.579AsnPro: 2.579 ± 0.508
1.773AsnGln: 1.773 ± 0.378
3.868AsnArg: 3.868 ± 0.717
2.901AsnSer: 2.901 ± 0.445
2.176AsnThr: 2.176 ± 0.386
2.418AsnVal: 2.418 ± 0.417
0.484AsnTrp: 0.484 ± 0.196
1.451AsnTyr: 1.451 ± 0.313
0.0AsnXaa: 0.0 ± 0.0
Pro
2.337ProAla: 2.337 ± 0.4
0.242ProCys: 0.242 ± 0.125
1.612ProAsp: 1.612 ± 0.389
2.821ProGlu: 2.821 ± 0.817
1.048ProPhe: 1.048 ± 0.291
2.337ProGly: 2.337 ± 0.439
0.645ProHis: 0.645 ± 0.244
2.337ProIle: 2.337 ± 0.489
3.062ProLys: 3.062 ± 0.62
2.176ProLeu: 2.176 ± 0.432
0.806ProMet: 0.806 ± 0.254
1.853ProAsn: 1.853 ± 0.382
0.967ProPro: 0.967 ± 0.288
0.967ProGln: 0.967 ± 0.259
1.37ProArg: 1.37 ± 0.325
2.821ProSer: 2.821 ± 0.647
1.531ProThr: 1.531 ± 0.299
1.853ProVal: 1.853 ± 0.504
0.484ProTrp: 0.484 ± 0.175
1.128ProTyr: 1.128 ± 0.322
0.0ProXaa: 0.0 ± 0.0
Gln
3.465GlnAla: 3.465 ± 0.552
0.0GlnCys: 0.0 ± 0.0
2.015GlnAsp: 2.015 ± 0.319
3.868GlnGlu: 3.868 ± 0.606
1.451GlnPhe: 1.451 ± 0.321
2.498GlnGly: 2.498 ± 0.554
1.048GlnHis: 1.048 ± 0.257
3.465GlnIle: 3.465 ± 0.68
2.982GlnLys: 2.982 ± 0.577
4.11GlnLeu: 4.11 ± 0.569
2.015GlnMet: 2.015 ± 0.411
1.773GlnAsn: 1.773 ± 0.295
1.773GlnPro: 1.773 ± 0.519
2.095GlnGln: 2.095 ± 0.394
2.579GlnArg: 2.579 ± 0.504
2.337GlnSer: 2.337 ± 0.374
2.337GlnThr: 2.337 ± 0.426
2.095GlnVal: 2.095 ± 0.398
0.403GlnTrp: 0.403 ± 0.186
1.531GlnTyr: 1.531 ± 0.299
0.0GlnXaa: 0.0 ± 0.0
Arg
2.498ArgAla: 2.498 ± 0.423
0.564ArgCys: 0.564 ± 0.206
3.223ArgAsp: 3.223 ± 0.419
4.352ArgGlu: 4.352 ± 0.738
1.692ArgPhe: 1.692 ± 0.386
2.821ArgGly: 2.821 ± 0.462
1.048ArgHis: 1.048 ± 0.237
3.385ArgIle: 3.385 ± 0.575
6.608ArgLys: 6.608 ± 0.797
4.191ArgLeu: 4.191 ± 0.628
1.451ArgMet: 1.451 ± 0.399
2.256ArgAsn: 2.256 ± 0.453
1.37ArgPro: 1.37 ± 0.345
2.901ArgGln: 2.901 ± 0.46
3.062ArgArg: 3.062 ± 0.489
2.095ArgSer: 2.095 ± 0.487
2.095ArgThr: 2.095 ± 0.41
2.74ArgVal: 2.74 ± 0.433
0.806ArgTrp: 0.806 ± 0.257
1.853ArgTyr: 1.853 ± 0.329
0.0ArgXaa: 0.0 ± 0.0
Ser
5.641SerAla: 5.641 ± 0.906
0.322SerCys: 0.322 ± 0.145
4.191SerAsp: 4.191 ± 0.545
4.755SerGlu: 4.755 ± 0.527
2.659SerPhe: 2.659 ± 0.424
4.11SerGly: 4.11 ± 0.743
1.048SerHis: 1.048 ± 0.378
5.802SerIle: 5.802 ± 0.78
4.029SerLys: 4.029 ± 0.63
5.319SerLeu: 5.319 ± 0.616
1.531SerMet: 1.531 ± 0.322
2.176SerAsn: 2.176 ± 0.31
1.853SerPro: 1.853 ± 0.344
2.821SerGln: 2.821 ± 0.538
3.304SerArg: 3.304 ± 0.508
4.432SerSer: 4.432 ± 0.689
3.304SerThr: 3.304 ± 0.471
4.352SerVal: 4.352 ± 0.737
0.484SerTrp: 0.484 ± 0.187
2.095SerTyr: 2.095 ± 0.378
0.0SerXaa: 0.0 ± 0.0
Thr
3.868ThrAla: 3.868 ± 0.724
0.564ThrCys: 0.564 ± 0.201
2.74ThrAsp: 2.74 ± 0.405
3.385ThrGlu: 3.385 ± 0.469
1.773ThrPhe: 1.773 ± 0.359
4.191ThrGly: 4.191 ± 0.572
1.048ThrHis: 1.048 ± 0.332
3.465ThrIle: 3.465 ± 0.515
4.513ThrLys: 4.513 ± 0.559
4.352ThrLeu: 4.352 ± 0.721
1.531ThrMet: 1.531 ± 0.336
2.015ThrAsn: 2.015 ± 0.392
2.821ThrPro: 2.821 ± 0.564
2.095ThrGln: 2.095 ± 0.351
2.337ThrArg: 2.337 ± 0.393
3.626ThrSer: 3.626 ± 0.48
3.062ThrThr: 3.062 ± 0.501
4.271ThrVal: 4.271 ± 0.657
0.403ThrTrp: 0.403 ± 0.183
2.418ThrTyr: 2.418 ± 0.469
0.0ThrXaa: 0.0 ± 0.0
Val
3.546ValAla: 3.546 ± 0.628
0.645ValCys: 0.645 ± 0.22
3.465ValAsp: 3.465 ± 0.55
4.674ValGlu: 4.674 ± 0.78
2.579ValPhe: 2.579 ± 0.449
2.821ValGly: 2.821 ± 0.562
0.645ValHis: 0.645 ± 0.26
3.385ValIle: 3.385 ± 0.456
5.238ValLys: 5.238 ± 0.653
4.271ValLeu: 4.271 ± 0.596
1.612ValMet: 1.612 ± 0.347
2.901ValAsn: 2.901 ± 0.649
1.773ValPro: 1.773 ± 0.415
1.853ValGln: 1.853 ± 0.355
3.062ValArg: 3.062 ± 0.421
5.158ValSer: 5.158 ± 0.608
4.513ValThr: 4.513 ± 0.626
3.788ValVal: 3.788 ± 0.488
1.773ValTrp: 1.773 ± 0.946
2.256ValTyr: 2.256 ± 0.471
0.0ValXaa: 0.0 ± 0.0
Trp
0.645TrpAla: 0.645 ± 0.226
0.161TrpCys: 0.161 ± 0.137
0.564TrpAsp: 0.564 ± 0.316
0.725TrpGlu: 0.725 ± 0.246
0.806TrpPhe: 0.806 ± 0.235
1.289TrpGly: 1.289 ± 0.324
0.645TrpHis: 0.645 ± 0.189
0.806TrpIle: 0.806 ± 0.202
1.289TrpLys: 1.289 ± 0.364
1.37TrpLeu: 1.37 ± 0.31
0.161TrpMet: 0.161 ± 0.147
1.773TrpAsn: 1.773 ± 1.115
0.403TrpPro: 0.403 ± 0.247
0.645TrpGln: 0.645 ± 0.207
0.242TrpArg: 0.242 ± 0.121
0.645TrpSer: 0.645 ± 0.222
0.564TrpThr: 0.564 ± 0.225
0.725TrpVal: 0.725 ± 0.195
0.161TrpTrp: 0.161 ± 0.111
0.564TrpTyr: 0.564 ± 0.225
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.531TyrAla: 1.531 ± 0.364
0.564TyrCys: 0.564 ± 0.252
3.223TyrAsp: 3.223 ± 0.479
2.579TyrGlu: 2.579 ± 0.442
0.886TyrPhe: 0.886 ± 0.24
2.418TyrGly: 2.418 ± 0.56
0.403TyrHis: 0.403 ± 0.19
2.659TyrIle: 2.659 ± 0.482
2.982TyrLys: 2.982 ± 0.432
3.062TyrLeu: 3.062 ± 0.551
0.725TyrMet: 0.725 ± 0.254
1.773TyrAsn: 1.773 ± 0.381
1.209TyrPro: 1.209 ± 0.303
1.612TyrGln: 1.612 ± 0.326
1.853TyrArg: 1.853 ± 0.389
2.015TyrSer: 2.015 ± 0.464
2.418TyrThr: 2.418 ± 0.45
2.498TyrVal: 2.498 ± 0.475
0.322TyrTrp: 0.322 ± 0.16
1.692TyrTyr: 1.692 ± 0.366
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (12410 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski