Amino acid dipepetide frequency for Streptococcus phage phi-SsuFJZZ32_rum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.47AlaAla: 3.47 ± 0.778
0.406AlaCys: 0.406 ± 0.125
4.236AlaAsp: 4.236 ± 0.541
6.174AlaGlu: 6.174 ± 1.558
2.659AlaPhe: 2.659 ± 0.33
3.966AlaGly: 3.966 ± 0.553
0.676AlaHis: 0.676 ± 0.182
5.047AlaIle: 5.047 ± 0.621
7.03AlaLys: 7.03 ± 1.1
5.858AlaLeu: 5.858 ± 0.569
1.352AlaMet: 1.352 ± 0.261
4.236AlaAsn: 4.236 ± 0.545
1.667AlaPro: 1.667 ± 0.275
4.011AlaGln: 4.011 ± 1.28
3.11AlaArg: 3.11 ± 0.396
3.921AlaSer: 3.921 ± 0.589
4.777AlaThr: 4.777 ± 0.784
4.056AlaVal: 4.056 ± 0.412
0.811AlaTrp: 0.811 ± 0.256
2.884AlaTyr: 2.884 ± 0.315
0.0AlaXaa: 0.0 ± 0.0
Cys
0.406CysAla: 0.406 ± 0.132
0.18CysCys: 0.18 ± 0.085
0.315CysAsp: 0.315 ± 0.113
0.541CysGlu: 0.541 ± 0.119
0.315CysPhe: 0.315 ± 0.12
0.676CysGly: 0.676 ± 0.173
0.135CysHis: 0.135 ± 0.072
0.406CysIle: 0.406 ± 0.155
0.766CysLys: 0.766 ± 0.222
0.721CysLeu: 0.721 ± 0.225
0.045CysMet: 0.045 ± 0.047
0.315CysAsn: 0.315 ± 0.118
0.451CysPro: 0.451 ± 0.125
0.721CysGln: 0.721 ± 0.163
0.406CysArg: 0.406 ± 0.148
0.406CysSer: 0.406 ± 0.145
0.225CysThr: 0.225 ± 0.11
0.676CysVal: 0.676 ± 0.235
0.0CysTrp: 0.0 ± 0.0
0.451CysTyr: 0.451 ± 0.178
0.0CysXaa: 0.0 ± 0.0
Asp
3.74AspAla: 3.74 ± 0.446
0.496AspCys: 0.496 ± 0.154
3.47AspAsp: 3.47 ± 0.492
4.912AspGlu: 4.912 ± 0.584
3.2AspPhe: 3.2 ± 0.422
3.831AspGly: 3.831 ± 0.532
0.856AspHis: 0.856 ± 0.234
4.867AspIle: 4.867 ± 0.397
4.416AspLys: 4.416 ± 0.314
4.507AspLeu: 4.507 ± 0.498
1.262AspMet: 1.262 ± 0.337
3.019AspAsn: 3.019 ± 0.302
1.487AspPro: 1.487 ± 0.274
1.758AspGln: 1.758 ± 0.266
2.524AspArg: 2.524 ± 0.354
3.785AspSer: 3.785 ± 0.421
2.794AspThr: 2.794 ± 0.317
3.2AspVal: 3.2 ± 0.397
0.811AspTrp: 0.811 ± 0.145
3.019AspTyr: 3.019 ± 0.465
0.0AspXaa: 0.0 ± 0.0
Glu
5.588GluAla: 5.588 ± 1.232
0.676GluCys: 0.676 ± 0.208
4.101GluAsp: 4.101 ± 0.501
6.534GluGlu: 6.534 ± 0.734
2.884GluPhe: 2.884 ± 0.51
4.101GluGly: 4.101 ± 0.399
1.262GluHis: 1.262 ± 0.25
4.867GluIle: 4.867 ± 0.395
7.256GluLys: 7.256 ± 0.855
8.653GluLeu: 8.653 ± 0.595
2.343GluMet: 2.343 ± 0.351
4.957GluAsn: 4.957 ± 0.577
1.262GluPro: 1.262 ± 0.241
3.921GluGln: 3.921 ± 0.326
2.929GluArg: 2.929 ± 0.443
4.011GluSer: 4.011 ± 0.434
4.642GluThr: 4.642 ± 0.436
5.047GluVal: 5.047 ± 0.484
0.676GluTrp: 0.676 ± 0.176
2.388GluTyr: 2.388 ± 0.364
0.0GluXaa: 0.0 ± 0.0
Phe
3.019PheAla: 3.019 ± 0.504
0.631PheCys: 0.631 ± 0.149
2.749PheAsp: 2.749 ± 0.345
3.335PheGlu: 3.335 ± 0.485
1.622PhePhe: 1.622 ± 0.28
2.298PheGly: 2.298 ± 0.3
0.676PheHis: 0.676 ± 0.197
2.388PheIle: 2.388 ± 0.341
3.064PheLys: 3.064 ± 0.445
2.974PheLeu: 2.974 ± 0.417
0.991PheMet: 0.991 ± 0.209
2.524PheAsn: 2.524 ± 0.362
0.676PhePro: 0.676 ± 0.158
1.667PheGln: 1.667 ± 0.289
1.893PheArg: 1.893 ± 0.336
2.569PheSer: 2.569 ± 0.281
2.028PheThr: 2.028 ± 0.289
1.938PheVal: 1.938 ± 0.317
0.631PheTrp: 0.631 ± 0.164
2.388PheTyr: 2.388 ± 0.292
0.0PheXaa: 0.0 ± 0.0
Gly
3.11GlyAla: 3.11 ± 0.473
0.315GlyCys: 0.315 ± 0.114
2.929GlyAsp: 2.929 ± 0.498
4.056GlyGlu: 4.056 ± 0.474
2.569GlyPhe: 2.569 ± 0.299
3.2GlyGly: 3.2 ± 0.465
1.667GlyHis: 1.667 ± 0.297
4.912GlyIle: 4.912 ± 0.632
4.236GlyLys: 4.236 ± 0.462
5.723GlyLeu: 5.723 ± 0.583
1.712GlyMet: 1.712 ± 0.249
3.155GlyAsn: 3.155 ± 0.454
0.496GlyPro: 0.496 ± 0.148
2.479GlyGln: 2.479 ± 0.33
3.38GlyArg: 3.38 ± 0.349
3.47GlySer: 3.47 ± 0.378
3.38GlyThr: 3.38 ± 0.363
3.47GlyVal: 3.47 ± 0.453
0.451GlyTrp: 0.451 ± 0.141
2.974GlyTyr: 2.974 ± 0.374
0.0GlyXaa: 0.0 ± 0.0
His
0.766HisAla: 0.766 ± 0.147
0.225HisCys: 0.225 ± 0.096
0.991HisAsp: 0.991 ± 0.243
1.127HisGlu: 1.127 ± 0.232
0.946HisPhe: 0.946 ± 0.193
1.532HisGly: 1.532 ± 0.259
0.406HisHis: 0.406 ± 0.136
1.352HisIle: 1.352 ± 0.244
0.766HisLys: 0.766 ± 0.198
1.667HisLeu: 1.667 ± 0.257
0.27HisMet: 0.27 ± 0.121
0.991HisAsn: 0.991 ± 0.234
0.901HisPro: 0.901 ± 0.26
0.766HisGln: 0.766 ± 0.219
0.991HisArg: 0.991 ± 0.213
0.856HisSer: 0.856 ± 0.19
1.172HisThr: 1.172 ± 0.204
0.811HisVal: 0.811 ± 0.187
0.225HisTrp: 0.225 ± 0.095
0.901HisTyr: 0.901 ± 0.215
0.0HisXaa: 0.0 ± 0.0
Ile
5.543IleAla: 5.543 ± 0.488
0.631IleCys: 0.631 ± 0.201
5.183IleAsp: 5.183 ± 0.462
5.408IleGlu: 5.408 ± 0.621
2.343IlePhe: 2.343 ± 0.362
4.191IleGly: 4.191 ± 0.456
1.172IleHis: 1.172 ± 0.238
3.515IleIle: 3.515 ± 0.345
4.146IleLys: 4.146 ± 0.424
6.399IleLeu: 6.399 ± 0.602
1.082IleMet: 1.082 ± 0.219
3.29IleAsn: 3.29 ± 0.401
1.938IlePro: 1.938 ± 0.25
3.11IleGln: 3.11 ± 0.323
2.929IleArg: 2.929 ± 0.381
5.453IleSer: 5.453 ± 0.615
4.281IleThr: 4.281 ± 0.607
4.597IleVal: 4.597 ± 0.522
0.766IleTrp: 0.766 ± 0.221
2.253IleTyr: 2.253 ± 0.36
0.0IleXaa: 0.0 ± 0.0
Lys
6.58LysAla: 6.58 ± 1.368
0.541LysCys: 0.541 ± 0.17
3.56LysAsp: 3.56 ± 0.371
6.219LysGlu: 6.219 ± 0.57
2.884LysPhe: 2.884 ± 0.372
3.966LysGly: 3.966 ± 0.415
1.532LysHis: 1.532 ± 0.268
5.408LysIle: 5.408 ± 0.461
5.858LysLys: 5.858 ± 0.598
6.985LysLeu: 6.985 ± 0.597
2.569LysMet: 2.569 ± 0.412
4.281LysAsn: 4.281 ± 0.545
2.208LysPro: 2.208 ± 0.292
3.425LysGln: 3.425 ± 0.443
3.605LysArg: 3.605 ± 0.36
4.416LysSer: 4.416 ± 0.507
4.552LysThr: 4.552 ± 0.411
4.597LysVal: 4.597 ± 0.411
0.946LysTrp: 0.946 ± 0.209
2.614LysTyr: 2.614 ± 0.408
0.0LysXaa: 0.0 ± 0.0
Leu
7.256LeuAla: 7.256 ± 0.875
0.451LeuCys: 0.451 ± 0.156
5.183LeuAsp: 5.183 ± 0.455
7.391LeuGlu: 7.391 ± 0.572
3.019LeuPhe: 3.019 ± 0.479
5.002LeuGly: 5.002 ± 0.487
1.442LeuHis: 1.442 ± 0.238
5.408LeuIle: 5.408 ± 0.466
7.21LeuLys: 7.21 ± 0.556
7.751LeuLeu: 7.751 ± 0.73
1.848LeuMet: 1.848 ± 0.247
4.822LeuAsn: 4.822 ± 0.473
2.794LeuPro: 2.794 ± 0.354
4.056LeuGln: 4.056 ± 0.419
3.65LeuArg: 3.65 ± 0.471
7.391LeuSer: 7.391 ± 0.589
6.625LeuThr: 6.625 ± 0.551
5.453LeuVal: 5.453 ± 0.529
0.586LeuTrp: 0.586 ± 0.154
3.335LeuTyr: 3.335 ± 0.447
0.0LeuXaa: 0.0 ± 0.0
Met
1.803MetAla: 1.803 ± 0.254
0.09MetCys: 0.09 ± 0.063
1.442MetAsp: 1.442 ± 0.258
1.667MetGlu: 1.667 ± 0.376
0.856MetPhe: 0.856 ± 0.172
1.352MetGly: 1.352 ± 0.295
0.0MetHis: 0.0 ± 0.0
1.442MetIle: 1.442 ± 0.262
1.667MetLys: 1.667 ± 0.272
1.487MetLeu: 1.487 ± 0.248
0.766MetMet: 0.766 ± 0.221
0.856MetAsn: 0.856 ± 0.189
0.631MetPro: 0.631 ± 0.188
0.856MetGln: 0.856 ± 0.211
1.442MetArg: 1.442 ± 0.298
1.848MetSer: 1.848 ± 0.324
1.758MetThr: 1.758 ± 0.31
1.577MetVal: 1.577 ± 0.262
0.135MetTrp: 0.135 ± 0.079
0.496MetTyr: 0.496 ± 0.144
0.0MetXaa: 0.0 ± 0.0
Asn
4.191AsnAla: 4.191 ± 0.521
0.27AsnCys: 0.27 ± 0.127
3.155AsnAsp: 3.155 ± 0.366
4.101AsnGlu: 4.101 ± 0.511
2.253AsnPhe: 2.253 ± 0.324
4.326AsnGly: 4.326 ± 0.398
1.127AsnHis: 1.127 ± 0.207
3.29AsnIle: 3.29 ± 0.41
3.876AsnLys: 3.876 ± 0.453
4.957AsnLeu: 4.957 ± 0.648
1.262AsnMet: 1.262 ± 0.243
2.659AsnAsn: 2.659 ± 0.353
2.343AsnPro: 2.343 ± 0.258
2.929AsnGln: 2.929 ± 0.38
2.839AsnArg: 2.839 ± 0.364
2.929AsnSer: 2.929 ± 0.332
2.298AsnThr: 2.298 ± 0.352
2.343AsnVal: 2.343 ± 0.357
0.991AsnTrp: 0.991 ± 0.211
1.938AsnTyr: 1.938 ± 0.312
0.0AsnXaa: 0.0 ± 0.0
Pro
1.307ProAla: 1.307 ± 0.269
0.315ProCys: 0.315 ± 0.111
1.487ProAsp: 1.487 ± 0.271
2.343ProGlu: 2.343 ± 0.324
1.172ProPhe: 1.172 ± 0.216
0.721ProGly: 0.721 ± 0.233
0.451ProHis: 0.451 ± 0.159
2.253ProIle: 2.253 ± 0.302
2.388ProLys: 2.388 ± 0.403
2.524ProLeu: 2.524 ± 0.365
0.496ProMet: 0.496 ± 0.134
1.622ProAsn: 1.622 ± 0.23
0.811ProPro: 0.811 ± 0.236
0.766ProGln: 0.766 ± 0.214
1.307ProArg: 1.307 ± 0.265
2.208ProSer: 2.208 ± 0.314
1.893ProThr: 1.893 ± 0.302
2.118ProVal: 2.118 ± 0.299
0.225ProTrp: 0.225 ± 0.108
1.307ProTyr: 1.307 ± 0.25
0.0ProXaa: 0.0 ± 0.0
Gln
4.732GlnAla: 4.732 ± 0.945
0.361GlnCys: 0.361 ± 0.124
1.983GlnAsp: 1.983 ± 0.247
3.425GlnGlu: 3.425 ± 0.434
1.983GlnPhe: 1.983 ± 0.238
1.938GlnGly: 1.938 ± 0.289
0.856GlnHis: 0.856 ± 0.155
2.749GlnIle: 2.749 ± 0.334
3.2GlnLys: 3.2 ± 0.445
4.371GlnLeu: 4.371 ± 0.429
1.127GlnMet: 1.127 ± 0.228
2.884GlnAsn: 2.884 ± 0.311
1.307GlnPro: 1.307 ± 0.239
1.712GlnGln: 1.712 ± 0.272
1.803GlnArg: 1.803 ± 0.313
2.704GlnSer: 2.704 ± 0.375
3.064GlnThr: 3.064 ± 0.502
3.56GlnVal: 3.56 ± 0.458
0.451GlnTrp: 0.451 ± 0.182
1.127GlnTyr: 1.127 ± 0.27
0.0GlnXaa: 0.0 ± 0.0
Arg
2.343ArgAla: 2.343 ± 0.32
0.541ArgCys: 0.541 ± 0.173
2.839ArgAsp: 2.839 ± 0.376
3.29ArgGlu: 3.29 ± 0.324
1.622ArgPhe: 1.622 ± 0.303
2.343ArgGly: 2.343 ± 0.334
0.901ArgHis: 0.901 ± 0.194
3.785ArgIle: 3.785 ± 0.426
3.876ArgLys: 3.876 ± 0.497
4.777ArgLeu: 4.777 ± 0.554
0.586ArgMet: 0.586 ± 0.187
2.614ArgAsn: 2.614 ± 0.352
1.352ArgPro: 1.352 ± 0.25
2.524ArgGln: 2.524 ± 0.31
2.118ArgArg: 2.118 ± 0.328
2.929ArgSer: 2.929 ± 0.302
2.298ArgThr: 2.298 ± 0.395
2.704ArgVal: 2.704 ± 0.412
0.811ArgTrp: 0.811 ± 0.17
1.667ArgTyr: 1.667 ± 0.295
0.0ArgXaa: 0.0 ± 0.0
Ser
3.695SerAla: 3.695 ± 0.45
0.496SerCys: 0.496 ± 0.151
3.966SerAsp: 3.966 ± 0.466
4.552SerGlu: 4.552 ± 0.446
2.434SerPhe: 2.434 ± 0.357
4.371SerGly: 4.371 ± 0.403
1.352SerHis: 1.352 ± 0.204
5.183SerIle: 5.183 ± 0.496
5.183SerLys: 5.183 ± 0.51
5.858SerLeu: 5.858 ± 0.497
1.127SerMet: 1.127 ± 0.194
3.11SerAsn: 3.11 ± 0.443
2.208SerPro: 2.208 ± 0.303
3.29SerGln: 3.29 ± 0.539
3.11SerArg: 3.11 ± 0.384
4.957SerSer: 4.957 ± 0.696
3.515SerThr: 3.515 ± 0.433
3.921SerVal: 3.921 ± 0.429
0.856SerTrp: 0.856 ± 0.174
3.064SerTyr: 3.064 ± 0.483
0.0SerXaa: 0.0 ± 0.0
Thr
5.137ThrAla: 5.137 ± 0.845
0.135ThrCys: 0.135 ± 0.082
3.29ThrAsp: 3.29 ± 0.452
4.371ThrGlu: 4.371 ± 0.411
2.479ThrPhe: 2.479 ± 0.406
3.695ThrGly: 3.695 ± 0.538
0.721ThrHis: 0.721 ± 0.177
4.101ThrIle: 4.101 ± 0.505
4.957ThrLys: 4.957 ± 0.386
5.047ThrLeu: 5.047 ± 0.347
1.037ThrMet: 1.037 ± 0.174
3.019ThrAsn: 3.019 ± 0.438
1.938ThrPro: 1.938 ± 0.301
2.524ThrGln: 2.524 ± 0.479
2.208ThrArg: 2.208 ± 0.316
4.281ThrSer: 4.281 ± 0.614
4.867ThrThr: 4.867 ± 0.632
5.092ThrVal: 5.092 ± 0.524
0.766ThrTrp: 0.766 ± 0.187
1.803ThrTyr: 1.803 ± 0.294
0.0ThrXaa: 0.0 ± 0.0
Val
4.371ValAla: 4.371 ± 0.577
0.541ValCys: 0.541 ± 0.203
3.831ValAsp: 3.831 ± 0.483
4.732ValGlu: 4.732 ± 0.462
2.208ValPhe: 2.208 ± 0.347
3.245ValGly: 3.245 ± 0.477
1.217ValHis: 1.217 ± 0.179
4.011ValIle: 4.011 ± 0.428
3.966ValLys: 3.966 ± 0.423
5.768ValLeu: 5.768 ± 0.498
1.172ValMet: 1.172 ± 0.262
3.019ValAsn: 3.019 ± 0.349
2.073ValPro: 2.073 ± 0.262
2.298ValGln: 2.298 ± 0.36
3.155ValArg: 3.155 ± 0.51
4.867ValSer: 4.867 ± 0.551
4.191ValThr: 4.191 ± 0.383
3.335ValVal: 3.335 ± 0.342
0.946ValTrp: 0.946 ± 0.218
2.073ValTyr: 2.073 ± 0.377
0.0ValXaa: 0.0 ± 0.0
Trp
0.901TrpAla: 0.901 ± 0.218
0.135TrpCys: 0.135 ± 0.085
0.361TrpAsp: 0.361 ± 0.117
1.082TrpGlu: 1.082 ± 0.242
0.631TrpPhe: 0.631 ± 0.185
0.631TrpGly: 0.631 ± 0.124
0.27TrpHis: 0.27 ± 0.11
0.721TrpIle: 0.721 ± 0.176
0.631TrpLys: 0.631 ± 0.177
0.856TrpLeu: 0.856 ± 0.189
0.406TrpMet: 0.406 ± 0.113
0.991TrpAsn: 0.991 ± 0.237
0.09TrpPro: 0.09 ± 0.065
0.676TrpGln: 0.676 ± 0.165
0.541TrpArg: 0.541 ± 0.188
0.766TrpSer: 0.766 ± 0.193
1.037TrpThr: 1.037 ± 0.291
0.586TrpVal: 0.586 ± 0.181
0.18TrpTrp: 0.18 ± 0.09
0.27TrpTyr: 0.27 ± 0.117
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.388TyrAla: 2.388 ± 0.264
0.721TyrCys: 0.721 ± 0.193
2.884TyrAsp: 2.884 ± 0.441
2.839TyrGlu: 2.839 ± 0.376
1.983TyrPhe: 1.983 ± 0.287
2.298TyrGly: 2.298 ± 0.343
0.946TyrHis: 0.946 ± 0.25
2.524TyrIle: 2.524 ± 0.368
2.253TyrLys: 2.253 ± 0.311
3.785TyrLeu: 3.785 ± 0.461
0.676TyrMet: 0.676 ± 0.188
1.712TyrAsn: 1.712 ± 0.248
1.082TyrPro: 1.082 ± 0.181
1.893TyrGln: 1.893 ± 0.236
1.983TyrArg: 1.983 ± 0.289
2.479TyrSer: 2.479 ± 0.405
2.073TyrThr: 2.073 ± 0.319
1.938TyrVal: 1.938 ± 0.267
0.496TyrTrp: 0.496 ± 0.184
1.712TyrTyr: 1.712 ± 0.356
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (22191 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski