Amino acid dipepetide frequency for Streptococcus phage phi-SsuFJZZ39_rum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.47AlaAla: 3.47 ± 0.776
0.406AlaCys: 0.406 ± 0.121
4.236AlaAsp: 4.236 ± 0.586
6.174AlaGlu: 6.174 ± 1.747
2.659AlaPhe: 2.659 ± 0.292
3.966AlaGly: 3.966 ± 0.599
0.676AlaHis: 0.676 ± 0.191
5.047AlaIle: 5.047 ± 0.519
7.03AlaLys: 7.03 ± 1.218
5.858AlaLeu: 5.858 ± 0.508
1.352AlaMet: 1.352 ± 0.211
4.236AlaAsn: 4.236 ± 0.605
1.667AlaPro: 1.667 ± 0.302
4.011AlaGln: 4.011 ± 1.467
3.11AlaArg: 3.11 ± 0.426
3.921AlaSer: 3.921 ± 0.605
4.777AlaThr: 4.777 ± 0.908
4.056AlaVal: 4.056 ± 0.473
0.811AlaTrp: 0.811 ± 0.221
2.884AlaTyr: 2.884 ± 0.35
0.0AlaXaa: 0.0 ± 0.0
Cys
0.406CysAla: 0.406 ± 0.133
0.18CysCys: 0.18 ± 0.082
0.315CysAsp: 0.315 ± 0.114
0.541CysGlu: 0.541 ± 0.146
0.315CysPhe: 0.315 ± 0.132
0.676CysGly: 0.676 ± 0.178
0.135CysHis: 0.135 ± 0.085
0.406CysIle: 0.406 ± 0.159
0.766CysLys: 0.766 ± 0.236
0.721CysLeu: 0.721 ± 0.203
0.045CysMet: 0.045 ± 0.044
0.315CysAsn: 0.315 ± 0.122
0.451CysPro: 0.451 ± 0.16
0.721CysGln: 0.721 ± 0.179
0.406CysArg: 0.406 ± 0.158
0.406CysSer: 0.406 ± 0.141
0.225CysThr: 0.225 ± 0.095
0.676CysVal: 0.676 ± 0.223
0.0CysTrp: 0.0 ± 0.0
0.451CysTyr: 0.451 ± 0.172
0.0CysXaa: 0.0 ± 0.0
Asp
3.74AspAla: 3.74 ± 0.469
0.496AspCys: 0.496 ± 0.157
3.47AspAsp: 3.47 ± 0.607
4.912AspGlu: 4.912 ± 0.58
3.2AspPhe: 3.2 ± 0.472
3.831AspGly: 3.831 ± 0.513
0.856AspHis: 0.856 ± 0.23
4.867AspIle: 4.867 ± 0.454
4.416AspLys: 4.416 ± 0.297
4.507AspLeu: 4.507 ± 0.474
1.262AspMet: 1.262 ± 0.289
3.019AspAsn: 3.019 ± 0.311
1.487AspPro: 1.487 ± 0.301
1.758AspGln: 1.758 ± 0.234
2.524AspArg: 2.524 ± 0.307
3.785AspSer: 3.785 ± 0.414
2.794AspThr: 2.794 ± 0.337
3.2AspVal: 3.2 ± 0.386
0.811AspTrp: 0.811 ± 0.182
3.019AspTyr: 3.019 ± 0.479
0.0AspXaa: 0.0 ± 0.0
Glu
5.588GluAla: 5.588 ± 1.455
0.676GluCys: 0.676 ± 0.194
4.101GluAsp: 4.101 ± 0.531
6.534GluGlu: 6.534 ± 0.775
2.884GluPhe: 2.884 ± 0.492
4.101GluGly: 4.101 ± 0.354
1.262GluHis: 1.262 ± 0.243
4.867GluIle: 4.867 ± 0.373
7.256GluLys: 7.256 ± 0.805
8.653GluLeu: 8.653 ± 0.659
2.298GluMet: 2.298 ± 0.349
4.957GluAsn: 4.957 ± 0.586
1.262GluPro: 1.262 ± 0.258
3.921GluGln: 3.921 ± 0.353
2.929GluArg: 2.929 ± 0.389
4.011GluSer: 4.011 ± 0.4
4.642GluThr: 4.642 ± 0.469
5.047GluVal: 5.047 ± 0.371
0.676GluTrp: 0.676 ± 0.199
2.388GluTyr: 2.388 ± 0.383
0.0GluXaa: 0.0 ± 0.0
Phe
3.019PheAla: 3.019 ± 0.524
0.631PheCys: 0.631 ± 0.168
2.749PheAsp: 2.749 ± 0.393
3.335PheGlu: 3.335 ± 0.46
1.622PhePhe: 1.622 ± 0.294
2.298PheGly: 2.298 ± 0.305
0.676PheHis: 0.676 ± 0.188
2.388PheIle: 2.388 ± 0.379
3.064PheLys: 3.064 ± 0.442
2.974PheLeu: 2.974 ± 0.509
0.991PheMet: 0.991 ± 0.209
2.524PheAsn: 2.524 ± 0.356
0.676PhePro: 0.676 ± 0.176
1.667PheGln: 1.667 ± 0.313
1.893PheArg: 1.893 ± 0.364
2.569PheSer: 2.569 ± 0.296
2.028PheThr: 2.028 ± 0.359
1.938PheVal: 1.938 ± 0.33
0.631PheTrp: 0.631 ± 0.18
2.388PheTyr: 2.388 ± 0.296
0.0PheXaa: 0.0 ± 0.0
Gly
3.11GlyAla: 3.11 ± 0.461
0.315GlyCys: 0.315 ± 0.109
2.929GlyAsp: 2.929 ± 0.495
4.056GlyGlu: 4.056 ± 0.475
2.569GlyPhe: 2.569 ± 0.34
3.2GlyGly: 3.2 ± 0.527
1.667GlyHis: 1.667 ± 0.308
4.912GlyIle: 4.912 ± 0.616
4.236GlyLys: 4.236 ± 0.426
5.723GlyLeu: 5.723 ± 0.596
1.712GlyMet: 1.712 ± 0.287
3.155GlyAsn: 3.155 ± 0.42
0.496GlyPro: 0.496 ± 0.131
2.479GlyGln: 2.479 ± 0.34
3.38GlyArg: 3.38 ± 0.407
3.47GlySer: 3.47 ± 0.341
3.38GlyThr: 3.38 ± 0.429
3.47GlyVal: 3.47 ± 0.44
0.451GlyTrp: 0.451 ± 0.137
2.974GlyTyr: 2.974 ± 0.425
0.0GlyXaa: 0.0 ± 0.0
His
0.766HisAla: 0.766 ± 0.148
0.225HisCys: 0.225 ± 0.092
0.991HisAsp: 0.991 ± 0.226
1.127HisGlu: 1.127 ± 0.235
0.946HisPhe: 0.946 ± 0.212
1.532HisGly: 1.532 ± 0.266
0.406HisHis: 0.406 ± 0.128
1.352HisIle: 1.352 ± 0.24
0.766HisLys: 0.766 ± 0.187
1.667HisLeu: 1.667 ± 0.275
0.27HisMet: 0.27 ± 0.128
0.991HisAsn: 0.991 ± 0.241
0.901HisPro: 0.901 ± 0.216
0.766HisGln: 0.766 ± 0.226
0.991HisArg: 0.991 ± 0.214
0.856HisSer: 0.856 ± 0.205
1.172HisThr: 1.172 ± 0.261
0.811HisVal: 0.811 ± 0.203
0.225HisTrp: 0.225 ± 0.096
0.901HisTyr: 0.901 ± 0.187
0.0HisXaa: 0.0 ± 0.0
Ile
5.543IleAla: 5.543 ± 0.514
0.631IleCys: 0.631 ± 0.174
5.183IleAsp: 5.183 ± 0.448
5.408IleGlu: 5.408 ± 0.719
2.343IlePhe: 2.343 ± 0.364
4.191IleGly: 4.191 ± 0.437
1.172IleHis: 1.172 ± 0.22
3.515IleIle: 3.515 ± 0.388
4.146IleLys: 4.146 ± 0.453
6.399IleLeu: 6.399 ± 0.633
1.082IleMet: 1.082 ± 0.227
3.29IleAsn: 3.29 ± 0.389
1.938IlePro: 1.938 ± 0.255
3.11IleGln: 3.11 ± 0.359
2.929IleArg: 2.929 ± 0.399
5.453IleSer: 5.453 ± 0.652
4.281IleThr: 4.281 ± 0.645
4.597IleVal: 4.597 ± 0.492
0.766IleTrp: 0.766 ± 0.189
2.253IleTyr: 2.253 ± 0.353
0.0IleXaa: 0.0 ± 0.0
Lys
6.58LysAla: 6.58 ± 1.684
0.541LysCys: 0.541 ± 0.147
3.56LysAsp: 3.56 ± 0.441
6.219LysGlu: 6.219 ± 0.592
2.884LysPhe: 2.884 ± 0.358
3.966LysGly: 3.966 ± 0.398
1.532LysHis: 1.532 ± 0.265
5.408LysIle: 5.408 ± 0.524
5.858LysLys: 5.858 ± 0.705
6.985LysLeu: 6.985 ± 0.547
2.569LysMet: 2.569 ± 0.382
4.281LysAsn: 4.281 ± 0.568
2.208LysPro: 2.208 ± 0.309
3.425LysGln: 3.425 ± 0.448
3.605LysArg: 3.605 ± 0.443
4.416LysSer: 4.416 ± 0.473
4.552LysThr: 4.552 ± 0.371
4.597LysVal: 4.597 ± 0.522
0.946LysTrp: 0.946 ± 0.227
2.614LysTyr: 2.614 ± 0.458
0.0LysXaa: 0.0 ± 0.0
Leu
7.256LeuAla: 7.256 ± 0.868
0.451LeuCys: 0.451 ± 0.162
5.183LeuAsp: 5.183 ± 0.484
7.391LeuGlu: 7.391 ± 0.616
3.019LeuPhe: 3.019 ± 0.477
5.002LeuGly: 5.002 ± 0.479
1.442LeuHis: 1.442 ± 0.223
5.408LeuIle: 5.408 ± 0.5
7.21LeuLys: 7.21 ± 0.607
7.751LeuLeu: 7.751 ± 0.787
1.848LeuMet: 1.848 ± 0.246
4.822LeuAsn: 4.822 ± 0.531
2.794LeuPro: 2.794 ± 0.328
4.056LeuGln: 4.056 ± 0.431
3.65LeuArg: 3.65 ± 0.434
7.391LeuSer: 7.391 ± 0.582
6.625LeuThr: 6.625 ± 0.542
5.453LeuVal: 5.453 ± 0.517
0.586LeuTrp: 0.586 ± 0.123
3.335LeuTyr: 3.335 ± 0.4
0.0LeuXaa: 0.0 ± 0.0
Met
1.803MetAla: 1.803 ± 0.229
0.09MetCys: 0.09 ± 0.069
1.442MetAsp: 1.442 ± 0.24
1.667MetGlu: 1.667 ± 0.31
0.856MetPhe: 0.856 ± 0.208
1.352MetGly: 1.352 ± 0.264
0.0MetHis: 0.0 ± 0.0
1.442MetIle: 1.442 ± 0.265
1.667MetLys: 1.667 ± 0.261
1.487MetLeu: 1.487 ± 0.256
0.766MetMet: 0.766 ± 0.191
0.856MetAsn: 0.856 ± 0.188
0.631MetPro: 0.631 ± 0.168
0.856MetGln: 0.856 ± 0.188
1.442MetArg: 1.442 ± 0.281
1.848MetSer: 1.848 ± 0.264
1.758MetThr: 1.758 ± 0.303
1.577MetVal: 1.577 ± 0.273
0.135MetTrp: 0.135 ± 0.069
0.496MetTyr: 0.496 ± 0.133
0.0MetXaa: 0.0 ± 0.0
Asn
4.191AsnAla: 4.191 ± 0.543
0.27AsnCys: 0.27 ± 0.125
3.155AsnAsp: 3.155 ± 0.367
4.101AsnGlu: 4.101 ± 0.489
2.253AsnPhe: 2.253 ± 0.346
4.326AsnGly: 4.326 ± 0.365
1.127AsnHis: 1.127 ± 0.209
3.29AsnIle: 3.29 ± 0.452
3.876AsnLys: 3.876 ± 0.491
4.957AsnLeu: 4.957 ± 0.644
1.262AsnMet: 1.262 ± 0.248
2.659AsnAsn: 2.659 ± 0.377
2.343AsnPro: 2.343 ± 0.247
2.929AsnGln: 2.929 ± 0.362
2.839AsnArg: 2.839 ± 0.398
2.929AsnSer: 2.929 ± 0.3
2.298AsnThr: 2.298 ± 0.459
2.343AsnVal: 2.343 ± 0.413
0.991AsnTrp: 0.991 ± 0.2
1.938AsnTyr: 1.938 ± 0.319
0.0AsnXaa: 0.0 ± 0.0
Pro
1.307ProAla: 1.307 ± 0.267
0.315ProCys: 0.315 ± 0.121
1.487ProAsp: 1.487 ± 0.266
2.343ProGlu: 2.343 ± 0.355
1.172ProPhe: 1.172 ± 0.2
0.721ProGly: 0.721 ± 0.225
0.451ProHis: 0.451 ± 0.14
2.253ProIle: 2.253 ± 0.329
2.388ProLys: 2.388 ± 0.462
2.524ProLeu: 2.524 ± 0.417
0.496ProMet: 0.496 ± 0.129
1.622ProAsn: 1.622 ± 0.273
0.811ProPro: 0.811 ± 0.219
0.766ProGln: 0.766 ± 0.203
1.307ProArg: 1.307 ± 0.222
2.208ProSer: 2.208 ± 0.308
1.893ProThr: 1.893 ± 0.292
2.118ProVal: 2.118 ± 0.297
0.225ProTrp: 0.225 ± 0.091
1.307ProTyr: 1.307 ± 0.254
0.0ProXaa: 0.0 ± 0.0
Gln
4.732GlnAla: 4.732 ± 1.06
0.361GlnCys: 0.361 ± 0.128
1.983GlnAsp: 1.983 ± 0.264
3.425GlnGlu: 3.425 ± 0.406
1.983GlnPhe: 1.983 ± 0.283
1.938GlnGly: 1.938 ± 0.311
0.856GlnHis: 0.856 ± 0.184
2.749GlnIle: 2.749 ± 0.357
3.2GlnLys: 3.2 ± 0.383
4.371GlnLeu: 4.371 ± 0.435
1.172GlnMet: 1.172 ± 0.231
2.884GlnAsn: 2.884 ± 0.35
1.307GlnPro: 1.307 ± 0.254
1.712GlnGln: 1.712 ± 0.272
1.803GlnArg: 1.803 ± 0.342
2.704GlnSer: 2.704 ± 0.322
3.064GlnThr: 3.064 ± 0.529
3.56GlnVal: 3.56 ± 0.439
0.451GlnTrp: 0.451 ± 0.172
1.127GlnTyr: 1.127 ± 0.252
0.0GlnXaa: 0.0 ± 0.0
Arg
2.343ArgAla: 2.343 ± 0.294
0.541ArgCys: 0.541 ± 0.165
2.839ArgAsp: 2.839 ± 0.404
3.29ArgGlu: 3.29 ± 0.293
1.622ArgPhe: 1.622 ± 0.348
2.343ArgGly: 2.343 ± 0.322
0.901ArgHis: 0.901 ± 0.214
3.785ArgIle: 3.785 ± 0.433
3.876ArgLys: 3.876 ± 0.455
4.777ArgLeu: 4.777 ± 0.602
0.586ArgMet: 0.586 ± 0.162
2.614ArgAsn: 2.614 ± 0.329
1.352ArgPro: 1.352 ± 0.253
2.524ArgGln: 2.524 ± 0.323
2.118ArgArg: 2.118 ± 0.291
2.929ArgSer: 2.929 ± 0.357
2.298ArgThr: 2.298 ± 0.428
2.704ArgVal: 2.704 ± 0.392
0.811ArgTrp: 0.811 ± 0.201
1.667ArgTyr: 1.667 ± 0.33
0.0ArgXaa: 0.0 ± 0.0
Ser
3.695SerAla: 3.695 ± 0.385
0.496SerCys: 0.496 ± 0.153
3.966SerAsp: 3.966 ± 0.472
4.552SerGlu: 4.552 ± 0.504
2.434SerPhe: 2.434 ± 0.45
4.371SerGly: 4.371 ± 0.448
1.352SerHis: 1.352 ± 0.23
5.183SerIle: 5.183 ± 0.529
5.183SerLys: 5.183 ± 0.534
5.858SerLeu: 5.858 ± 0.583
1.127SerMet: 1.127 ± 0.163
3.11SerAsn: 3.11 ± 0.503
2.208SerPro: 2.208 ± 0.232
3.29SerGln: 3.29 ± 0.54
3.11SerArg: 3.11 ± 0.459
4.957SerSer: 4.957 ± 0.762
3.515SerThr: 3.515 ± 0.439
3.921SerVal: 3.921 ± 0.384
0.856SerTrp: 0.856 ± 0.172
3.064SerTyr: 3.064 ± 0.484
0.0SerXaa: 0.0 ± 0.0
Thr
5.137ThrAla: 5.137 ± 1.124
0.135ThrCys: 0.135 ± 0.08
3.29ThrAsp: 3.29 ± 0.442
4.371ThrGlu: 4.371 ± 0.434
2.479ThrPhe: 2.479 ± 0.427
3.695ThrGly: 3.695 ± 0.491
0.721ThrHis: 0.721 ± 0.209
4.101ThrIle: 4.101 ± 0.572
4.957ThrLys: 4.957 ± 0.462
5.047ThrLeu: 5.047 ± 0.418
1.037ThrMet: 1.037 ± 0.186
3.019ThrAsn: 3.019 ± 0.378
1.938ThrPro: 1.938 ± 0.313
2.524ThrGln: 2.524 ± 0.521
2.208ThrArg: 2.208 ± 0.315
4.281ThrSer: 4.281 ± 0.652
4.867ThrThr: 4.867 ± 0.789
5.092ThrVal: 5.092 ± 0.56
0.766ThrTrp: 0.766 ± 0.188
1.803ThrTyr: 1.803 ± 0.293
0.0ThrXaa: 0.0 ± 0.0
Val
4.371ValAla: 4.371 ± 0.532
0.541ValCys: 0.541 ± 0.191
3.831ValAsp: 3.831 ± 0.486
4.732ValGlu: 4.732 ± 0.459
2.208ValPhe: 2.208 ± 0.361
3.245ValGly: 3.245 ± 0.47
1.217ValHis: 1.217 ± 0.204
4.011ValIle: 4.011 ± 0.494
3.966ValLys: 3.966 ± 0.438
5.768ValLeu: 5.768 ± 0.482
1.172ValMet: 1.172 ± 0.264
3.019ValAsn: 3.019 ± 0.336
2.073ValPro: 2.073 ± 0.26
2.298ValGln: 2.298 ± 0.408
3.155ValArg: 3.155 ± 0.509
4.867ValSer: 4.867 ± 0.506
4.191ValThr: 4.191 ± 0.489
3.335ValVal: 3.335 ± 0.377
0.946ValTrp: 0.946 ± 0.192
2.073ValTyr: 2.073 ± 0.372
0.0ValXaa: 0.0 ± 0.0
Trp
0.901TrpAla: 0.901 ± 0.204
0.135TrpCys: 0.135 ± 0.073
0.361TrpAsp: 0.361 ± 0.132
1.082TrpGlu: 1.082 ± 0.249
0.631TrpPhe: 0.631 ± 0.153
0.631TrpGly: 0.631 ± 0.136
0.27TrpHis: 0.27 ± 0.104
0.721TrpIle: 0.721 ± 0.182
0.631TrpLys: 0.631 ± 0.187
0.856TrpLeu: 0.856 ± 0.176
0.406TrpMet: 0.406 ± 0.1
0.991TrpAsn: 0.991 ± 0.256
0.09TrpPro: 0.09 ± 0.065
0.676TrpGln: 0.676 ± 0.171
0.541TrpArg: 0.541 ± 0.195
0.766TrpSer: 0.766 ± 0.186
1.037TrpThr: 1.037 ± 0.302
0.586TrpVal: 0.586 ± 0.14
0.18TrpTrp: 0.18 ± 0.091
0.27TrpTyr: 0.27 ± 0.113
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.388TyrAla: 2.388 ± 0.286
0.721TyrCys: 0.721 ± 0.208
2.884TyrAsp: 2.884 ± 0.428
2.839TyrGlu: 2.839 ± 0.359
1.983TyrPhe: 1.983 ± 0.35
2.298TyrGly: 2.298 ± 0.344
0.946TyrHis: 0.946 ± 0.214
2.524TyrIle: 2.524 ± 0.344
2.253TyrLys: 2.253 ± 0.371
3.785TyrLeu: 3.785 ± 0.386
0.676TyrMet: 0.676 ± 0.195
1.712TyrAsn: 1.712 ± 0.271
1.082TyrPro: 1.082 ± 0.168
1.893TyrGln: 1.893 ± 0.263
1.983TyrArg: 1.983 ± 0.353
2.479TyrSer: 2.479 ± 0.44
2.073TyrThr: 2.073 ± 0.32
1.938TyrVal: 1.938 ± 0.327
0.496TyrTrp: 0.496 ± 0.157
1.712TyrTyr: 1.712 ± 0.358
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (22191 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski