Amino acid dipepetide frequency for Streptococcus phage phi-SsuFJNP3_rum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.007AlaAla: 4.007 ± 0.905
0.406AlaCys: 0.406 ± 0.151
3.949AlaAsp: 3.949 ± 0.609
5.226AlaGlu: 5.226 ± 0.578
3.31AlaPhe: 3.31 ± 0.386
4.007AlaGly: 4.007 ± 0.456
0.523AlaHis: 0.523 ± 0.179
5.691AlaIle: 5.691 ± 0.535
5.807AlaLys: 5.807 ± 0.511
5.633AlaLeu: 5.633 ± 0.561
1.568AlaMet: 1.568 ± 0.273
3.6AlaAsn: 3.6 ± 0.463
1.278AlaPro: 1.278 ± 0.263
2.555AlaGln: 2.555 ± 0.6
2.323AlaArg: 2.323 ± 0.306
4.007AlaSer: 4.007 ± 0.53
4.007AlaThr: 4.007 ± 0.696
3.658AlaVal: 3.658 ± 0.515
0.813AlaTrp: 0.813 ± 0.229
2.671AlaTyr: 2.671 ± 0.401
0.0AlaXaa: 0.0 ± 0.0
Cys
0.465CysAla: 0.465 ± 0.158
0.174CysCys: 0.174 ± 0.107
0.29CysAsp: 0.29 ± 0.126
0.465CysGlu: 0.465 ± 0.155
0.174CysPhe: 0.174 ± 0.09
0.813CysGly: 0.813 ± 0.19
0.116CysHis: 0.116 ± 0.091
0.29CysIle: 0.29 ± 0.127
0.929CysLys: 0.929 ± 0.307
0.523CysLeu: 0.523 ± 0.182
0.174CysMet: 0.174 ± 0.093
0.465CysAsn: 0.465 ± 0.195
0.523CysPro: 0.523 ± 0.164
0.697CysGln: 0.697 ± 0.232
0.581CysArg: 0.581 ± 0.178
0.639CysSer: 0.639 ± 0.193
0.058CysThr: 0.058 ± 0.065
0.697CysVal: 0.697 ± 0.212
0.0CysTrp: 0.0 ± 0.0
0.697CysTyr: 0.697 ± 0.198
0.0CysXaa: 0.0 ± 0.0
Asp
3.252AspAla: 3.252 ± 0.36
0.639AspCys: 0.639 ± 0.188
3.194AspAsp: 3.194 ± 0.501
5.284AspGlu: 5.284 ± 0.596
3.368AspPhe: 3.368 ± 0.408
5.633AspGly: 5.633 ± 0.549
0.639AspHis: 0.639 ± 0.162
4.704AspIle: 4.704 ± 0.462
3.6AspLys: 3.6 ± 0.383
4.878AspLeu: 4.878 ± 0.697
1.8AspMet: 1.8 ± 0.355
3.02AspAsn: 3.02 ± 0.469
1.568AspPro: 1.568 ± 0.333
1.742AspGln: 1.742 ± 0.329
2.149AspArg: 2.149 ± 0.427
3.31AspSer: 3.31 ± 0.441
2.439AspThr: 2.439 ± 0.41
2.613AspVal: 2.613 ± 0.374
0.639AspTrp: 0.639 ± 0.213
3.02AspTyr: 3.02 ± 0.448
0.0AspXaa: 0.0 ± 0.0
Glu
4.529GluAla: 4.529 ± 0.513
0.697GluCys: 0.697 ± 0.198
4.355GluAsp: 4.355 ± 0.457
6.388GluGlu: 6.388 ± 0.88
2.962GluPhe: 2.962 ± 0.426
4.123GluGly: 4.123 ± 0.458
1.103GluHis: 1.103 ± 0.247
5.052GluIle: 5.052 ± 0.547
6.852GluLys: 6.852 ± 0.773
8.478GluLeu: 8.478 ± 0.645
2.381GluMet: 2.381 ± 0.422
4.239GluAsn: 4.239 ± 0.509
1.394GluPro: 1.394 ± 0.348
4.355GluGln: 4.355 ± 0.352
3.194GluArg: 3.194 ± 0.368
4.123GluSer: 4.123 ± 0.457
4.704GluThr: 4.704 ± 0.475
4.181GluVal: 4.181 ± 0.537
0.755GluTrp: 0.755 ± 0.191
2.497GluTyr: 2.497 ± 0.536
0.0GluXaa: 0.0 ± 0.0
Phe
2.613PheAla: 2.613 ± 0.553
0.581PheCys: 0.581 ± 0.148
2.787PheAsp: 2.787 ± 0.455
3.484PheGlu: 3.484 ± 0.483
1.742PhePhe: 1.742 ± 0.305
2.671PheGly: 2.671 ± 0.364
0.871PheHis: 0.871 ± 0.218
3.02PheIle: 3.02 ± 0.438
2.845PheLys: 2.845 ± 0.503
3.542PheLeu: 3.542 ± 0.556
0.871PheMet: 0.871 ± 0.203
2.207PheAsn: 2.207 ± 0.309
0.929PhePro: 0.929 ± 0.246
1.452PheGln: 1.452 ± 0.326
1.858PheArg: 1.858 ± 0.299
2.439PheSer: 2.439 ± 0.375
2.439PheThr: 2.439 ± 0.483
2.671PheVal: 2.671 ± 0.412
0.697PheTrp: 0.697 ± 0.214
2.323PheTyr: 2.323 ± 0.319
0.0PheXaa: 0.0 ± 0.0
Gly
3.368GlyAla: 3.368 ± 0.409
0.406GlyCys: 0.406 ± 0.145
3.484GlyAsp: 3.484 ± 0.618
3.949GlyGlu: 3.949 ± 0.489
3.136GlyPhe: 3.136 ± 0.473
3.136GlyGly: 3.136 ± 0.415
1.916GlyHis: 1.916 ± 0.342
5.342GlyIle: 5.342 ± 0.526
4.297GlyLys: 4.297 ± 0.565
5.284GlyLeu: 5.284 ± 0.718
1.742GlyMet: 1.742 ± 0.35
3.6GlyAsn: 3.6 ± 0.424
0.987GlyPro: 0.987 ± 0.385
3.194GlyGln: 3.194 ± 0.465
3.252GlyArg: 3.252 ± 0.494
3.716GlySer: 3.716 ± 0.532
3.949GlyThr: 3.949 ± 0.41
3.774GlyVal: 3.774 ± 0.442
0.639GlyTrp: 0.639 ± 0.184
3.02GlyTyr: 3.02 ± 0.409
0.0GlyXaa: 0.0 ± 0.0
His
0.755HisAla: 0.755 ± 0.159
0.174HisCys: 0.174 ± 0.109
0.987HisAsp: 0.987 ± 0.286
0.987HisGlu: 0.987 ± 0.238
0.929HisPhe: 0.929 ± 0.25
1.394HisGly: 1.394 ± 0.244
0.523HisHis: 0.523 ± 0.19
1.452HisIle: 1.452 ± 0.25
0.929HisLys: 0.929 ± 0.239
1.394HisLeu: 1.394 ± 0.259
0.29HisMet: 0.29 ± 0.134
0.755HisAsn: 0.755 ± 0.197
1.103HisPro: 1.103 ± 0.283
0.871HisGln: 0.871 ± 0.24
1.045HisArg: 1.045 ± 0.251
0.871HisSer: 0.871 ± 0.224
0.987HisThr: 0.987 ± 0.292
1.161HisVal: 1.161 ± 0.241
0.232HisTrp: 0.232 ± 0.112
0.871HisTyr: 0.871 ± 0.229
0.0HisXaa: 0.0 ± 0.0
Ile
5.052IleAla: 5.052 ± 0.598
0.581IleCys: 0.581 ± 0.152
5.458IleAsp: 5.458 ± 0.545
5.284IleGlu: 5.284 ± 0.674
2.497IlePhe: 2.497 ± 0.419
5.168IleGly: 5.168 ± 0.671
1.278IleHis: 1.278 ± 0.32
3.658IleIle: 3.658 ± 0.34
5.11IleLys: 5.11 ± 0.699
6.097IleLeu: 6.097 ± 0.764
1.219IleMet: 1.219 ± 0.26
3.542IleAsn: 3.542 ± 0.401
2.497IlePro: 2.497 ± 0.388
2.671IleGln: 2.671 ± 0.367
2.845IleArg: 2.845 ± 0.365
5.226IleSer: 5.226 ± 0.703
5.11IleThr: 5.11 ± 0.853
4.645IleVal: 4.645 ± 0.57
0.813IleTrp: 0.813 ± 0.291
2.613IleTyr: 2.613 ± 0.398
0.0IleXaa: 0.0 ± 0.0
Lys
5.168LysAla: 5.168 ± 0.494
0.348LysCys: 0.348 ± 0.137
3.716LysAsp: 3.716 ± 0.476
6.039LysGlu: 6.039 ± 0.658
3.252LysPhe: 3.252 ± 0.479
3.833LysGly: 3.833 ± 0.398
1.452LysHis: 1.452 ± 0.303
6.039LysIle: 6.039 ± 0.59
6.213LysLys: 6.213 ± 0.844
6.794LysLeu: 6.794 ± 0.641
2.09LysMet: 2.09 ± 0.383
3.891LysAsn: 3.891 ± 0.551
2.497LysPro: 2.497 ± 0.356
3.136LysGln: 3.136 ± 0.484
3.949LysArg: 3.949 ± 0.578
4.123LysSer: 4.123 ± 0.509
3.833LysThr: 3.833 ± 0.541
4.587LysVal: 4.587 ± 0.543
1.278LysTrp: 1.278 ± 0.293
2.613LysTyr: 2.613 ± 0.476
0.0LysXaa: 0.0 ± 0.0
Leu
6.562LeuAla: 6.562 ± 0.687
0.523LeuCys: 0.523 ± 0.2
5.168LeuAsp: 5.168 ± 0.496
7.723LeuGlu: 7.723 ± 0.618
3.252LeuPhe: 3.252 ± 0.473
4.878LeuGly: 4.878 ± 0.421
1.394LeuHis: 1.394 ± 0.275
5.168LeuIle: 5.168 ± 0.609
6.388LeuLys: 6.388 ± 0.617
7.897LeuLeu: 7.897 ± 1.057
2.032LeuMet: 2.032 ± 0.356
4.239LeuAsn: 4.239 ± 0.482
3.542LeuPro: 3.542 ± 0.579
3.252LeuGln: 3.252 ± 0.499
3.658LeuArg: 3.658 ± 0.399
7.317LeuSer: 7.317 ± 0.681
6.504LeuThr: 6.504 ± 0.646
6.155LeuVal: 6.155 ± 0.752
0.871LeuTrp: 0.871 ± 0.307
3.658LeuTyr: 3.658 ± 0.586
0.0LeuXaa: 0.0 ± 0.0
Met
1.974MetAla: 1.974 ± 0.255
0.232MetCys: 0.232 ± 0.114
1.684MetAsp: 1.684 ± 0.357
1.452MetGlu: 1.452 ± 0.334
0.813MetPhe: 0.813 ± 0.231
1.394MetGly: 1.394 ± 0.365
0.116MetHis: 0.116 ± 0.093
2.207MetIle: 2.207 ± 0.401
1.684MetLys: 1.684 ± 0.287
1.858MetLeu: 1.858 ± 0.312
0.697MetMet: 0.697 ± 0.216
1.045MetAsn: 1.045 ± 0.262
0.755MetPro: 0.755 ± 0.165
0.755MetGln: 0.755 ± 0.228
1.103MetArg: 1.103 ± 0.197
2.207MetSer: 2.207 ± 0.392
1.452MetThr: 1.452 ± 0.383
1.394MetVal: 1.394 ± 0.242
0.29MetTrp: 0.29 ± 0.13
0.348MetTyr: 0.348 ± 0.122
0.0MetXaa: 0.0 ± 0.0
Asn
3.949AsnAla: 3.949 ± 0.515
0.581AsnCys: 0.581 ± 0.16
2.265AsnAsp: 2.265 ± 0.315
2.962AsnGlu: 2.962 ± 0.453
2.381AsnPhe: 2.381 ± 0.417
4.181AsnGly: 4.181 ± 0.542
1.336AsnHis: 1.336 ± 0.266
3.252AsnIle: 3.252 ± 0.399
3.716AsnLys: 3.716 ± 0.503
4.587AsnLeu: 4.587 ± 0.74
0.987AsnMet: 0.987 ± 0.257
2.787AsnAsn: 2.787 ± 0.543
2.265AsnPro: 2.265 ± 0.379
2.265AsnGln: 2.265 ± 0.351
2.613AsnArg: 2.613 ± 0.396
2.845AsnSer: 2.845 ± 0.355
1.916AsnThr: 1.916 ± 0.418
3.136AsnVal: 3.136 ± 0.542
1.045AsnTrp: 1.045 ± 0.276
1.568AsnTyr: 1.568 ± 0.276
0.0AsnXaa: 0.0 ± 0.0
Pro
1.51ProAla: 1.51 ± 0.254
0.29ProCys: 0.29 ± 0.138
2.265ProAsp: 2.265 ± 0.318
2.497ProGlu: 2.497 ± 0.368
1.336ProPhe: 1.336 ± 0.284
0.871ProGly: 0.871 ± 0.265
0.697ProHis: 0.697 ± 0.248
1.684ProIle: 1.684 ± 0.301
2.439ProLys: 2.439 ± 0.441
3.02ProLeu: 3.02 ± 0.375
0.465ProMet: 0.465 ± 0.17
1.684ProAsn: 1.684 ± 0.308
0.871ProPro: 0.871 ± 0.267
0.871ProGln: 0.871 ± 0.235
1.742ProArg: 1.742 ± 0.3
2.265ProSer: 2.265 ± 0.351
2.265ProThr: 2.265 ± 0.419
2.207ProVal: 2.207 ± 0.398
0.29ProTrp: 0.29 ± 0.126
1.452ProTyr: 1.452 ± 0.288
0.0ProXaa: 0.0 ± 0.0
Gln
3.833GlnAla: 3.833 ± 0.619
0.406GlnCys: 0.406 ± 0.142
1.8GlnAsp: 1.8 ± 0.322
3.02GlnGlu: 3.02 ± 0.463
1.858GlnPhe: 1.858 ± 0.316
2.149GlnGly: 2.149 ± 0.379
0.639GlnHis: 0.639 ± 0.156
2.729GlnIle: 2.729 ± 0.357
3.136GlnLys: 3.136 ± 0.487
4.297GlnLeu: 4.297 ± 0.485
1.219GlnMet: 1.219 ± 0.297
1.858GlnAsn: 1.858 ± 0.365
1.278GlnPro: 1.278 ± 0.337
1.626GlnGln: 1.626 ± 0.234
1.742GlnArg: 1.742 ± 0.405
2.555GlnSer: 2.555 ± 0.339
2.787GlnThr: 2.787 ± 0.743
3.542GlnVal: 3.542 ± 0.502
0.697GlnTrp: 0.697 ± 0.232
0.929GlnTyr: 0.929 ± 0.277
0.0GlnXaa: 0.0 ± 0.0
Arg
2.381ArgAla: 2.381 ± 0.322
0.523ArgCys: 0.523 ± 0.159
2.613ArgAsp: 2.613 ± 0.415
3.716ArgGlu: 3.716 ± 0.327
1.51ArgPhe: 1.51 ± 0.281
2.555ArgGly: 2.555 ± 0.333
0.929ArgHis: 0.929 ± 0.222
3.716ArgIle: 3.716 ± 0.614
4.065ArgLys: 4.065 ± 0.647
4.355ArgLeu: 4.355 ± 0.46
0.813ArgMet: 0.813 ± 0.229
2.323ArgAsn: 2.323 ± 0.354
1.51ArgPro: 1.51 ± 0.307
2.323ArgGln: 2.323 ± 0.336
1.452ArgArg: 1.452 ± 0.279
2.323ArgSer: 2.323 ± 0.354
2.555ArgThr: 2.555 ± 0.538
2.729ArgVal: 2.729 ± 0.435
0.813ArgTrp: 0.813 ± 0.213
1.568ArgTyr: 1.568 ± 0.334
0.0ArgXaa: 0.0 ± 0.0
Ser
4.181SerAla: 4.181 ± 0.482
0.465SerCys: 0.465 ± 0.154
3.774SerAsp: 3.774 ± 0.516
4.123SerGlu: 4.123 ± 0.484
2.613SerPhe: 2.613 ± 0.496
4.471SerGly: 4.471 ± 0.468
1.103SerHis: 1.103 ± 0.246
4.878SerIle: 4.878 ± 0.463
4.645SerLys: 4.645 ± 0.471
4.936SerLeu: 4.936 ± 0.477
1.394SerMet: 1.394 ± 0.245
3.252SerAsn: 3.252 ± 0.416
2.149SerPro: 2.149 ± 0.348
2.903SerGln: 2.903 ± 0.625
3.31SerArg: 3.31 ± 0.473
4.878SerSer: 4.878 ± 0.756
4.355SerThr: 4.355 ± 0.492
3.949SerVal: 3.949 ± 0.455
1.278SerTrp: 1.278 ± 0.216
2.555SerTyr: 2.555 ± 0.465
0.0SerXaa: 0.0 ± 0.0
Thr
4.007ThrAla: 4.007 ± 0.535
0.174ThrCys: 0.174 ± 0.109
2.903ThrAsp: 2.903 ± 0.571
4.645ThrGlu: 4.645 ± 0.617
2.09ThrPhe: 2.09 ± 0.42
4.181ThrGly: 4.181 ± 0.634
0.929ThrHis: 0.929 ± 0.199
4.936ThrIle: 4.936 ± 0.663
4.355ThrLys: 4.355 ± 0.458
5.517ThrLeu: 5.517 ± 0.5
1.219ThrMet: 1.219 ± 0.276
2.787ThrAsn: 2.787 ± 0.41
2.032ThrPro: 2.032 ± 0.411
2.09ThrGln: 2.09 ± 0.662
1.974ThrArg: 1.974 ± 0.489
4.878ThrSer: 4.878 ± 0.809
4.645ThrThr: 4.645 ± 0.731
5.342ThrVal: 5.342 ± 0.611
0.755ThrTrp: 0.755 ± 0.196
2.381ThrTyr: 2.381 ± 0.421
0.0ThrXaa: 0.0 ± 0.0
Val
4.297ValAla: 4.297 ± 0.633
0.639ValCys: 0.639 ± 0.221
3.368ValAsp: 3.368 ± 0.55
5.284ValGlu: 5.284 ± 0.584
2.497ValPhe: 2.497 ± 0.468
3.426ValGly: 3.426 ± 0.462
0.929ValHis: 0.929 ± 0.209
3.833ValIle: 3.833 ± 0.474
4.471ValLys: 4.471 ± 0.462
6.213ValLeu: 6.213 ± 0.558
1.045ValMet: 1.045 ± 0.235
2.439ValAsn: 2.439 ± 0.371
1.974ValPro: 1.974 ± 0.25
2.497ValGln: 2.497 ± 0.386
3.31ValArg: 3.31 ± 0.648
4.239ValSer: 4.239 ± 0.618
4.762ValThr: 4.762 ± 0.618
3.542ValVal: 3.542 ± 0.381
1.045ValTrp: 1.045 ± 0.268
2.787ValTyr: 2.787 ± 0.47
0.0ValXaa: 0.0 ± 0.0
Trp
0.871TrpAla: 0.871 ± 0.223
0.174TrpCys: 0.174 ± 0.119
0.406TrpAsp: 0.406 ± 0.135
1.161TrpGlu: 1.161 ± 0.293
1.045TrpPhe: 1.045 ± 0.396
0.697TrpGly: 0.697 ± 0.166
0.29TrpHis: 0.29 ± 0.127
0.987TrpIle: 0.987 ± 0.283
0.639TrpLys: 0.639 ± 0.243
0.929TrpLeu: 0.929 ± 0.234
0.348TrpMet: 0.348 ± 0.137
1.336TrpAsn: 1.336 ± 0.255
0.116TrpPro: 0.116 ± 0.076
0.755TrpGln: 0.755 ± 0.202
0.639TrpArg: 0.639 ± 0.235
0.755TrpSer: 0.755 ± 0.222
1.103TrpThr: 1.103 ± 0.244
0.697TrpVal: 0.697 ± 0.185
0.348TrpTrp: 0.348 ± 0.168
0.348TrpTyr: 0.348 ± 0.148
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.265TyrAla: 2.265 ± 0.353
0.755TyrCys: 0.755 ± 0.24
3.02TyrAsp: 3.02 ± 0.473
2.671TyrGlu: 2.671 ± 0.413
1.278TyrPhe: 1.278 ± 0.281
2.671TyrGly: 2.671 ± 0.513
0.929TyrHis: 0.929 ± 0.259
2.729TyrIle: 2.729 ± 0.453
2.729TyrLys: 2.729 ± 0.416
3.949TyrLeu: 3.949 ± 0.439
1.045TyrMet: 1.045 ± 0.226
1.568TyrAsn: 1.568 ± 0.304
1.452TyrPro: 1.452 ± 0.27
2.09TyrGln: 2.09 ± 0.299
2.032TyrArg: 2.032 ± 0.342
2.497TyrSer: 2.497 ± 0.451
1.974TyrThr: 1.974 ± 0.369
1.916TyrVal: 1.916 ± 0.298
0.348TyrTrp: 0.348 ± 0.11
1.103TyrTyr: 1.103 ± 0.28
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (17222 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski