Amino acid dipepetide frequency for Streptomyces phage SF3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
22.458AlaAla: 22.458 ± 1.349
0.885AlaCys: 0.885 ± 0.231
9.459AlaAsp: 9.459 ± 0.718
9.238AlaGlu: 9.238 ± 1.038
3.098AlaPhe: 3.098 ± 0.546
10.676AlaGly: 10.676 ± 1.13
2.6AlaHis: 2.6 ± 0.425
4.425AlaIle: 4.425 ± 0.528
4.647AlaLys: 4.647 ± 0.738
12.225AlaLeu: 12.225 ± 1.075
2.434AlaMet: 2.434 ± 0.377
2.268AlaAsn: 2.268 ± 0.311
7.412AlaPro: 7.412 ± 0.721
3.761AlaGln: 3.761 ± 0.451
9.072AlaArg: 9.072 ± 0.781
4.757AlaSer: 4.757 ± 0.632
7.246AlaThr: 7.246 ± 0.615
10.344AlaVal: 10.344 ± 0.949
2.102AlaTrp: 2.102 ± 0.328
3.596AlaTyr: 3.596 ± 0.439
0.0AlaXaa: 0.0 ± 0.0
Cys
0.83CysAla: 0.83 ± 0.213
0.166CysCys: 0.166 ± 0.098
0.608CysAsp: 0.608 ± 0.211
0.774CysGlu: 0.774 ± 0.238
0.277CysPhe: 0.277 ± 0.192
1.328CysGly: 1.328 ± 0.279
0.166CysHis: 0.166 ± 0.104
0.443CysIle: 0.443 ± 0.174
0.443CysLys: 0.443 ± 0.136
0.719CysLeu: 0.719 ± 0.226
0.055CysMet: 0.055 ± 0.051
0.111CysAsn: 0.111 ± 0.071
0.885CysPro: 0.885 ± 0.259
0.221CysGln: 0.221 ± 0.11
1.162CysArg: 1.162 ± 0.227
0.774CysSer: 0.774 ± 0.251
0.443CysThr: 0.443 ± 0.171
0.664CysVal: 0.664 ± 0.205
0.166CysTrp: 0.166 ± 0.087
0.111CysTyr: 0.111 ± 0.08
0.0CysXaa: 0.0 ± 0.0
Asp
8.187AspAla: 8.187 ± 0.657
0.443AspCys: 0.443 ± 0.151
5.532AspAsp: 5.532 ± 0.967
5.366AspGlu: 5.366 ± 0.572
1.162AspPhe: 1.162 ± 0.216
7.412AspGly: 7.412 ± 0.873
1.494AspHis: 1.494 ± 0.319
1.715AspIle: 1.715 ± 0.386
2.268AspLys: 2.268 ± 0.545
6.029AspLeu: 6.029 ± 0.668
1.494AspMet: 1.494 ± 0.303
1.162AspAsn: 1.162 ± 0.256
4.923AspPro: 4.923 ± 0.516
2.489AspGln: 2.489 ± 0.436
5.642AspArg: 5.642 ± 0.657
3.374AspSer: 3.374 ± 0.479
3.264AspThr: 3.264 ± 0.43
5.2AspVal: 5.2 ± 0.551
1.438AspTrp: 1.438 ± 0.352
1.604AspTyr: 1.604 ± 0.345
0.0AspXaa: 0.0 ± 0.0
Glu
8.076GluAla: 8.076 ± 0.943
0.664GluCys: 0.664 ± 0.193
3.374GluAsp: 3.374 ± 0.45
2.157GluGlu: 2.157 ± 0.472
1.604GluPhe: 1.604 ± 0.307
4.315GluGly: 4.315 ± 0.528
2.047GluHis: 2.047 ± 0.382
2.655GluIle: 2.655 ± 0.381
1.328GluLys: 1.328 ± 0.323
6.251GluLeu: 6.251 ± 0.643
1.272GluMet: 1.272 ± 0.305
1.217GluAsn: 1.217 ± 0.267
3.817GluPro: 3.817 ± 0.644
2.821GluGln: 2.821 ± 0.421
4.812GluArg: 4.812 ± 0.642
2.323GluSer: 2.323 ± 0.401
3.153GluThr: 3.153 ± 0.458
3.761GluVal: 3.761 ± 0.485
1.106GluTrp: 1.106 ± 0.224
1.494GluTyr: 1.494 ± 0.261
0.0GluXaa: 0.0 ± 0.0
Phe
2.766PheAla: 2.766 ± 0.425
0.221PheCys: 0.221 ± 0.112
1.217PheAsp: 1.217 ± 0.335
1.162PheGlu: 1.162 ± 0.233
0.332PhePhe: 0.332 ± 0.122
2.268PheGly: 2.268 ± 0.518
0.498PheHis: 0.498 ± 0.153
0.83PheIle: 0.83 ± 0.24
0.387PheLys: 0.387 ± 0.169
1.549PheLeu: 1.549 ± 0.238
0.166PheMet: 0.166 ± 0.1
0.719PheAsn: 0.719 ± 0.211
0.664PhePro: 0.664 ± 0.183
0.221PheGln: 0.221 ± 0.107
2.379PheArg: 2.379 ± 0.44
0.885PheSer: 0.885 ± 0.238
1.77PheThr: 1.77 ± 0.326
1.991PheVal: 1.991 ± 0.368
0.277PheTrp: 0.277 ± 0.126
0.443PheTyr: 0.443 ± 0.169
0.0PheXaa: 0.0 ± 0.0
Gly
9.736GlyAla: 9.736 ± 0.788
0.774GlyCys: 0.774 ± 0.193
6.472GlyAsp: 6.472 ± 0.947
5.753GlyGlu: 5.753 ± 0.536
1.991GlyPhe: 1.991 ± 0.575
8.851GlyGly: 8.851 ± 1.255
2.047GlyHis: 2.047 ± 0.344
3.706GlyIle: 3.706 ± 0.692
3.264GlyLys: 3.264 ± 0.518
6.583GlyLeu: 6.583 ± 0.73
1.825GlyMet: 1.825 ± 0.308
1.604GlyAsn: 1.604 ± 0.456
5.144GlyPro: 5.144 ± 0.518
2.876GlyGln: 2.876 ± 0.571
7.357GlyArg: 7.357 ± 0.588
4.702GlySer: 4.702 ± 0.631
4.812GlyThr: 4.812 ± 0.756
6.306GlyVal: 6.306 ± 0.695
2.047GlyTrp: 2.047 ± 0.39
2.323GlyTyr: 2.323 ± 0.381
0.0GlyXaa: 0.0 ± 0.0
His
3.264HisAla: 3.264 ± 0.49
0.277HisCys: 0.277 ± 0.107
1.825HisAsp: 1.825 ± 0.371
1.162HisGlu: 1.162 ± 0.252
0.608HisPhe: 0.608 ± 0.186
2.157HisGly: 2.157 ± 0.392
0.774HisHis: 0.774 ± 0.222
0.774HisIle: 0.774 ± 0.188
0.387HisLys: 0.387 ± 0.152
1.77HisLeu: 1.77 ± 0.366
0.443HisMet: 0.443 ± 0.176
0.885HisAsn: 0.885 ± 0.223
1.494HisPro: 1.494 ± 0.243
0.277HisGln: 0.277 ± 0.119
1.881HisArg: 1.881 ± 0.429
1.106HisSer: 1.106 ± 0.248
1.383HisThr: 1.383 ± 0.26
2.213HisVal: 2.213 ± 0.346
0.221HisTrp: 0.221 ± 0.113
0.277HisTyr: 0.277 ± 0.128
0.0HisXaa: 0.0 ± 0.0
Ile
5.255IleAla: 5.255 ± 0.584
0.332IleCys: 0.332 ± 0.128
2.379IleAsp: 2.379 ± 0.359
3.264IleGlu: 3.264 ± 0.384
0.443IlePhe: 0.443 ± 0.138
3.706IleGly: 3.706 ± 0.473
0.553IleHis: 0.553 ± 0.18
1.438IleIle: 1.438 ± 0.326
1.162IleLys: 1.162 ± 0.335
2.932IleLeu: 2.932 ± 0.476
0.498IleMet: 0.498 ± 0.163
1.328IleAsn: 1.328 ± 0.339
1.715IlePro: 1.715 ± 0.33
0.664IleGln: 0.664 ± 0.181
3.264IleArg: 3.264 ± 0.393
1.881IleSer: 1.881 ± 0.303
3.098IleThr: 3.098 ± 0.45
2.987IleVal: 2.987 ± 0.368
0.443IleTrp: 0.443 ± 0.182
0.664IleTyr: 0.664 ± 0.15
0.0IleXaa: 0.0 ± 0.0
Lys
4.038LysAla: 4.038 ± 0.879
0.387LysCys: 0.387 ± 0.136
1.936LysAsp: 1.936 ± 0.44
1.106LysGlu: 1.106 ± 0.228
0.553LysPhe: 0.553 ± 0.187
2.102LysGly: 2.102 ± 0.361
0.664LysHis: 0.664 ± 0.212
1.549LysIle: 1.549 ± 0.331
1.162LysLys: 1.162 ± 0.302
2.489LysLeu: 2.489 ± 0.379
0.94LysMet: 0.94 ± 0.295
1.162LysAsn: 1.162 ± 0.307
1.494LysPro: 1.494 ± 0.266
0.553LysGln: 0.553 ± 0.197
2.268LysArg: 2.268 ± 0.379
1.383LysSer: 1.383 ± 0.272
2.213LysThr: 2.213 ± 0.386
2.323LysVal: 2.323 ± 0.44
0.719LysTrp: 0.719 ± 0.206
0.608LysTyr: 0.608 ± 0.225
0.0LysXaa: 0.0 ± 0.0
Leu
12.059LeuAla: 12.059 ± 0.941
0.94LeuCys: 0.94 ± 0.273
7.246LeuAsp: 7.246 ± 0.773
2.213LeuGlu: 2.213 ± 0.405
1.383LeuPhe: 1.383 ± 0.331
6.693LeuGly: 6.693 ± 0.587
2.102LeuHis: 2.102 ± 0.393
2.213LeuIle: 2.213 ± 0.356
1.936LeuLys: 1.936 ± 0.493
6.14LeuLeu: 6.14 ± 0.72
1.383LeuMet: 1.383 ± 0.218
1.936LeuAsn: 1.936 ± 0.343
6.417LeuPro: 6.417 ± 0.586
1.217LeuGln: 1.217 ± 0.283
7.8LeuArg: 7.8 ± 0.642
5.31LeuSer: 5.31 ± 0.605
4.591LeuThr: 4.591 ± 0.516
6.693LeuVal: 6.693 ± 0.598
2.102LeuTrp: 2.102 ± 0.421
1.659LeuTyr: 1.659 ± 0.278
0.0LeuXaa: 0.0 ± 0.0
Met
1.659MetAla: 1.659 ± 0.293
0.332MetCys: 0.332 ± 0.129
0.996MetAsp: 0.996 ± 0.263
0.221MetGlu: 0.221 ± 0.111
0.277MetPhe: 0.277 ± 0.119
1.217MetGly: 1.217 ± 0.243
0.443MetHis: 0.443 ± 0.162
0.885MetIle: 0.885 ± 0.262
0.608MetLys: 0.608 ± 0.185
1.604MetLeu: 1.604 ± 0.33
0.443MetMet: 0.443 ± 0.146
0.387MetAsn: 0.387 ± 0.156
1.881MetPro: 1.881 ± 0.332
0.885MetGln: 0.885 ± 0.215
2.102MetArg: 2.102 ± 0.357
1.328MetSer: 1.328 ± 0.239
2.102MetThr: 2.102 ± 0.314
0.774MetVal: 0.774 ± 0.249
0.608MetTrp: 0.608 ± 0.162
0.277MetTyr: 0.277 ± 0.123
0.0MetXaa: 0.0 ± 0.0
Asn
3.54AsnAla: 3.54 ± 0.528
0.332AsnCys: 0.332 ± 0.142
1.494AsnAsp: 1.494 ± 0.303
0.996AsnGlu: 0.996 ± 0.249
0.553AsnPhe: 0.553 ± 0.179
3.042AsnGly: 3.042 ± 0.472
0.111AsnHis: 0.111 ± 0.078
0.885AsnIle: 0.885 ± 0.247
0.387AsnLys: 0.387 ± 0.149
1.272AsnLeu: 1.272 ± 0.272
0.443AsnMet: 0.443 ± 0.16
0.664AsnAsn: 0.664 ± 0.202
1.715AsnPro: 1.715 ± 0.266
0.608AsnGln: 0.608 ± 0.238
1.77AsnArg: 1.77 ± 0.307
0.83AsnSer: 0.83 ± 0.23
1.715AsnThr: 1.715 ± 0.458
1.936AsnVal: 1.936 ± 0.392
0.332AsnTrp: 0.332 ± 0.197
0.332AsnTyr: 0.332 ± 0.156
0.0AsnXaa: 0.0 ± 0.0
Pro
9.016ProAla: 9.016 ± 0.7
0.608ProCys: 0.608 ± 0.188
4.812ProAsp: 4.812 ± 0.699
4.757ProGlu: 4.757 ± 0.625
0.885ProPhe: 0.885 ± 0.218
5.421ProGly: 5.421 ± 0.59
1.162ProHis: 1.162 ± 0.243
2.323ProIle: 2.323 ± 0.453
2.047ProLys: 2.047 ± 0.368
4.259ProLeu: 4.259 ± 0.437
1.106ProMet: 1.106 ± 0.281
1.549ProAsn: 1.549 ± 0.254
3.43ProPro: 3.43 ± 0.521
1.715ProGln: 1.715 ± 0.339
4.591ProArg: 4.591 ± 0.67
2.876ProSer: 2.876 ± 0.384
3.983ProThr: 3.983 ± 0.448
5.2ProVal: 5.2 ± 0.593
0.83ProTrp: 0.83 ± 0.218
2.102ProTyr: 2.102 ± 0.393
0.0ProXaa: 0.0 ± 0.0
Gln
3.319GlnAla: 3.319 ± 0.562
0.498GlnCys: 0.498 ± 0.148
1.162GlnAsp: 1.162 ± 0.249
0.83GlnGlu: 0.83 ± 0.217
1.162GlnPhe: 1.162 ± 0.198
1.991GlnGly: 1.991 ± 0.285
0.94GlnHis: 0.94 ± 0.248
1.549GlnIle: 1.549 ± 0.315
0.608GlnLys: 0.608 ± 0.202
2.876GlnLeu: 2.876 ± 0.399
0.387GlnMet: 0.387 ± 0.133
0.664GlnAsn: 0.664 ± 0.158
2.545GlnPro: 2.545 ± 0.319
1.77GlnGln: 1.77 ± 0.415
3.042GlnArg: 3.042 ± 0.464
1.659GlnSer: 1.659 ± 0.307
2.047GlnThr: 2.047 ± 0.347
2.434GlnVal: 2.434 ± 0.509
0.608GlnTrp: 0.608 ± 0.185
0.664GlnTyr: 0.664 ± 0.158
0.0GlnXaa: 0.0 ± 0.0
Arg
9.736ArgAla: 9.736 ± 0.785
1.051ArgCys: 1.051 ± 0.28
4.757ArgAsp: 4.757 ± 0.503
5.255ArgGlu: 5.255 ± 0.594
1.549ArgPhe: 1.549 ± 0.348
5.753ArgGly: 5.753 ± 0.675
2.268ArgHis: 2.268 ± 0.343
4.259ArgIle: 4.259 ± 0.617
2.545ArgLys: 2.545 ± 0.393
6.472ArgLeu: 6.472 ± 0.59
1.604ArgMet: 1.604 ± 0.291
1.494ArgAsn: 1.494 ± 0.26
5.587ArgPro: 5.587 ± 0.639
3.153ArgGln: 3.153 ± 0.439
7.468ArgArg: 7.468 ± 0.972
3.761ArgSer: 3.761 ± 0.462
5.366ArgThr: 5.366 ± 0.473
5.919ArgVal: 5.919 ± 0.598
1.77ArgTrp: 1.77 ± 0.295
2.6ArgTyr: 2.6 ± 0.429
0.0ArgXaa: 0.0 ± 0.0
Ser
6.029SerAla: 6.029 ± 0.643
0.387SerCys: 0.387 ± 0.141
3.264SerAsp: 3.264 ± 0.38
2.102SerGlu: 2.102 ± 0.254
0.83SerPhe: 0.83 ± 0.254
5.144SerGly: 5.144 ± 0.557
0.996SerHis: 0.996 ± 0.196
1.549SerIle: 1.549 ± 0.335
1.936SerLys: 1.936 ± 0.404
3.872SerLeu: 3.872 ± 0.502
1.051SerMet: 1.051 ± 0.232
1.051SerAsn: 1.051 ± 0.226
2.489SerPro: 2.489 ± 0.377
1.494SerGln: 1.494 ± 0.377
3.042SerArg: 3.042 ± 0.434
2.545SerSer: 2.545 ± 0.463
3.651SerThr: 3.651 ± 0.429
3.54SerVal: 3.54 ± 0.488
1.604SerTrp: 1.604 ± 0.341
1.051SerTyr: 1.051 ± 0.273
0.0SerXaa: 0.0 ± 0.0
Thr
7.8ThrAla: 7.8 ± 0.752
0.83ThrCys: 0.83 ± 0.206
4.315ThrAsp: 4.315 ± 0.492
3.54ThrGlu: 3.54 ± 0.445
1.438ThrPhe: 1.438 ± 0.358
5.31ThrGly: 5.31 ± 0.532
1.106ThrHis: 1.106 ± 0.263
2.268ThrIle: 2.268 ± 0.345
1.604ThrLys: 1.604 ± 0.305
4.536ThrLeu: 4.536 ± 0.592
0.608ThrMet: 0.608 ± 0.179
1.272ThrAsn: 1.272 ± 0.315
4.647ThrPro: 4.647 ± 0.544
1.881ThrGln: 1.881 ± 0.346
4.481ThrArg: 4.481 ± 0.468
3.098ThrSer: 3.098 ± 0.404
3.153ThrThr: 3.153 ± 0.48
6.195ThrVal: 6.195 ± 0.622
1.106ThrTrp: 1.106 ± 0.291
1.659ThrTyr: 1.659 ± 0.319
0.0ThrXaa: 0.0 ± 0.0
Val
9.57ValAla: 9.57 ± 0.834
0.719ValCys: 0.719 ± 0.225
6.251ValAsp: 6.251 ± 0.566
6.583ValGlu: 6.583 ± 0.8
1.77ValPhe: 1.77 ± 0.277
5.974ValGly: 5.974 ± 0.835
2.047ValHis: 2.047 ± 0.444
3.042ValIle: 3.042 ± 0.394
2.379ValLys: 2.379 ± 0.457
6.195ValLeu: 6.195 ± 0.58
2.047ValMet: 2.047 ± 0.306
2.545ValAsn: 2.545 ± 0.547
4.315ValPro: 4.315 ± 0.487
2.047ValGln: 2.047 ± 0.328
5.476ValArg: 5.476 ± 0.633
2.545ValSer: 2.545 ± 0.366
4.536ValThr: 4.536 ± 0.557
5.366ValVal: 5.366 ± 0.641
1.383ValTrp: 1.383 ± 0.281
1.715ValTyr: 1.715 ± 0.276
0.0ValXaa: 0.0 ± 0.0
Trp
1.881TrpAla: 1.881 ± 0.357
0.387TrpCys: 0.387 ± 0.135
1.217TrpAsp: 1.217 ± 0.288
0.553TrpGlu: 0.553 ± 0.203
0.387TrpPhe: 0.387 ± 0.144
1.549TrpGly: 1.549 ± 0.274
0.719TrpHis: 0.719 ± 0.193
0.774TrpIle: 0.774 ± 0.261
0.166TrpLys: 0.166 ± 0.114
2.268TrpLeu: 2.268 ± 0.431
0.498TrpMet: 0.498 ± 0.2
0.387TrpAsn: 0.387 ± 0.177
0.83TrpPro: 0.83 ± 0.238
1.438TrpGln: 1.438 ± 0.347
2.213TrpArg: 2.213 ± 0.4
1.549TrpSer: 1.549 ± 0.261
0.83TrpThr: 0.83 ± 0.21
1.162TrpVal: 1.162 ± 0.245
0.94TrpTrp: 0.94 ± 0.31
0.166TrpTyr: 0.166 ± 0.097
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.485TyrAla: 3.485 ± 0.465
0.111TyrCys: 0.111 ± 0.064
1.881TyrAsp: 1.881 ± 0.36
1.217TyrGlu: 1.217 ± 0.301
0.443TyrPhe: 0.443 ± 0.149
3.264TyrGly: 3.264 ± 0.396
0.443TyrHis: 0.443 ± 0.144
0.719TyrIle: 0.719 ± 0.213
0.387TyrLys: 0.387 ± 0.169
1.991TyrLeu: 1.991 ± 0.317
0.277TyrMet: 0.277 ± 0.126
0.553TyrAsn: 0.553 ± 0.152
1.328TyrPro: 1.328 ± 0.268
0.664TyrGln: 0.664 ± 0.21
2.545TyrArg: 2.545 ± 0.41
0.94TyrSer: 0.94 ± 0.199
1.494TyrThr: 1.494 ± 0.268
1.494TyrVal: 1.494 ± 0.263
0.111TyrTrp: 0.111 ± 0.08
0.719TyrTyr: 0.719 ± 0.232
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 90 proteins (18079 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski