Amino acid dipeptide frequency for Streptomyces phage SF1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
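
As an illustration, the calculation can be sketched in Python. This is a minimal sketch and not the code used by the database: the function name dipeptide_permille, the use of one-letter codes (with X standing for Xaa), and the normalization by the total number of dipeptides are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is treated as X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Hypothetical helper: normalizes by the total number of dipeptides counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: "MAAG" gives MA, AA, AG and "AAL" gives AA, AL (5 dipeptides in total).
freqs = dipeptide_permille(["MAAG", "AAL"])
```

In this toy input the pair AA accounts for 2 of the 5 dipeptides, so its frequency is 400‰.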

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
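
The expected value and the over/under-representation test can be written out as a short Python sketch. The function names expected_permille and classify are hypothetical, and the sketch assumes that the comparison is made against the uniform expectation described above; the three observed values are taken from the table (AlaAla, CysCys, AlaMet).

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation in per mille: every ordered pair is equally likely."""
    return 1000.0 / n_residue_types ** 2


def classify(observed_permille, expected):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if observed_permille > expected:
        return "over"   # would be marked red
    if observed_permille < expected:
        return "under"  # would be marked blue
    return "as expected"


expected = expected_permille()  # 1000 / 400 = 2.5 per mille, i.e. 0.25%

# Observed values in per mille, copied from the table.
examples = {"AlaAla": 17.8, "CysCys": 0.074, "AlaMet": 2.585}
labels = {pair: classify(value, expected) for pair, value in examples.items()}
```

With 21 residue types (Xaa included) the same function gives 1000 / 441, roughly 2.27‰.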

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.8 ± 1.278
AlaCys: 0.665 ± 0.352
AlaAsp: 8.199 ± 0.817
AlaGlu: 6.869 ± 1.015
AlaPhe: 3.545 ± 0.563
AlaGly: 11.153 ± 1.449
AlaHis: 1.182 ± 0.375
AlaIle: 4.506 ± 0.63
AlaLys: 5.023 ± 0.887
AlaLeu: 12.704 ± 1.291
AlaMet: 2.585 ± 0.549
AlaAsn: 3.102 ± 0.458
AlaPro: 6.278 ± 0.753
AlaGln: 3.915 ± 0.54
AlaArg: 8.199 ± 0.938
AlaSer: 7.238 ± 0.827
AlaThr: 8.568 ± 0.895
AlaVal: 11.227 ± 0.99
AlaTrp: 2.807 ± 0.473
AlaTyr: 3.324 ± 0.441
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.517 ± 0.199
CysCys: 0.074 ± 0.074
CysAsp: 0.443 ± 0.208
CysGlu: 0.517 ± 0.209
CysPhe: 0.148 ± 0.104
CysGly: 0.517 ± 0.189
CysHis: 0.148 ± 0.105
CysIle: 0.295 ± 0.155
CysLys: 0.295 ± 0.132
CysLeu: 0.443 ± 0.253
CysMet: 0.0 ± 0.0
CysAsn: 0.148 ± 0.114
CysPro: 0.295 ± 0.151
CysGln: 0.222 ± 0.127
CysArg: 0.739 ± 0.247
CysSer: 0.222 ± 0.119
CysThr: 0.295 ± 0.164
CysVal: 0.369 ± 0.153
CysTrp: 0.148 ± 0.107
CysTyr: 0.148 ± 0.1
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.716 ± 0.688
AspCys: 0.665 ± 0.232
AspAsp: 4.062 ± 0.872
AspGlu: 4.136 ± 0.666
AspPhe: 1.034 ± 0.227
AspGly: 7.312 ± 0.796
AspHis: 0.812 ± 0.225
AspIle: 1.477 ± 0.369
AspLys: 1.773 ± 0.568
AspLeu: 4.727 ± 0.743
AspMet: 1.182 ± 0.315
AspAsn: 1.773 ± 0.29
AspPro: 4.358 ± 0.606
AspGln: 2.364 ± 0.42
AspArg: 4.653 ± 0.721
AspSer: 3.915 ± 0.615
AspThr: 3.398 ± 0.363
AspVal: 5.54 ± 0.607
AspTrp: 1.625 ± 0.381
AspTyr: 1.994 ± 0.392
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.238 ± 1.038
GluCys: 0.295 ± 0.153
GluAsp: 3.25 ± 0.647
GluGlu: 1.773 ± 0.44
GluPhe: 2.142 ± 0.395
GluGly: 4.432 ± 0.602
GluHis: 1.403 ± 0.445
GluIle: 2.807 ± 0.395
GluLys: 1.182 ± 0.377
GluLeu: 4.21 ± 0.735
GluMet: 1.256 ± 0.285
GluAsn: 1.403 ± 0.316
GluPro: 2.807 ± 0.608
GluGln: 1.551 ± 0.371
GluArg: 4.136 ± 0.845
GluSer: 2.511 ± 0.381
GluThr: 3.693 ± 0.476
GluVal: 4.136 ± 0.71
GluTrp: 0.812 ± 0.229
GluTyr: 1.403 ± 0.355
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.511 ± 0.479
PheCys: 0.148 ± 0.099
PheAsp: 1.329 ± 0.304
PheGlu: 1.034 ± 0.308
PhePhe: 0.295 ± 0.174
PheGly: 3.25 ± 0.548
PheHis: 0.295 ± 0.16
PheIle: 0.96 ± 0.32
PheLys: 1.108 ± 0.342
PheLeu: 1.329 ± 0.24
PheMet: 0.665 ± 0.216
PheAsn: 0.665 ± 0.192
PhePro: 1.034 ± 0.24
PheGln: 0.739 ± 0.281
PheArg: 2.585 ± 0.473
PheSer: 1.108 ± 0.321
PheThr: 1.92 ± 0.391
PheVal: 1.847 ± 0.411
PheTrp: 0.369 ± 0.171
PheTyr: 0.665 ± 0.342
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.375 ± 1.062
GlyCys: 0.443 ± 0.17
GlyAsp: 6.795 ± 1.072
GlyGlu: 5.54 ± 0.745
GlyPhe: 2.659 ± 0.625
GlyGly: 11.005 ± 1.637
GlyHis: 1.256 ± 0.352
GlyIle: 3.841 ± 0.768
GlyLys: 4.284 ± 0.812
GlyLeu: 8.346 ± 0.727
GlyMet: 2.364 ± 0.37
GlyAsn: 2.142 ± 0.642
GlyPro: 4.506 ± 0.627
GlyGln: 4.062 ± 0.549
GlyArg: 7.017 ± 0.537
GlySer: 5.466 ± 0.823
GlyThr: 6.352 ± 0.964
GlyVal: 7.829 ± 0.772
GlyTrp: 2.807 ± 0.488
GlyTyr: 2.659 ± 0.448
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.847 ± 0.374
HisCys: 0.222 ± 0.134
HisAsp: 1.108 ± 0.306
HisGlu: 0.517 ± 0.239
HisPhe: 0.295 ± 0.149
HisGly: 1.773 ± 0.457
HisHis: 0.443 ± 0.215
HisIle: 0.295 ± 0.177
HisLys: 0.517 ± 0.175
HisLeu: 1.034 ± 0.488
HisMet: 0.148 ± 0.172
HisAsn: 0.443 ± 0.179
HisPro: 1.256 ± 0.357
HisGln: 0.148 ± 0.123
HisArg: 0.591 ± 0.26
HisSer: 0.443 ± 0.172
HisThr: 1.034 ± 0.25
HisVal: 1.477 ± 0.347
HisTrp: 0.222 ± 0.136
HisTyr: 0.148 ± 0.118
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.727 ± 0.636
IleCys: 0.148 ± 0.105
IleAsp: 2.216 ± 0.387
IleGlu: 1.994 ± 0.464
IlePhe: 0.443 ± 0.185
IleGly: 3.841 ± 0.604
IleHis: 0.369 ± 0.165
IleIle: 0.96 ± 0.27
IleLys: 0.96 ± 0.259
IleLeu: 2.29 ± 0.507
IleMet: 0.369 ± 0.169
IleAsn: 0.591 ± 0.193
IlePro: 1.773 ± 0.424
IleGln: 1.034 ± 0.31
IleArg: 3.915 ± 0.623
IleSer: 1.182 ± 0.324
IleThr: 3.841 ± 0.612
IleVal: 3.471 ± 0.517
IleTrp: 0.443 ± 0.168
IleTyr: 0.812 ± 0.221
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.949 ± 0.945
LysCys: 0.074 ± 0.076
LysAsp: 2.068 ± 0.643
LysGlu: 1.699 ± 0.456
LysPhe: 0.739 ± 0.265
LysGly: 2.733 ± 0.683
LysHis: 0.591 ± 0.26
LysIle: 1.329 ± 0.368
LysLys: 1.256 ± 0.303
LysLeu: 2.585 ± 0.436
LysMet: 0.739 ± 0.283
LysAsn: 0.886 ± 0.244
LysPro: 2.142 ± 0.407
LysGln: 0.443 ± 0.191
LysArg: 2.437 ± 0.462
LysSer: 1.994 ± 0.366
LysThr: 2.142 ± 0.431
LysVal: 3.619 ± 0.645
LysTrp: 0.812 ± 0.227
LysTyr: 0.812 ± 0.223
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.931 ± 1.38
LeuCys: 0.591 ± 0.253
LeuAsp: 5.392 ± 0.724
LeuGlu: 3.471 ± 0.565
LeuPhe: 1.477 ± 0.444
LeuGly: 7.977 ± 0.647
LeuHis: 1.403 ± 0.469
LeuIle: 2.364 ± 0.394
LeuLys: 2.585 ± 0.629
LeuLeu: 6.13 ± 0.724
LeuMet: 1.256 ± 0.354
LeuAsn: 2.142 ± 0.377
LeuPro: 6.057 ± 0.618
LeuGln: 1.92 ± 0.307
LeuArg: 6.5 ± 0.836
LeuSer: 5.983 ± 0.754
LeuThr: 5.909 ± 0.767
LeuVal: 6.278 ± 0.651
LeuTrp: 2.216 ± 0.427
LeuTyr: 1.773 ± 0.349
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.733 ± 0.53
MetCys: 0.074 ± 0.081
MetAsp: 0.96 ± 0.319
MetGlu: 0.739 ± 0.3
MetPhe: 0.369 ± 0.173
MetGly: 1.847 ± 0.319
MetHis: 0.369 ± 0.209
MetIle: 0.739 ± 0.251
MetLys: 0.443 ± 0.202
MetLeu: 1.329 ± 0.373
MetMet: 0.295 ± 0.125
MetAsn: 0.369 ± 0.156
MetPro: 1.329 ± 0.293
MetGln: 0.739 ± 0.239
MetArg: 1.773 ± 0.338
MetSer: 1.329 ± 0.281
MetThr: 1.699 ± 0.438
MetVal: 1.329 ± 0.347
MetTrp: 0.591 ± 0.194
MetTyr: 0.148 ± 0.102
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.693 ± 0.465
AsnCys: 0.295 ± 0.183
AsnAsp: 1.403 ± 0.295
AsnGlu: 1.403 ± 0.288
AsnPhe: 0.369 ± 0.184
AsnGly: 3.545 ± 0.755
AsnHis: 0.148 ± 0.113
AsnIle: 0.96 ± 0.245
AsnLys: 0.96 ± 0.368
AsnLeu: 1.92 ± 0.473
AsnMet: 0.222 ± 0.127
AsnAsn: 0.665 ± 0.273
AsnPro: 2.659 ± 0.465
AsnGln: 0.591 ± 0.19
AsnArg: 1.477 ± 0.296
AsnSer: 0.665 ± 0.304
AsnThr: 2.29 ± 0.707
AsnVal: 1.773 ± 0.484
AsnTrp: 0.295 ± 0.135
AsnTyr: 0.295 ± 0.128
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.312 ± 0.724
ProCys: 0.369 ± 0.163
ProAsp: 4.875 ± 0.883
ProGlu: 3.324 ± 0.617
ProPhe: 1.034 ± 0.239
ProGly: 6.204 ± 0.592
ProHis: 0.739 ± 0.327
ProIle: 2.068 ± 0.692
ProLys: 1.477 ± 0.309
ProLeu: 4.21 ± 0.636
ProMet: 0.96 ± 0.289
ProAsn: 1.92 ± 0.473
ProPro: 2.954 ± 0.472
ProGln: 1.477 ± 0.426
ProArg: 2.881 ± 0.488
ProSer: 2.511 ± 0.46
ProThr: 3.767 ± 0.479
ProVal: 5.909 ± 0.688
ProTrp: 0.443 ± 0.165
ProTyr: 1.92 ± 0.395
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.506 ± 0.541
GlnCys: 0.295 ± 0.133
GlnAsp: 1.034 ± 0.308
GlnGlu: 1.625 ± 0.39
GlnPhe: 1.182 ± 0.27
GlnGly: 2.807 ± 0.522
GlnHis: 0.369 ± 0.194
GlnIle: 1.034 ± 0.337
GlnLys: 0.739 ± 0.272
GlnLeu: 2.511 ± 0.457
GlnMet: 0.517 ± 0.234
GlnAsn: 0.96 ± 0.349
GlnPro: 1.551 ± 0.325
GlnGln: 0.96 ± 0.248
GlnArg: 2.511 ± 0.48
GlnSer: 1.625 ± 0.316
GlnThr: 2.511 ± 0.396
GlnVal: 2.733 ± 0.606
GlnTrp: 0.812 ± 0.265
GlnTyr: 0.96 ± 0.288
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.272 ± 0.783
ArgCys: 0.369 ± 0.202
ArgAsp: 3.693 ± 0.527
ArgGlu: 3.398 ± 0.628
ArgPhe: 1.329 ± 0.265
ArgGly: 6.795 ± 0.773
ArgHis: 1.256 ± 0.299
ArgIle: 3.102 ± 0.475
ArgLys: 2.807 ± 0.33
ArgLeu: 7.46 ± 0.742
ArgMet: 1.551 ± 0.4
ArgAsn: 1.994 ± 0.352
ArgPro: 3.767 ± 0.656
ArgGln: 2.659 ± 0.524
ArgArg: 5.835 ± 1.009
ArgSer: 3.841 ± 0.496
ArgThr: 4.358 ± 0.656
ArgVal: 5.54 ± 0.941
ArgTrp: 1.477 ± 0.364
ArgTyr: 2.364 ± 0.518
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.312 ± 0.705
SerCys: 0.148 ± 0.101
SerAsp: 3.25 ± 0.464
SerGlu: 2.585 ± 0.415
SerPhe: 1.182 ± 0.303
SerGly: 6.352 ± 0.685
SerHis: 0.739 ± 0.218
SerIle: 2.437 ± 0.56
SerLys: 2.29 ± 0.465
SerLeu: 4.653 ± 0.696
SerMet: 1.403 ± 0.336
SerAsn: 1.108 ± 0.264
SerPro: 2.585 ± 0.441
SerGln: 1.551 ± 0.349
SerArg: 3.028 ± 0.498
SerSer: 3.324 ± 0.49
SerThr: 4.284 ± 0.757
SerVal: 3.471 ± 0.481
SerTrp: 1.699 ± 0.437
SerTyr: 1.256 ± 0.403
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.42 ± 0.922
ThrCys: 0.517 ± 0.201
ThrAsp: 5.318 ± 0.692
ThrGlu: 3.767 ± 0.478
ThrPhe: 2.437 ± 0.523
ThrGly: 7.017 ± 0.674
ThrHis: 1.034 ± 0.342
ThrIle: 1.994 ± 0.419
ThrLys: 1.699 ± 0.301
ThrLeu: 5.54 ± 0.575
ThrMet: 0.739 ± 0.259
ThrAsn: 1.551 ± 0.362
ThrPro: 4.653 ± 0.859
ThrGln: 1.847 ± 0.343
ThrArg: 4.506 ± 0.544
ThrSer: 3.324 ± 0.481
ThrThr: 3.619 ± 0.618
ThrVal: 6.204 ± 0.803
ThrTrp: 1.256 ± 0.378
ThrTyr: 2.364 ± 0.467
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.045 ± 0.905
ValCys: 0.369 ± 0.206
ValAsp: 6.647 ± 0.744
ValGlu: 5.909 ± 0.777
ValPhe: 1.847 ± 0.275
ValGly: 6.943 ± 0.876
ValHis: 1.256 ± 0.34
ValIle: 3.102 ± 0.416
ValLys: 3.176 ± 0.551
ValLeu: 6.426 ± 0.755
ValMet: 1.92 ± 0.284
ValAsn: 2.511 ± 0.455
ValPro: 4.801 ± 0.596
ValGln: 2.881 ± 0.379
ValArg: 4.875 ± 0.766
ValSer: 4.653 ± 0.684
ValThr: 5.318 ± 0.792
ValVal: 5.466 ± 0.695
ValTrp: 1.847 ± 0.354
ValTyr: 2.29 ± 0.419
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.29 ± 0.387
TrpCys: 0.148 ± 0.101
TrpAsp: 1.625 ± 0.329
TrpGlu: 0.665 ± 0.254
TrpPhe: 0.517 ± 0.168
TrpGly: 2.364 ± 0.352
TrpHis: 0.074 ± 0.081
TrpIle: 0.369 ± 0.147
TrpLys: 0.886 ± 0.373
TrpLeu: 2.511 ± 0.507
TrpMet: 0.517 ± 0.285
TrpAsn: 0.591 ± 0.206
TrpPro: 0.295 ± 0.148
TrpGln: 1.477 ± 0.397
TrpArg: 1.92 ± 0.459
TrpSer: 1.773 ± 0.321
TrpThr: 0.96 ± 0.256
TrpVal: 1.551 ± 0.306
TrpTrp: 0.739 ± 0.313
TrpTyr: 0.295 ± 0.156
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.693 ± 0.472
TyrCys: 0.074 ± 0.063
TyrAsp: 1.847 ± 0.464
TyrGlu: 1.477 ± 0.354
TyrPhe: 0.812 ± 0.261
TyrGly: 2.881 ± 0.374
TyrHis: 0.074 ± 0.081
TyrIle: 0.739 ± 0.241
TyrLys: 0.591 ± 0.22
TyrLeu: 1.994 ± 0.317
TyrMet: 0.443 ± 0.171
TyrAsn: 0.665 ± 0.229
TyrPro: 1.329 ± 0.287
TyrGln: 0.665 ± 0.222
TyrArg: 2.29 ± 0.44
TyrSer: 1.551 ± 0.513
TyrThr: 1.847 ± 0.383
TyrVal: 2.511 ± 0.59
TyrTrp: 0.148 ± 0.111
TyrTyr: 0.591 ± 0.263
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (13540 amino acids)
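
As a rough sanity check, a per mille value can be converted back into an approximate number of occurrences. The page does not state which denominator is used; the sketch below assumes it is the total number of amino acids (13540), an inference from the smallest non-zero entries (0.074‰ corresponds to about one occurrence under that assumption). The function name approx_count is hypothetical.

```python
TOTAL_AMINO_ACIDS = 13540  # from the statistics line of this page


def approx_count(permille, total=TOTAL_AMINO_ACIDS):
    """Convert a per mille frequency into an approximate occurrence count.

    Assumes (not confirmed by the page) that frequencies are normalized
    by the total number of amino acids.
    """
    return permille / 1000.0 * total


# Values in per mille copied from the table.
single = approx_count(0.074)   # CysCys: close to 1 occurrence
most = approx_count(17.8)      # AlaAla: close to 241 occurrences
```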

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
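
A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the sample standard deviation of the replicates. The names pair_permille and bootstrap_error, the seed, and the toy sequences are all hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille, normalized by all dipeptides counted."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return stdev(estimates)


# Toy proteome in one-letter code (illustrative sequences only).
toy_proteome = ["AAGA", "GGAA", "AGAG"]
error = bootstrap_error(toy_proteome, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins.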

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski