Amino acid dipepetide frequency for Streptomyces phage Vondra

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
29.305AlaAla: 29.305 ± 2.235
1.551AlaCys: 1.551 ± 0.414
11.119AlaAsp: 11.119 ± 0.965
9.912AlaGlu: 9.912 ± 1.119
1.638AlaPhe: 1.638 ± 0.424
12.325AlaGly: 12.325 ± 1.844
3.792AlaHis: 3.792 ± 0.716
5.085AlaIle: 5.085 ± 0.713
2.844AlaLys: 2.844 ± 0.557
13.532AlaLeu: 13.532 ± 1.473
2.672AlaMet: 2.672 ± 0.435
2.069AlaAsn: 2.069 ± 0.485
8.188AlaPro: 8.188 ± 1.176
6.723AlaGln: 6.723 ± 0.97
12.67AlaArg: 12.67 ± 1.161
5.085AlaSer: 5.085 ± 0.756
10.688AlaThr: 10.688 ± 1.003
11.722AlaVal: 11.722 ± 1.135
2.413AlaTrp: 2.413 ± 0.649
3.275AlaTyr: 3.275 ± 0.419
0.0AlaXaa: 0.0 ± 0.0
Cys
1.293CysAla: 1.293 ± 0.423
0.0CysCys: 0.0 ± 0.0
0.69CysAsp: 0.69 ± 0.26
0.345CysGlu: 0.345 ± 0.166
0.0CysPhe: 0.0 ± 0.0
1.293CysGly: 1.293 ± 0.389
0.259CysHis: 0.259 ± 0.16
0.172CysIle: 0.172 ± 0.12
0.259CysLys: 0.259 ± 0.142
0.517CysLeu: 0.517 ± 0.213
0.172CysMet: 0.172 ± 0.108
0.172CysAsn: 0.172 ± 0.118
1.724CysPro: 1.724 ± 0.486
0.517CysGln: 0.517 ± 0.247
1.293CysArg: 1.293 ± 0.543
0.345CysSer: 0.345 ± 0.181
0.862CysThr: 0.862 ± 0.332
0.259CysVal: 0.259 ± 0.157
0.259CysTrp: 0.259 ± 0.153
0.086CysTyr: 0.086 ± 0.076
0.0CysXaa: 0.0 ± 0.0
Asp
10.86AspAla: 10.86 ± 0.944
0.517AspCys: 0.517 ± 0.266
5.689AspAsp: 5.689 ± 0.802
3.879AspGlu: 3.879 ± 0.744
1.207AspPhe: 1.207 ± 0.272
7.154AspGly: 7.154 ± 0.734
1.638AspHis: 1.638 ± 0.408
1.638AspIle: 1.638 ± 0.35
1.12AspLys: 1.12 ± 0.356
4.913AspLeu: 4.913 ± 0.713
0.69AspMet: 0.69 ± 0.224
0.776AspAsn: 0.776 ± 0.27
5.689AspPro: 5.689 ± 0.823
3.189AspGln: 3.189 ± 0.542
6.12AspArg: 6.12 ± 0.935
2.5AspSer: 2.5 ± 0.411
3.792AspThr: 3.792 ± 0.597
2.844AspVal: 2.844 ± 0.506
0.603AspTrp: 0.603 ± 0.28
1.638AspTyr: 1.638 ± 0.375
0.0AspXaa: 0.0 ± 0.0
Glu
7.326GluAla: 7.326 ± 0.812
1.207GluCys: 1.207 ± 0.386
3.448GluAsp: 3.448 ± 0.56
2.241GluGlu: 2.241 ± 0.413
0.776GluPhe: 0.776 ± 0.209
3.103GluGly: 3.103 ± 0.468
1.12GluHis: 1.12 ± 0.366
1.379GluIle: 1.379 ± 0.349
1.465GluLys: 1.465 ± 0.438
4.654GluLeu: 4.654 ± 0.726
1.982GluMet: 1.982 ± 0.387
0.431GluAsn: 0.431 ± 0.182
2.5GluPro: 2.5 ± 0.456
3.361GluGln: 3.361 ± 0.656
4.482GluArg: 4.482 ± 0.646
2.327GluSer: 2.327 ± 0.338
3.017GluThr: 3.017 ± 0.452
3.706GluVal: 3.706 ± 0.662
1.379GluTrp: 1.379 ± 0.321
0.862GluTyr: 0.862 ± 0.232
0.0GluXaa: 0.0 ± 0.0
Phe
2.586PheAla: 2.586 ± 0.416
0.172PheCys: 0.172 ± 0.109
1.12PheAsp: 1.12 ± 0.413
0.603PheGlu: 0.603 ± 0.203
0.0PhePhe: 0.0 ± 0.0
1.379PheGly: 1.379 ± 0.419
0.517PheHis: 0.517 ± 0.205
0.776PheIle: 0.776 ± 0.216
0.517PheLys: 0.517 ± 0.181
0.776PheLeu: 0.776 ± 0.265
0.431PheMet: 0.431 ± 0.185
0.086PheAsn: 0.086 ± 0.073
0.69PhePro: 0.69 ± 0.299
0.431PheGln: 0.431 ± 0.233
1.207PheArg: 1.207 ± 0.265
0.69PheSer: 0.69 ± 0.185
1.465PheThr: 1.465 ± 0.354
0.603PheVal: 0.603 ± 0.218
0.345PheTrp: 0.345 ± 0.167
0.259PheTyr: 0.259 ± 0.142
0.0PheXaa: 0.0 ± 0.0
Gly
9.998GlyAla: 9.998 ± 0.906
0.948GlyCys: 0.948 ± 0.337
5.085GlyAsp: 5.085 ± 0.591
3.448GlyGlu: 3.448 ± 0.57
1.379GlyPhe: 1.379 ± 0.268
6.464GlyGly: 6.464 ± 0.787
1.896GlyHis: 1.896 ± 0.447
3.879GlyIle: 3.879 ± 0.783
2.5GlyLys: 2.5 ± 0.592
7.24GlyLeu: 7.24 ± 1.313
1.293GlyMet: 1.293 ± 0.307
1.12GlyAsn: 1.12 ± 0.328
5.344GlyPro: 5.344 ± 0.792
3.879GlyGln: 3.879 ± 0.593
8.878GlyArg: 8.878 ± 0.708
4.31GlySer: 4.31 ± 0.76
6.206GlyThr: 6.206 ± 0.939
4.827GlyVal: 4.827 ± 0.554
1.982GlyTrp: 1.982 ± 0.464
1.982GlyTyr: 1.982 ± 0.384
0.0GlyXaa: 0.0 ± 0.0
His
2.069HisAla: 2.069 ± 0.386
0.172HisCys: 0.172 ± 0.13
1.12HisAsp: 1.12 ± 0.303
1.12HisGlu: 1.12 ± 0.321
0.776HisPhe: 0.776 ± 0.261
1.896HisGly: 1.896 ± 0.486
0.431HisHis: 0.431 ± 0.227
0.517HisIle: 0.517 ± 0.222
0.0HisLys: 0.0 ± 0.0
2.327HisLeu: 2.327 ± 0.496
0.431HisMet: 0.431 ± 0.194
0.172HisAsn: 0.172 ± 0.113
1.465HisPro: 1.465 ± 0.399
0.948HisGln: 0.948 ± 0.26
2.327HisArg: 2.327 ± 0.478
0.862HisSer: 0.862 ± 0.257
1.81HisThr: 1.81 ± 0.427
1.638HisVal: 1.638 ± 0.348
0.603HisTrp: 0.603 ± 0.318
0.431HisTyr: 0.431 ± 0.194
0.0HisXaa: 0.0 ± 0.0
Ile
5.861IleAla: 5.861 ± 0.893
0.172IleCys: 0.172 ± 0.118
1.982IleAsp: 1.982 ± 0.444
2.327IleGlu: 2.327 ± 0.526
0.259IlePhe: 0.259 ± 0.135
2.327IleGly: 2.327 ± 0.424
0.345IleHis: 0.345 ± 0.159
0.862IleIle: 0.862 ± 0.24
0.69IleLys: 0.69 ± 0.262
2.069IleLeu: 2.069 ± 0.474
0.345IleMet: 0.345 ± 0.143
0.862IleAsn: 0.862 ± 0.257
1.896IlePro: 1.896 ± 0.408
0.69IleGln: 0.69 ± 0.275
3.448IleArg: 3.448 ± 0.51
1.465IleSer: 1.465 ± 0.542
5.172IleThr: 5.172 ± 0.683
2.241IleVal: 2.241 ± 0.505
0.259IleTrp: 0.259 ± 0.127
0.517IleTyr: 0.517 ± 0.206
0.0IleXaa: 0.0 ± 0.0
Lys
4.137LysAla: 4.137 ± 0.912
0.172LysCys: 0.172 ± 0.106
0.948LysAsp: 0.948 ± 0.389
0.603LysGlu: 0.603 ± 0.208
0.603LysPhe: 0.603 ± 0.239
1.896LysGly: 1.896 ± 0.642
0.517LysHis: 0.517 ± 0.191
1.379LysIle: 1.379 ± 0.491
1.12LysLys: 1.12 ± 0.411
1.293LysLeu: 1.293 ± 0.497
0.776LysMet: 0.776 ± 0.325
0.431LysAsn: 0.431 ± 0.178
1.034LysPro: 1.034 ± 0.403
1.12LysGln: 1.12 ± 0.309
2.069LysArg: 2.069 ± 0.535
1.207LysSer: 1.207 ± 0.332
1.551LysThr: 1.551 ± 0.406
1.896LysVal: 1.896 ± 0.42
0.69LysTrp: 0.69 ± 0.194
0.517LysTyr: 0.517 ± 0.216
0.0LysXaa: 0.0 ± 0.0
Leu
13.101LeuAla: 13.101 ± 1.017
0.862LeuCys: 0.862 ± 0.296
8.705LeuAsp: 8.705 ± 0.786
2.672LeuGlu: 2.672 ± 0.429
1.724LeuPhe: 1.724 ± 0.313
6.206LeuGly: 6.206 ± 0.845
1.551LeuHis: 1.551 ± 0.385
2.241LeuIle: 2.241 ± 0.339
1.896LeuLys: 1.896 ± 0.626
7.93LeuLeu: 7.93 ± 0.964
1.379LeuMet: 1.379 ± 0.417
1.207LeuAsn: 1.207 ± 0.307
4.827LeuPro: 4.827 ± 0.735
1.896LeuGln: 1.896 ± 0.495
7.154LeuArg: 7.154 ± 0.858
3.879LeuSer: 3.879 ± 0.471
5.947LeuThr: 5.947 ± 0.851
4.654LeuVal: 4.654 ± 0.551
1.207LeuTrp: 1.207 ± 0.318
1.724LeuTyr: 1.724 ± 0.34
0.0LeuXaa: 0.0 ± 0.0
Met
2.672MetAla: 2.672 ± 0.468
0.172MetCys: 0.172 ± 0.115
0.862MetAsp: 0.862 ± 0.321
0.69MetGlu: 0.69 ± 0.203
0.086MetPhe: 0.086 ± 0.087
1.465MetGly: 1.465 ± 0.265
0.431MetHis: 0.431 ± 0.223
0.431MetIle: 0.431 ± 0.181
0.259MetLys: 0.259 ± 0.12
1.293MetLeu: 1.293 ± 0.293
0.603MetMet: 0.603 ± 0.216
0.776MetAsn: 0.776 ± 0.24
1.207MetPro: 1.207 ± 0.317
0.69MetGln: 0.69 ± 0.267
2.5MetArg: 2.5 ± 0.419
1.982MetSer: 1.982 ± 0.445
2.069MetThr: 2.069 ± 0.472
0.603MetVal: 0.603 ± 0.218
0.603MetTrp: 0.603 ± 0.255
0.431MetTyr: 0.431 ± 0.2
0.0MetXaa: 0.0 ± 0.0
Asn
2.586AsnAla: 2.586 ± 0.46
0.172AsnCys: 0.172 ± 0.114
0.603AsnAsp: 0.603 ± 0.192
0.69AsnGlu: 0.69 ± 0.223
0.259AsnPhe: 0.259 ± 0.134
1.12AsnGly: 1.12 ± 0.281
0.172AsnHis: 0.172 ± 0.125
0.431AsnIle: 0.431 ± 0.176
0.431AsnLys: 0.431 ± 0.171
1.638AsnLeu: 1.638 ± 0.325
0.086AsnMet: 0.086 ± 0.087
0.259AsnAsn: 0.259 ± 0.137
1.724AsnPro: 1.724 ± 0.399
0.776AsnGln: 0.776 ± 0.255
1.379AsnArg: 1.379 ± 0.405
0.862AsnSer: 0.862 ± 0.341
1.379AsnThr: 1.379 ± 0.387
0.862AsnVal: 0.862 ± 0.268
0.172AsnTrp: 0.172 ± 0.131
0.345AsnTyr: 0.345 ± 0.164
0.0AsnXaa: 0.0 ± 0.0
Pro
10.084ProAla: 10.084 ± 0.992
0.345ProCys: 0.345 ± 0.173
4.396ProAsp: 4.396 ± 0.969
2.758ProGlu: 2.758 ± 0.401
0.69ProPhe: 0.69 ± 0.25
6.551ProGly: 6.551 ± 0.748
1.034ProHis: 1.034 ± 0.37
2.069ProIle: 2.069 ± 0.473
1.551ProLys: 1.551 ± 0.381
4.31ProLeu: 4.31 ± 0.93
1.638ProMet: 1.638 ± 0.352
1.207ProAsn: 1.207 ± 0.359
4.31ProPro: 4.31 ± 0.82
2.155ProGln: 2.155 ± 0.385
5.172ProArg: 5.172 ± 0.821
3.448ProSer: 3.448 ± 0.556
5.861ProThr: 5.861 ± 0.945
3.965ProVal: 3.965 ± 0.54
1.12ProTrp: 1.12 ± 0.245
1.982ProTyr: 1.982 ± 0.455
0.0ProXaa: 0.0 ± 0.0
Gln
6.464GlnAla: 6.464 ± 1.006
0.086GlnCys: 0.086 ± 0.097
0.69GlnAsp: 0.69 ± 0.216
1.81GlnGlu: 1.81 ± 0.426
0.862GlnPhe: 0.862 ± 0.376
3.534GlnGly: 3.534 ± 0.961
0.603GlnHis: 0.603 ± 0.171
1.293GlnIle: 1.293 ± 0.36
1.207GlnLys: 1.207 ± 0.423
4.223GlnLeu: 4.223 ± 0.529
1.293GlnMet: 1.293 ± 0.265
0.345GlnAsn: 0.345 ± 0.165
2.672GlnPro: 2.672 ± 0.484
2.672GlnGln: 2.672 ± 1.142
3.534GlnArg: 3.534 ± 0.816
1.379GlnSer: 1.379 ± 0.364
3.448GlnThr: 3.448 ± 0.551
1.81GlnVal: 1.81 ± 0.319
0.862GlnTrp: 0.862 ± 0.276
0.776GlnTyr: 0.776 ± 0.297
0.0GlnXaa: 0.0 ± 0.0
Arg
14.049ArgAla: 14.049 ± 1.312
1.379ArgCys: 1.379 ± 0.531
4.654ArgAsp: 4.654 ± 0.577
4.999ArgGlu: 4.999 ± 0.945
1.896ArgPhe: 1.896 ± 0.39
5.172ArgGly: 5.172 ± 0.779
2.155ArgHis: 2.155 ± 0.504
3.62ArgIle: 3.62 ± 0.577
1.982ArgLys: 1.982 ± 0.483
8.705ArgLeu: 8.705 ± 0.867
1.982ArgMet: 1.982 ± 0.402
1.982ArgAsn: 1.982 ± 0.416
5.947ArgPro: 5.947 ± 0.825
3.534ArgGln: 3.534 ± 0.541
9.481ArgArg: 9.481 ± 1.272
4.827ArgSer: 4.827 ± 0.566
6.206ArgThr: 6.206 ± 0.7
5.947ArgVal: 5.947 ± 0.59
1.379ArgTrp: 1.379 ± 0.373
2.069ArgTyr: 2.069 ± 0.394
0.0ArgXaa: 0.0 ± 0.0
Ser
6.464SerAla: 6.464 ± 0.973
0.345SerCys: 0.345 ± 0.215
3.189SerAsp: 3.189 ± 0.452
2.241SerGlu: 2.241 ± 0.384
0.517SerPhe: 0.517 ± 0.211
4.568SerGly: 4.568 ± 0.826
0.172SerHis: 0.172 ± 0.122
1.034SerIle: 1.034 ± 0.272
1.724SerLys: 1.724 ± 0.401
3.879SerLeu: 3.879 ± 0.669
1.12SerMet: 1.12 ± 0.292
0.603SerAsn: 0.603 ± 0.242
3.189SerPro: 3.189 ± 0.645
1.465SerGln: 1.465 ± 0.585
4.482SerArg: 4.482 ± 0.621
1.81SerSer: 1.81 ± 0.394
3.103SerThr: 3.103 ± 0.467
2.844SerVal: 2.844 ± 0.513
0.862SerTrp: 0.862 ± 0.266
1.379SerTyr: 1.379 ± 0.356
0.0SerXaa: 0.0 ± 0.0
Thr
13.963ThrAla: 13.963 ± 0.951
0.776ThrCys: 0.776 ± 0.306
5.43ThrAsp: 5.43 ± 0.807
2.931ThrGlu: 2.931 ± 0.614
1.207ThrPhe: 1.207 ± 0.307
8.619ThrGly: 8.619 ± 0.921
1.638ThrHis: 1.638 ± 0.484
2.155ThrIle: 2.155 ± 0.444
1.896ThrLys: 1.896 ± 0.605
4.568ThrLeu: 4.568 ± 0.745
1.207ThrMet: 1.207 ± 0.294
2.241ThrAsn: 2.241 ± 0.401
6.551ThrPro: 6.551 ± 0.78
2.155ThrGln: 2.155 ± 0.41
4.396ThrArg: 4.396 ± 0.64
2.672ThrSer: 2.672 ± 0.438
6.895ThrThr: 6.895 ± 1.039
5.689ThrVal: 5.689 ± 0.865
2.069ThrTrp: 2.069 ± 0.492
1.379ThrTyr: 1.379 ± 0.411
0.0ThrXaa: 0.0 ± 0.0
Val
8.447ValAla: 8.447 ± 0.965
0.603ValCys: 0.603 ± 0.247
4.137ValAsp: 4.137 ± 0.666
4.741ValGlu: 4.741 ± 0.649
0.517ValPhe: 0.517 ± 0.236
4.482ValGly: 4.482 ± 0.748
1.724ValHis: 1.724 ± 0.509
3.448ValIle: 3.448 ± 0.559
1.379ValLys: 1.379 ± 0.459
3.879ValLeu: 3.879 ± 0.603
1.12ValMet: 1.12 ± 0.313
0.345ValAsn: 0.345 ± 0.19
4.223ValPro: 4.223 ± 0.771
2.155ValGln: 2.155 ± 0.447
6.12ValArg: 6.12 ± 0.724
2.672ValSer: 2.672 ± 0.457
5.602ValThr: 5.602 ± 0.742
3.189ValVal: 3.189 ± 0.523
0.948ValTrp: 0.948 ± 0.35
2.241ValTyr: 2.241 ± 0.582
0.0ValXaa: 0.0 ± 0.0
Trp
2.327TrpAla: 2.327 ± 0.393
0.345TrpCys: 0.345 ± 0.164
0.776TrpAsp: 0.776 ± 0.281
1.293TrpGlu: 1.293 ± 0.31
0.259TrpPhe: 0.259 ± 0.125
1.034TrpGly: 1.034 ± 0.288
0.776TrpHis: 0.776 ± 0.279
0.603TrpIle: 0.603 ± 0.271
0.603TrpLys: 0.603 ± 0.235
1.12TrpLeu: 1.12 ± 0.336
0.259TrpMet: 0.259 ± 0.144
0.345TrpAsn: 0.345 ± 0.174
0.603TrpPro: 0.603 ± 0.232
0.862TrpGln: 0.862 ± 0.245
2.241TrpArg: 2.241 ± 0.459
1.379TrpSer: 1.379 ± 0.421
2.241TrpThr: 2.241 ± 0.58
1.034TrpVal: 1.034 ± 0.344
0.603TrpTrp: 0.603 ± 0.191
0.259TrpTyr: 0.259 ± 0.128
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.017TyrAla: 3.017 ± 0.533
0.517TyrCys: 0.517 ± 0.24
1.81TyrAsp: 1.81 ± 0.397
1.465TyrGlu: 1.465 ± 0.424
0.0TyrPhe: 0.0 ± 0.0
2.069TyrGly: 2.069 ± 0.485
0.345TyrHis: 0.345 ± 0.19
0.776TyrIle: 0.776 ± 0.3
0.517TyrLys: 0.517 ± 0.192
1.896TyrLeu: 1.896 ± 0.436
0.259TyrMet: 0.259 ± 0.147
0.517TyrAsn: 0.517 ± 0.221
0.862TyrPro: 0.862 ± 0.328
0.345TyrGln: 0.345 ± 0.166
3.103TyrArg: 3.103 ± 0.606
1.293TyrSer: 1.293 ± 0.335
1.12TyrThr: 1.12 ± 0.243
1.638TyrVal: 1.638 ± 0.393
0.603TyrTrp: 0.603 ± 0.208
0.259TyrTyr: 0.259 ± 0.157
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (11603 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski