Amino acid dipepetide frequency for Streptococcus phage Javan68

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.647AlaAla: 4.647 ± 0.957
0.447AlaCys: 0.447 ± 0.169
4.468AlaAsp: 4.468 ± 0.674
6.792AlaGlu: 6.792 ± 0.8
2.234AlaPhe: 2.234 ± 0.528
5.362AlaGly: 5.362 ± 0.815
1.162AlaHis: 1.162 ± 0.411
6.345AlaIle: 6.345 ± 0.794
7.149AlaLys: 7.149 ± 1.101
7.507AlaLeu: 7.507 ± 1.042
1.519AlaMet: 1.519 ± 0.276
4.468AlaAsn: 4.468 ± 0.788
2.234AlaPro: 2.234 ± 0.332
4.021AlaGln: 4.021 ± 0.738
4.647AlaArg: 4.647 ± 0.708
4.826AlaSer: 4.826 ± 0.676
5.094AlaThr: 5.094 ± 1.12
4.111AlaVal: 4.111 ± 0.607
1.34AlaTrp: 1.34 ± 0.485
2.502AlaTyr: 2.502 ± 0.506
0.0AlaXaa: 0.0 ± 0.0
Cys
0.357CysAla: 0.357 ± 0.177
0.0CysCys: 0.0 ± 0.0
0.179CysAsp: 0.179 ± 0.13
0.447CysGlu: 0.447 ± 0.197
0.447CysPhe: 0.447 ± 0.23
0.804CysGly: 0.804 ± 0.401
0.179CysHis: 0.179 ± 0.106
0.0CysIle: 0.0 ± 0.0
0.715CysLys: 0.715 ± 0.253
0.179CysLeu: 0.179 ± 0.144
0.179CysMet: 0.179 ± 0.153
0.447CysAsn: 0.447 ± 0.203
0.179CysPro: 0.179 ± 0.113
0.268CysGln: 0.268 ± 0.157
0.268CysArg: 0.268 ± 0.172
0.536CysSer: 0.536 ± 0.227
0.268CysThr: 0.268 ± 0.16
0.268CysVal: 0.268 ± 0.164
0.089CysTrp: 0.089 ± 0.097
0.268CysTyr: 0.268 ± 0.212
0.0CysXaa: 0.0 ± 0.0
Asp
3.396AspAla: 3.396 ± 0.47
0.626AspCys: 0.626 ± 0.29
3.396AspAsp: 3.396 ± 0.721
4.736AspGlu: 4.736 ± 0.976
2.681AspPhe: 2.681 ± 0.605
4.736AspGly: 4.736 ± 0.822
0.268AspHis: 0.268 ± 0.172
3.217AspIle: 3.217 ± 0.857
5.451AspLys: 5.451 ± 0.624
4.468AspLeu: 4.468 ± 0.601
1.251AspMet: 1.251 ± 0.468
3.217AspAsn: 3.217 ± 0.622
1.877AspPro: 1.877 ± 0.368
1.43AspGln: 1.43 ± 0.332
2.949AspArg: 2.949 ± 0.519
3.664AspSer: 3.664 ± 0.44
4.021AspThr: 4.021 ± 0.543
3.396AspVal: 3.396 ± 0.491
1.519AspTrp: 1.519 ± 0.414
3.575AspTyr: 3.575 ± 0.766
0.0AspXaa: 0.0 ± 0.0
Glu
5.719GluAla: 5.719 ± 0.962
0.536GluCys: 0.536 ± 0.265
3.396GluAsp: 3.396 ± 0.49
6.613GluGlu: 6.613 ± 0.8
2.77GluPhe: 2.77 ± 0.442
4.29GluGly: 4.29 ± 0.804
0.804GluHis: 0.804 ± 0.248
6.256GluIle: 6.256 ± 0.745
7.328GluLys: 7.328 ± 0.896
8.937GluLeu: 8.937 ± 0.965
2.502GluMet: 2.502 ± 0.57
3.485GluAsn: 3.485 ± 0.392
1.877GluPro: 1.877 ± 0.474
3.843GluGln: 3.843 ± 0.661
3.396GluArg: 3.396 ± 0.724
3.932GluSer: 3.932 ± 0.784
3.932GluThr: 3.932 ± 0.431
5.004GluVal: 5.004 ± 0.73
1.072GluTrp: 1.072 ± 0.295
2.502GluTyr: 2.502 ± 0.404
0.0GluXaa: 0.0 ± 0.0
Phe
2.502PheAla: 2.502 ± 0.399
0.179PheCys: 0.179 ± 0.155
3.038PheAsp: 3.038 ± 0.622
4.021PheGlu: 4.021 ± 0.617
1.877PhePhe: 1.877 ± 0.545
3.932PheGly: 3.932 ± 0.46
0.0PheHis: 0.0 ± 0.0
1.519PheIle: 1.519 ± 0.426
3.307PheLys: 3.307 ± 0.48
2.055PheLeu: 2.055 ± 0.468
0.894PheMet: 0.894 ± 0.321
1.877PheAsn: 1.877 ± 0.395
1.251PhePro: 1.251 ± 0.485
0.804PheGln: 0.804 ± 0.279
1.251PheArg: 1.251 ± 0.379
1.519PheSer: 1.519 ± 0.333
1.966PheThr: 1.966 ± 0.41
1.787PheVal: 1.787 ± 0.422
0.626PheTrp: 0.626 ± 0.235
1.877PheTyr: 1.877 ± 0.48
0.0PheXaa: 0.0 ± 0.0
Gly
4.558GlyAla: 4.558 ± 0.978
0.268GlyCys: 0.268 ± 0.148
4.826GlyAsp: 4.826 ± 0.717
3.843GlyGlu: 3.843 ± 0.621
2.145GlyPhe: 2.145 ± 0.368
4.558GlyGly: 4.558 ± 0.592
0.894GlyHis: 0.894 ± 0.247
4.379GlyIle: 4.379 ± 0.821
6.702GlyLys: 6.702 ± 0.72
7.328GlyLeu: 7.328 ± 1.002
2.324GlyMet: 2.324 ± 0.405
4.021GlyAsn: 4.021 ± 0.576
0.894GlyPro: 0.894 ± 0.296
2.77GlyGln: 2.77 ± 0.625
2.681GlyArg: 2.681 ± 0.49
3.485GlySer: 3.485 ± 0.438
2.77GlyThr: 2.77 ± 0.582
4.558GlyVal: 4.558 ± 0.514
1.609GlyTrp: 1.609 ± 0.386
2.77GlyTyr: 2.77 ± 0.525
0.0GlyXaa: 0.0 ± 0.0
His
1.162HisAla: 1.162 ± 0.299
0.179HisCys: 0.179 ± 0.106
0.626HisAsp: 0.626 ± 0.253
0.804HisGlu: 0.804 ± 0.248
0.804HisPhe: 0.804 ± 0.26
0.894HisGly: 0.894 ± 0.301
0.447HisHis: 0.447 ± 0.201
0.894HisIle: 0.894 ± 0.299
0.894HisLys: 0.894 ± 0.307
0.715HisLeu: 0.715 ± 0.265
0.089HisMet: 0.089 ± 0.087
0.715HisAsn: 0.715 ± 0.24
0.447HisPro: 0.447 ± 0.164
0.536HisGln: 0.536 ± 0.236
0.447HisArg: 0.447 ± 0.22
0.983HisSer: 0.983 ± 0.281
0.983HisThr: 0.983 ± 0.328
0.536HisVal: 0.536 ± 0.193
0.268HisTrp: 0.268 ± 0.184
0.447HisTyr: 0.447 ± 0.18
0.0HisXaa: 0.0 ± 0.0
Ile
6.077IleAla: 6.077 ± 0.858
0.626IleCys: 0.626 ± 0.253
4.111IleAsp: 4.111 ± 0.817
5.362IleGlu: 5.362 ± 0.803
1.43IlePhe: 1.43 ± 0.352
3.038IleGly: 3.038 ± 0.552
0.804IleHis: 0.804 ± 0.247
3.843IleIle: 3.843 ± 0.74
4.826IleLys: 4.826 ± 0.571
4.647IleLeu: 4.647 ± 0.643
0.983IleMet: 0.983 ± 0.406
3.575IleAsn: 3.575 ± 0.515
2.055IlePro: 2.055 ± 0.563
1.519IleGln: 1.519 ± 0.438
2.502IleArg: 2.502 ± 0.513
3.843IleSer: 3.843 ± 0.544
4.021IleThr: 4.021 ± 0.7
3.307IleVal: 3.307 ± 0.547
0.715IleTrp: 0.715 ± 0.2
2.502IleTyr: 2.502 ± 0.496
0.0IleXaa: 0.0 ± 0.0
Lys
7.596LysAla: 7.596 ± 0.986
0.179LysCys: 0.179 ± 0.141
4.379LysAsp: 4.379 ± 0.632
6.524LysGlu: 6.524 ± 0.652
2.77LysPhe: 2.77 ± 0.448
5.809LysGly: 5.809 ± 0.77
1.251LysHis: 1.251 ± 0.265
3.843LysIle: 3.843 ± 0.759
8.222LysLys: 8.222 ± 0.882
5.541LysLeu: 5.541 ± 0.67
2.413LysMet: 2.413 ± 0.378
5.451LysAsn: 5.451 ± 0.772
2.145LysPro: 2.145 ± 0.559
4.021LysGln: 4.021 ± 0.576
4.379LysArg: 4.379 ± 0.76
3.843LysSer: 3.843 ± 0.569
6.613LysThr: 6.613 ± 0.983
5.63LysVal: 5.63 ± 0.835
0.983LysTrp: 0.983 ± 0.279
2.77LysTyr: 2.77 ± 0.471
0.0LysXaa: 0.0 ± 0.0
Leu
7.507LeuAla: 7.507 ± 1.026
0.268LeuCys: 0.268 ± 0.182
5.719LeuAsp: 5.719 ± 0.735
7.775LeuGlu: 7.775 ± 0.935
2.145LeuPhe: 2.145 ± 0.459
6.256LeuGly: 6.256 ± 0.886
0.983LeuHis: 0.983 ± 0.297
4.2LeuIle: 4.2 ± 0.648
6.613LeuLys: 6.613 ± 0.907
6.971LeuLeu: 6.971 ± 0.858
1.162LeuMet: 1.162 ± 0.332
5.183LeuAsn: 5.183 ± 0.631
3.396LeuPro: 3.396 ± 0.604
3.128LeuGln: 3.128 ± 0.503
4.2LeuArg: 4.2 ± 0.6
5.183LeuSer: 5.183 ± 0.587
4.558LeuThr: 4.558 ± 0.783
5.004LeuVal: 5.004 ± 0.54
0.179LeuTrp: 0.179 ± 0.112
2.413LeuTyr: 2.413 ± 0.456
0.0LeuXaa: 0.0 ± 0.0
Met
2.413MetAla: 2.413 ± 0.577
0.089MetCys: 0.089 ± 0.093
1.519MetAsp: 1.519 ± 0.419
1.072MetGlu: 1.072 ± 0.283
1.072MetPhe: 1.072 ± 0.329
0.983MetGly: 0.983 ± 0.334
0.447MetHis: 0.447 ± 0.192
1.519MetIle: 1.519 ± 0.401
1.609MetLys: 1.609 ± 0.364
1.698MetLeu: 1.698 ± 0.316
0.0MetMet: 0.0 ± 0.0
1.162MetAsn: 1.162 ± 0.382
0.894MetPro: 0.894 ± 0.29
1.43MetGln: 1.43 ± 0.3
0.894MetArg: 0.894 ± 0.247
1.787MetSer: 1.787 ± 0.404
2.324MetThr: 2.324 ± 0.586
1.162MetVal: 1.162 ± 0.248
0.536MetTrp: 0.536 ± 0.184
0.983MetTyr: 0.983 ± 0.321
0.0MetXaa: 0.0 ± 0.0
Asn
6.792AsnAla: 6.792 ± 1.016
0.089AsnCys: 0.089 ± 0.096
2.949AsnAsp: 2.949 ± 0.571
2.681AsnGlu: 2.681 ± 0.446
1.966AsnPhe: 1.966 ± 0.279
5.451AsnGly: 5.451 ± 0.574
0.715AsnHis: 0.715 ± 0.229
2.77AsnIle: 2.77 ± 0.591
4.468AsnLys: 4.468 ± 0.667
4.2AsnLeu: 4.2 ± 0.536
1.787AsnMet: 1.787 ± 0.309
3.307AsnAsn: 3.307 ± 0.491
2.234AsnPro: 2.234 ± 0.501
2.324AsnGln: 2.324 ± 0.517
1.34AsnArg: 1.34 ± 0.287
4.736AsnSer: 4.736 ± 0.689
2.145AsnThr: 2.145 ± 0.466
3.396AsnVal: 3.396 ± 0.44
0.804AsnTrp: 0.804 ± 0.358
3.038AsnTyr: 3.038 ± 0.494
0.0AsnXaa: 0.0 ± 0.0
Pro
2.234ProAla: 2.234 ± 0.468
0.0ProCys: 0.0 ± 0.0
2.413ProAsp: 2.413 ± 0.647
3.128ProGlu: 3.128 ± 0.412
1.251ProPhe: 1.251 ± 0.324
0.894ProGly: 0.894 ± 0.276
0.357ProHis: 0.357 ± 0.189
1.43ProIle: 1.43 ± 0.358
2.234ProLys: 2.234 ± 0.399
2.949ProLeu: 2.949 ± 0.511
0.626ProMet: 0.626 ± 0.252
2.055ProAsn: 2.055 ± 0.538
0.804ProPro: 0.804 ± 0.294
1.609ProGln: 1.609 ± 0.473
1.162ProArg: 1.162 ± 0.426
2.413ProSer: 2.413 ± 0.524
2.413ProThr: 2.413 ± 0.378
1.787ProVal: 1.787 ± 0.726
0.089ProTrp: 0.089 ± 0.09
1.162ProTyr: 1.162 ± 0.262
0.0ProXaa: 0.0 ± 0.0
Gln
3.217GlnAla: 3.217 ± 0.523
0.179GlnCys: 0.179 ± 0.118
1.698GlnAsp: 1.698 ± 0.337
3.038GlnGlu: 3.038 ± 0.656
1.966GlnPhe: 1.966 ± 0.402
2.681GlnGly: 2.681 ± 0.5
0.536GlnHis: 0.536 ± 0.203
3.217GlnIle: 3.217 ± 0.468
3.217GlnLys: 3.217 ± 0.463
3.664GlnLeu: 3.664 ± 0.514
1.251GlnMet: 1.251 ± 0.389
2.502GlnAsn: 2.502 ± 0.398
1.072GlnPro: 1.072 ± 0.338
2.502GlnGln: 2.502 ± 0.781
1.519GlnArg: 1.519 ± 0.386
2.77GlnSer: 2.77 ± 0.443
3.128GlnThr: 3.128 ± 0.537
2.234GlnVal: 2.234 ± 0.509
0.357GlnTrp: 0.357 ± 0.138
1.43GlnTyr: 1.43 ± 0.287
0.0GlnXaa: 0.0 ± 0.0
Arg
3.038ArgAla: 3.038 ± 0.645
0.089ArgCys: 0.089 ± 0.083
2.324ArgAsp: 2.324 ± 0.538
3.396ArgGlu: 3.396 ± 0.413
2.055ArgPhe: 2.055 ± 0.436
2.413ArgGly: 2.413 ± 0.489
1.34ArgHis: 1.34 ± 0.381
2.86ArgIle: 2.86 ± 0.601
3.843ArgLys: 3.843 ± 0.883
3.038ArgLeu: 3.038 ± 0.545
1.162ArgMet: 1.162 ± 0.275
2.86ArgAsn: 2.86 ± 0.511
0.894ArgPro: 0.894 ± 0.294
1.698ArgGln: 1.698 ± 0.449
1.43ArgArg: 1.43 ± 0.36
1.698ArgSer: 1.698 ± 0.353
1.698ArgThr: 1.698 ± 0.358
4.021ArgVal: 4.021 ± 0.648
0.894ArgTrp: 0.894 ± 0.297
2.234ArgTyr: 2.234 ± 0.532
0.0ArgXaa: 0.0 ± 0.0
Ser
5.719SerAla: 5.719 ± 0.712
0.268SerCys: 0.268 ± 0.201
4.558SerAsp: 4.558 ± 0.717
4.558SerGlu: 4.558 ± 0.487
2.86SerPhe: 2.86 ± 0.488
3.664SerGly: 3.664 ± 0.614
0.983SerHis: 0.983 ± 0.314
3.753SerIle: 3.753 ± 0.503
4.29SerLys: 4.29 ± 0.743
4.468SerLeu: 4.468 ± 0.509
1.34SerMet: 1.34 ± 0.341
2.592SerAsn: 2.592 ± 0.515
1.787SerPro: 1.787 ± 0.399
3.038SerGln: 3.038 ± 0.461
2.413SerArg: 2.413 ± 0.404
3.932SerSer: 3.932 ± 0.913
3.038SerThr: 3.038 ± 0.635
4.111SerVal: 4.111 ± 0.643
0.804SerTrp: 0.804 ± 0.183
2.324SerTyr: 2.324 ± 0.658
0.0SerXaa: 0.0 ± 0.0
Thr
5.273ThrAla: 5.273 ± 0.94
0.447ThrCys: 0.447 ± 0.229
2.86ThrAsp: 2.86 ± 0.485
4.915ThrGlu: 4.915 ± 0.747
2.234ThrPhe: 2.234 ± 0.531
4.736ThrGly: 4.736 ± 0.697
0.357ThrHis: 0.357 ± 0.158
3.753ThrIle: 3.753 ± 0.468
4.558ThrLys: 4.558 ± 1.046
5.004ThrLeu: 5.004 ± 0.588
1.519ThrMet: 1.519 ± 0.398
3.396ThrAsn: 3.396 ± 0.547
2.681ThrPro: 2.681 ± 0.587
2.234ThrGln: 2.234 ± 0.376
1.609ThrArg: 1.609 ± 0.305
4.111ThrSer: 4.111 ± 0.767
3.485ThrThr: 3.485 ± 0.679
3.664ThrVal: 3.664 ± 0.509
0.536ThrTrp: 0.536 ± 0.209
2.502ThrTyr: 2.502 ± 0.505
0.0ThrXaa: 0.0 ± 0.0
Val
5.451ValAla: 5.451 ± 0.5
0.715ValCys: 0.715 ± 0.445
3.753ValAsp: 3.753 ± 0.446
4.826ValGlu: 4.826 ± 0.655
1.609ValPhe: 1.609 ± 0.379
3.575ValGly: 3.575 ± 0.562
0.715ValHis: 0.715 ± 0.237
2.86ValIle: 2.86 ± 0.541
5.004ValLys: 5.004 ± 0.594
6.166ValLeu: 6.166 ± 0.639
0.983ValMet: 0.983 ± 0.262
3.664ValAsn: 3.664 ± 0.575
2.77ValPro: 2.77 ± 0.427
2.145ValGln: 2.145 ± 0.391
2.77ValArg: 2.77 ± 0.461
4.468ValSer: 4.468 ± 0.648
4.111ValThr: 4.111 ± 0.485
3.664ValVal: 3.664 ± 0.712
0.626ValTrp: 0.626 ± 0.246
1.609ValTyr: 1.609 ± 0.778
0.0ValXaa: 0.0 ± 0.0
Trp
1.072TrpAla: 1.072 ± 0.279
0.268TrpCys: 0.268 ± 0.175
1.43TrpAsp: 1.43 ± 0.374
0.804TrpGlu: 0.804 ± 0.237
0.357TrpPhe: 0.357 ± 0.206
0.983TrpGly: 0.983 ± 0.268
0.179TrpHis: 0.179 ± 0.1
0.715TrpIle: 0.715 ± 0.348
1.162TrpLys: 1.162 ± 0.374
0.894TrpLeu: 0.894 ± 0.29
0.447TrpMet: 0.447 ± 0.191
0.983TrpAsn: 0.983 ± 0.321
0.179TrpPro: 0.179 ± 0.149
0.268TrpGln: 0.268 ± 0.15
0.804TrpArg: 0.804 ± 0.293
0.626TrpSer: 0.626 ± 0.232
0.804TrpThr: 0.804 ± 0.283
1.251TrpVal: 1.251 ± 0.432
0.089TrpTrp: 0.089 ± 0.1
0.626TrpTyr: 0.626 ± 0.246
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.145TyrAla: 2.145 ± 0.458
0.626TyrCys: 0.626 ± 0.236
2.234TyrAsp: 2.234 ± 0.483
2.86TyrGlu: 2.86 ± 0.621
1.609TyrPhe: 1.609 ± 0.451
2.324TyrGly: 2.324 ± 0.64
0.447TyrHis: 0.447 ± 0.154
2.502TyrIle: 2.502 ± 0.573
2.77TyrLys: 2.77 ± 0.54
2.413TyrLeu: 2.413 ± 0.437
1.072TyrMet: 1.072 ± 0.339
2.234TyrAsn: 2.234 ± 0.405
1.34TyrPro: 1.34 ± 0.357
2.592TyrGln: 2.592 ± 0.63
2.324TyrArg: 2.324 ± 0.544
2.145TyrSer: 2.145 ± 0.496
2.502TyrThr: 2.502 ± 0.456
2.592TyrVal: 2.592 ± 0.551
0.804TyrTrp: 0.804 ± 0.316
1.162TyrTyr: 1.162 ± 0.389
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (11191 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski