Amino acid dipeptide frequency for Streptococcus phage Javan117

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
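The calculation described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used by Proteome-pI: the function names `dipeptide_permille` and `classify`, the toy sequences, and the choice to normalize by the total number of counted dipeptides are all assumptions, and Xaa-containing pairs are simply skipped.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues (one-letter codes)
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]  # 400 pairs
EXPECTED_PERMILLE = 1000.0 / len(DIPEPTIDES)  # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: frequencies are normalized by the total number of counted
    dipeptides; pairs containing non-standard letters are ignored.
    """
    valid = set(DIPEPTIDES)
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        pairs = (seq[i:i + 2] for i in range(len(seq) - 1))
        counts.update(p for p in pairs if p in valid)
    total = sum(counts.values())
    if total == 0:
        return {d: 0.0 for d in DIPEPTIDES}
    return {d: 1000.0 * counts[d] / total for d in DIPEPTIDES}


def classify(value, expected=EXPECTED_PERMILLE):
    """Label a per-mille value relative to the uniform expectation."""
    if value > expected:
        return "over"   # would be marked red in the table
    if value < expected:
        return "under"  # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    toy_proteome = ["MKKLAV", "MAKKE"]  # hypothetical sequences
    freqs = dipeptide_permille(toy_proteome)
    print("KK", round(freqs["KK"], 3), classify(freqs["KK"]))
```

With real data, the one-letter dipeptides (for example `KK`) correspond to the three-letter labels used in the table (for example LysLys).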

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions (for example, 4.582‰ = 0.004582).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.582 ± 1.478
AlaCys: 0.509 ± 0.245
AlaAsp: 4.242 ± 0.706
AlaGlu: 6.109 ± 0.681
AlaPhe: 1.951 ± 0.498
AlaGly: 4.497 ± 0.764
AlaHis: 0.679 ± 0.27
AlaIle: 5.176 ± 0.576
AlaLys: 6.363 ± 0.672
AlaLeu: 4.582 ± 0.477
AlaMet: 1.951 ± 0.402
AlaAsn: 3.564 ± 0.608
AlaPro: 2.036 ± 0.4
AlaGln: 2.545 ± 0.448
AlaArg: 2.461 ± 0.461
AlaSer: 3.818 ± 0.719
AlaThr: 4.582 ± 0.672
AlaVal: 4.751 ± 0.749
AlaTrp: 1.103 ± 0.362
AlaTyr: 2.376 ± 0.551
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.255 ± 0.164
CysCys: 0.0 ± 0.0
CysAsp: 0.17 ± 0.13
CysGlu: 0.933 ± 0.301
CysPhe: 0.17 ± 0.126
CysGly: 1.018 ± 0.412
CysHis: 0.085 ± 0.091
CysIle: 0.339 ± 0.174
CysLys: 0.339 ± 0.177
CysLeu: 0.764 ± 0.348
CysMet: 0.085 ± 0.082
CysAsn: 0.679 ± 0.23
CysPro: 0.255 ± 0.128
CysGln: 0.424 ± 0.212
CysArg: 0.594 ± 0.301
CysSer: 0.594 ± 0.191
CysThr: 0.255 ± 0.133
CysVal: 0.17 ± 0.109
CysTrp: 0.085 ± 0.084
CysTyr: 0.594 ± 0.294
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.648 ± 0.532
AspCys: 0.594 ± 0.267
AspAsp: 3.988 ± 0.518
AspGlu: 4.327 ± 0.581
AspPhe: 2.8 ± 0.689
AspGly: 6.363 ± 0.824
AspHis: 0.509 ± 0.192
AspIle: 5.176 ± 0.704
AspLys: 5.939 ± 0.589
AspLeu: 5.6 ± 0.814
AspMet: 0.933 ± 0.27
AspAsn: 4.751 ± 0.572
AspPro: 0.848 ± 0.279
AspGln: 1.442 ± 0.305
AspArg: 2.291 ± 0.479
AspSer: 4.582 ± 0.802
AspThr: 4.667 ± 0.631
AspVal: 4.582 ± 0.657
AspTrp: 0.764 ± 0.296
AspTyr: 3.394 ± 0.591
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.685 ± 0.838
GluCys: 0.424 ± 0.174
GluAsp: 3.054 ± 0.624
GluGlu: 5.26 ± 0.725
GluPhe: 2.97 ± 0.516
GluGly: 3.054 ± 0.5
GluHis: 0.594 ± 0.178
GluIle: 4.836 ± 0.596
GluLys: 6.279 ± 0.678
GluLeu: 7.212 ± 0.783
GluMet: 1.697 ± 0.409
GluAsn: 3.988 ± 0.668
GluPro: 1.697 ± 0.379
GluGln: 3.818 ± 0.563
GluArg: 2.291 ± 0.474
GluSer: 3.648 ± 0.474
GluThr: 4.412 ± 0.617
GluVal: 5.685 ± 0.629
GluTrp: 0.933 ± 0.253
GluTyr: 2.121 ± 0.409
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.121 ± 0.447
PheCys: 0.255 ± 0.153
PheAsp: 2.97 ± 0.505
PheGlu: 3.564 ± 0.75
PhePhe: 0.848 ± 0.275
PheGly: 3.054 ± 0.511
PheHis: 0.255 ± 0.148
PheIle: 2.461 ± 0.399
PheLys: 3.309 ± 0.692
PheLeu: 2.206 ± 0.526
PheMet: 0.764 ± 0.291
PheAsn: 2.121 ± 0.36
PhePro: 1.103 ± 0.361
PheGln: 0.933 ± 0.295
PheArg: 1.358 ± 0.419
PheSer: 2.8 ± 0.476
PheThr: 2.8 ± 0.441
PheVal: 2.545 ± 0.435
PheTrp: 0.594 ± 0.187
PheTyr: 1.442 ± 0.293
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.497 ± 0.678
GlyCys: 0.424 ± 0.219
GlyAsp: 4.667 ± 0.685
GlyGlu: 2.545 ± 0.406
GlyPhe: 2.8 ± 0.554
GlyGly: 4.497 ± 0.718
GlyHis: 1.103 ± 0.326
GlyIle: 5.77 ± 0.812
GlyLys: 6.533 ± 0.712
GlyLeu: 4.836 ± 0.782
GlyMet: 1.612 ± 0.409
GlyAsn: 3.309 ± 0.483
GlyPro: 1.103 ± 0.42
GlyGln: 2.291 ± 0.404
GlyArg: 2.461 ± 0.405
GlySer: 5.006 ± 1.269
GlyThr: 4.242 ± 0.762
GlyVal: 3.988 ± 0.439
GlyTrp: 1.612 ± 0.376
GlyTyr: 3.733 ± 0.608
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.764 ± 0.278
HisCys: 0.509 ± 0.261
HisAsp: 1.188 ± 0.316
HisGlu: 0.848 ± 0.259
HisPhe: 0.339 ± 0.14
HisGly: 0.509 ± 0.241
HisHis: 0.0 ± 0.0
HisIle: 0.933 ± 0.31
HisLys: 0.764 ± 0.186
HisLeu: 0.764 ± 0.267
HisMet: 0.339 ± 0.16
HisAsn: 1.103 ± 0.394
HisPro: 0.594 ± 0.199
HisGln: 0.509 ± 0.185
HisArg: 0.255 ± 0.134
HisSer: 0.848 ± 0.247
HisThr: 0.764 ± 0.237
HisVal: 0.764 ± 0.242
HisTrp: 0.339 ± 0.144
HisTyr: 0.255 ± 0.156
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.77 ± 0.685
IleCys: 0.509 ± 0.203
IleAsp: 6.618 ± 0.833
IleGlu: 5.854 ± 0.842
IlePhe: 2.291 ± 0.503
IleGly: 3.988 ± 0.584
IleHis: 0.764 ± 0.249
IleIle: 2.885 ± 0.515
IleLys: 7.382 ± 0.757
IleLeu: 4.242 ± 0.529
IleMet: 0.848 ± 0.317
IleAsn: 4.582 ± 0.664
IlePro: 1.697 ± 0.551
IleGln: 1.273 ± 0.32
IleArg: 1.697 ± 0.363
IleSer: 3.309 ± 0.483
IleThr: 5.77 ± 0.841
IleVal: 4.921 ± 0.682
IleTrp: 0.933 ± 0.264
IleTyr: 2.885 ± 0.412
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.582 ± 0.5
LysCys: 0.509 ± 0.225
LysAsp: 5.939 ± 0.811
LysGlu: 5.685 ± 0.808
LysPhe: 2.63 ± 0.416
LysGly: 4.751 ± 0.873
LysHis: 2.121 ± 0.415
LysIle: 6.448 ± 0.899
LysLys: 8.06 ± 1.219
LysLeu: 7.042 ± 0.705
LysMet: 1.782 ± 0.403
LysAsn: 5.77 ± 0.554
LysPro: 2.715 ± 0.577
LysGln: 4.073 ± 0.594
LysArg: 3.139 ± 0.705
LysSer: 6.618 ± 0.797
LysThr: 6.533 ± 0.763
LysVal: 6.533 ± 0.735
LysTrp: 0.848 ± 0.347
LysTyr: 3.224 ± 0.597
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.194 ± 0.814
LeuCys: 0.764 ± 0.246
LeuAsp: 6.448 ± 0.814
LeuGlu: 5.939 ± 0.822
LeuPhe: 2.885 ± 0.594
LeuGly: 5.006 ± 0.775
LeuHis: 0.339 ± 0.171
LeuIle: 5.77 ± 0.852
LeuLys: 7.127 ± 0.784
LeuLeu: 5.77 ± 0.595
LeuMet: 1.358 ± 0.325
LeuAsn: 4.497 ± 0.597
LeuPro: 1.273 ± 0.428
LeuGln: 3.224 ± 0.517
LeuArg: 2.8 ± 0.49
LeuSer: 5.43 ± 0.798
LeuThr: 5.515 ± 0.682
LeuVal: 4.157 ± 0.615
LeuTrp: 1.103 ± 0.244
LeuTyr: 2.715 ± 0.473
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.442 ± 0.405
MetCys: 0.085 ± 0.083
MetAsp: 0.679 ± 0.225
MetGlu: 1.442 ± 0.421
MetPhe: 0.933 ± 0.27
MetGly: 1.018 ± 0.321
MetHis: 0.255 ± 0.147
MetIle: 1.612 ± 0.353
MetLys: 1.867 ± 0.449
MetLeu: 2.036 ± 0.425
MetMet: 0.255 ± 0.142
MetAsn: 1.358 ± 0.29
MetPro: 0.509 ± 0.205
MetGln: 0.764 ± 0.223
MetArg: 0.679 ± 0.208
MetSer: 2.036 ± 0.469
MetThr: 2.036 ± 0.38
MetVal: 1.103 ± 0.292
MetTrp: 0.085 ± 0.096
MetTyr: 0.424 ± 0.177
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.733 ± 0.5
AsnCys: 0.255 ± 0.164
AsnAsp: 3.648 ± 0.579
AsnGlu: 3.903 ± 0.633
AsnPhe: 2.121 ± 0.354
AsnGly: 5.515 ± 0.68
AsnHis: 0.679 ± 0.201
AsnIle: 3.988 ± 0.592
AsnLys: 5.77 ± 0.737
AsnLeu: 4.667 ± 0.571
AsnMet: 0.933 ± 0.313
AsnAsn: 4.497 ± 0.786
AsnPro: 2.036 ± 0.412
AsnGln: 2.376 ± 0.493
AsnArg: 2.036 ± 0.376
AsnSer: 3.648 ± 0.638
AsnThr: 2.885 ± 0.457
AsnVal: 4.497 ± 0.596
AsnTrp: 0.933 ± 0.313
AsnTyr: 2.461 ± 0.434
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.612 ± 0.348
ProCys: 0.339 ± 0.172
ProAsp: 1.782 ± 0.385
ProGlu: 1.867 ± 0.369
ProPhe: 0.848 ± 0.224
ProGly: 0.594 ± 0.191
ProHis: 0.509 ± 0.18
ProIle: 1.697 ± 0.4
ProLys: 2.545 ± 0.441
ProLeu: 1.697 ± 0.375
ProMet: 0.17 ± 0.119
ProAsn: 1.103 ± 0.253
ProPro: 1.188 ± 0.277
ProGln: 1.358 ± 0.335
ProArg: 0.764 ± 0.245
ProSer: 2.715 ± 0.621
ProThr: 1.527 ± 0.339
ProVal: 2.036 ± 0.414
ProTrp: 0.255 ± 0.192
ProTyr: 1.697 ± 0.365
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.63 ± 0.54
GlnCys: 0.509 ± 0.185
GlnAsp: 1.697 ± 0.334
GlnGlu: 2.8 ± 0.629
GlnPhe: 1.951 ± 0.415
GlnGly: 2.291 ± 0.503
GlnHis: 0.933 ± 0.317
GlnIle: 2.8 ± 0.438
GlnLys: 3.903 ± 0.544
GlnLeu: 3.054 ± 0.502
GlnMet: 0.764 ± 0.304
GlnAsn: 2.206 ± 0.435
GlnPro: 1.188 ± 0.398
GlnGln: 1.103 ± 0.376
GlnArg: 1.188 ± 0.364
GlnSer: 3.224 ± 0.552
GlnThr: 1.867 ± 0.327
GlnVal: 1.782 ± 0.394
GlnTrp: 0.509 ± 0.224
GlnTyr: 1.188 ± 0.254
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.376 ± 0.416
ArgCys: 0.424 ± 0.18
ArgAsp: 1.527 ± 0.339
ArgGlu: 2.885 ± 0.521
ArgPhe: 1.273 ± 0.244
ArgGly: 2.461 ± 0.382
ArgHis: 0.424 ± 0.217
ArgIle: 2.206 ± 0.476
ArgLys: 2.885 ± 0.486
ArgLeu: 3.818 ± 0.573
ArgMet: 0.594 ± 0.179
ArgAsn: 1.782 ± 0.413
ArgPro: 0.594 ± 0.263
ArgGln: 2.121 ± 0.501
ArgArg: 2.036 ± 0.469
ArgSer: 1.527 ± 0.352
ArgThr: 2.206 ± 0.698
ArgVal: 2.63 ± 0.502
ArgTrp: 0.594 ± 0.257
ArgTyr: 1.442 ± 0.336
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.751 ± 0.744
SerCys: 0.509 ± 0.273
SerAsp: 4.497 ± 0.581
SerGlu: 4.497 ± 0.691
SerPhe: 2.545 ± 0.579
SerGly: 5.345 ± 0.783
SerHis: 0.764 ± 0.253
SerIle: 4.157 ± 0.571
SerLys: 4.497 ± 0.659
SerLeu: 5.26 ± 0.745
SerMet: 1.782 ± 0.452
SerAsn: 4.497 ± 0.901
SerPro: 1.442 ± 0.545
SerGln: 2.545 ± 0.537
SerArg: 2.206 ± 0.568
SerSer: 3.054 ± 0.891
SerThr: 3.818 ± 0.706
SerVal: 5.091 ± 0.609
SerTrp: 0.848 ± 0.27
SerTyr: 2.885 ± 0.49
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.157 ± 0.498
ThrCys: 0.085 ± 0.087
ThrAsp: 3.988 ± 0.547
ThrGlu: 4.242 ± 0.725
ThrPhe: 3.224 ± 0.605
ThrGly: 6.194 ± 0.654
ThrHis: 0.764 ± 0.218
ThrIle: 4.836 ± 0.719
ThrLys: 4.921 ± 0.598
ThrLeu: 5.26 ± 0.733
ThrMet: 1.358 ± 0.379
ThrAsn: 3.224 ± 0.443
ThrPro: 1.951 ± 0.439
ThrGln: 2.715 ± 0.534
ThrArg: 2.63 ± 0.308
ThrSer: 3.818 ± 0.485
ThrThr: 4.412 ± 0.745
ThrVal: 4.582 ± 1.086
ThrTrp: 0.933 ± 0.305
ThrTyr: 3.054 ± 0.554
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.176 ± 0.649
ValCys: 0.339 ± 0.187
ValAsp: 6.194 ± 0.636
ValGlu: 4.751 ± 0.706
ValPhe: 3.054 ± 0.605
ValGly: 3.903 ± 0.729
ValHis: 0.764 ± 0.204
ValIle: 3.903 ± 0.558
ValLys: 5.091 ± 0.875
ValLeu: 4.327 ± 0.581
ValMet: 2.121 ± 0.426
ValAsn: 3.903 ± 0.504
ValPro: 2.291 ± 0.583
ValGln: 2.036 ± 0.525
ValArg: 2.376 ± 0.476
ValSer: 4.836 ± 0.645
ValThr: 4.836 ± 0.786
ValVal: 4.073 ± 0.651
ValTrp: 0.764 ± 0.299
ValTyr: 2.715 ± 0.441
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.188 ± 0.52
TrpCys: 0.0 ± 0.0
TrpAsp: 0.764 ± 0.293
TrpGlu: 0.764 ± 0.243
TrpPhe: 0.424 ± 0.226
TrpGly: 0.679 ± 0.294
TrpHis: 0.424 ± 0.18
TrpIle: 1.018 ± 0.396
TrpLys: 1.358 ± 0.359
TrpLeu: 1.103 ± 0.322
TrpMet: 0.085 ± 0.082
TrpAsn: 0.679 ± 0.227
TrpPro: 0.339 ± 0.225
TrpGln: 0.594 ± 0.236
TrpArg: 0.764 ± 0.301
TrpSer: 1.103 ± 0.287
TrpThr: 1.273 ± 0.246
TrpVal: 0.848 ± 0.291
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.424 ± 0.185
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.715 ± 0.707
TyrCys: 0.764 ± 0.255
TyrAsp: 3.139 ± 0.544
TyrGlu: 1.442 ± 0.335
TyrPhe: 1.527 ± 0.362
TyrGly: 2.291 ± 0.493
TyrHis: 0.424 ± 0.191
TyrIle: 2.206 ± 0.439
TyrLys: 4.073 ± 0.647
TyrLeu: 4.073 ± 0.675
TyrMet: 1.103 ± 0.293
TyrAsn: 2.97 ± 0.512
TyrPro: 1.442 ± 0.397
TyrGln: 1.442 ± 0.458
TyrArg: 1.782 ± 0.39
TyrSer: 2.376 ± 0.424
TyrThr: 1.951 ± 0.389
TyrVal: 2.715 ± 0.542
TyrTrp: 0.594 ± 0.265
TyrTyr: 2.461 ± 0.535
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (11787 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
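A protein-level bootstrap of this kind can be sketched as follows. This is a hedged illustration, not the database's actual procedure: the helper names `dipeptide_frequency` and `bootstrap_error`, the fixed seed, and the use of the sample standard deviation as the reported error are all assumptions. Whole proteins are resampled with replacement, so each resample contains as many proteins as the original proteome.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Assumption: the error is taken as the sample standard deviation of the
    resampled estimates; the source does not state which statistic it uses.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(resample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKKLAV", "MAKKE", "MLLKA"]  # hypothetical sequences
    value = dipeptide_frequency(toy_proteome, "KK")
    error = bootstrap_error(toy_proteome, "KK")
    print(f"KK: {value:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins produce a correspondingly larger error.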

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski