Amino acid dipepetide frequency for Streptococcus phage 23782

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.888AlaAla: 2.888 ± 0.954
0.516AlaCys: 0.516 ± 0.23
4.847AlaAsp: 4.847 ± 0.632
6.188AlaGlu: 6.188 ± 0.823
2.682AlaPhe: 2.682 ± 0.573
4.744AlaGly: 4.744 ± 0.746
0.413AlaHis: 0.413 ± 0.255
5.054AlaIle: 5.054 ± 1.218
5.054AlaLys: 5.054 ± 0.668
6.291AlaLeu: 6.291 ± 0.917
1.856AlaMet: 1.856 ± 0.344
4.022AlaAsn: 4.022 ± 1.059
1.134AlaPro: 1.134 ± 0.279
1.96AlaGln: 1.96 ± 0.396
2.785AlaArg: 2.785 ± 0.453
4.744AlaSer: 4.744 ± 0.694
3.507AlaThr: 3.507 ± 0.762
5.26AlaVal: 5.26 ± 0.777
1.753AlaTrp: 1.753 ± 0.678
1.65AlaTyr: 1.65 ± 0.445
0.0AlaXaa: 0.0 ± 0.0
Cys
0.103CysAla: 0.103 ± 0.078
0.206CysCys: 0.206 ± 0.157
0.206CysAsp: 0.206 ± 0.127
0.722CysGlu: 0.722 ± 0.278
0.206CysPhe: 0.206 ± 0.132
0.309CysGly: 0.309 ± 0.246
0.309CysHis: 0.309 ± 0.19
0.103CysIle: 0.103 ± 0.099
0.309CysLys: 0.309 ± 0.179
0.825CysLeu: 0.825 ± 0.28
0.103CysMet: 0.103 ± 0.112
0.103CysAsn: 0.103 ± 0.114
0.0CysPro: 0.0 ± 0.0
0.206CysGln: 0.206 ± 0.145
0.722CysArg: 0.722 ± 0.271
0.103CysSer: 0.103 ± 0.112
0.206CysThr: 0.206 ± 0.143
0.516CysVal: 0.516 ± 0.276
0.103CysTrp: 0.103 ± 0.078
0.619CysTyr: 0.619 ± 0.254
0.0CysXaa: 0.0 ± 0.0
Asp
4.538AspAla: 4.538 ± 0.656
0.413AspCys: 0.413 ± 0.191
4.229AspAsp: 4.229 ± 0.838
4.641AspGlu: 4.641 ± 1.082
3.094AspPhe: 3.094 ± 0.411
5.363AspGly: 5.363 ± 0.831
1.134AspHis: 1.134 ± 0.275
4.332AspIle: 4.332 ± 0.527
4.847AspLys: 4.847 ± 0.894
5.569AspLeu: 5.569 ± 0.664
1.134AspMet: 1.134 ± 0.268
2.991AspAsn: 2.991 ± 0.413
1.134AspPro: 1.134 ± 0.28
1.134AspGln: 1.134 ± 0.488
2.475AspArg: 2.475 ± 0.432
2.682AspSer: 2.682 ± 0.447
2.578AspThr: 2.578 ± 0.462
3.816AspVal: 3.816 ± 0.632
1.134AspTrp: 1.134 ± 0.364
3.3AspTyr: 3.3 ± 0.647
0.0AspXaa: 0.0 ± 0.0
Glu
6.291GluAla: 6.291 ± 0.883
0.206GluCys: 0.206 ± 0.124
4.332GluAsp: 4.332 ± 0.732
4.95GluGlu: 4.95 ± 0.913
3.197GluPhe: 3.197 ± 0.664
3.713GluGly: 3.713 ± 0.661
0.516GluHis: 0.516 ± 0.266
5.363GluIle: 5.363 ± 0.804
6.807GluLys: 6.807 ± 1.018
6.807GluLeu: 6.807 ± 1.208
1.547GluMet: 1.547 ± 0.423
4.744GluAsn: 4.744 ± 0.671
2.475GluPro: 2.475 ± 0.679
3.507GluGln: 3.507 ± 0.869
3.507GluArg: 3.507 ± 0.78
4.125GluSer: 4.125 ± 0.579
3.816GluThr: 3.816 ± 0.464
3.919GluVal: 3.919 ± 0.758
1.031GluTrp: 1.031 ± 0.319
2.682GluTyr: 2.682 ± 0.529
0.0GluXaa: 0.0 ± 0.0
Phe
3.197PheAla: 3.197 ± 0.637
0.103PheCys: 0.103 ± 0.106
3.61PheAsp: 3.61 ± 0.617
3.403PheGlu: 3.403 ± 0.508
2.063PhePhe: 2.063 ± 0.511
2.682PheGly: 2.682 ± 0.547
0.309PheHis: 0.309 ± 0.175
2.372PheIle: 2.372 ± 0.55
2.682PheLys: 2.682 ± 0.535
2.475PheLeu: 2.475 ± 0.682
1.547PheMet: 1.547 ± 0.407
2.269PheAsn: 2.269 ± 0.456
0.825PhePro: 0.825 ± 0.396
1.444PheGln: 1.444 ± 0.433
1.753PheArg: 1.753 ± 0.338
3.3PheSer: 3.3 ± 0.872
2.888PheThr: 2.888 ± 0.501
2.269PheVal: 2.269 ± 0.409
0.413PheTrp: 0.413 ± 0.188
2.475PheTyr: 2.475 ± 0.466
0.0PheXaa: 0.0 ± 0.0
Gly
3.919GlyAla: 3.919 ± 0.843
0.206GlyCys: 0.206 ± 0.125
2.888GlyAsp: 2.888 ± 0.648
3.919GlyGlu: 3.919 ± 0.493
3.713GlyPhe: 3.713 ± 0.494
4.95GlyGly: 4.95 ± 0.819
0.722GlyHis: 0.722 ± 0.274
5.466GlyIle: 5.466 ± 1.132
4.95GlyLys: 4.95 ± 0.571
6.91GlyLeu: 6.91 ± 1.279
2.269GlyMet: 2.269 ± 0.642
4.744GlyAsn: 4.744 ± 0.623
1.031GlyPro: 1.031 ± 0.33
3.197GlyGln: 3.197 ± 0.651
3.507GlyArg: 3.507 ± 0.51
4.332GlySer: 4.332 ± 0.823
3.403GlyThr: 3.403 ± 0.586
3.816GlyVal: 3.816 ± 0.764
1.238GlyTrp: 1.238 ± 0.408
4.022GlyTyr: 4.022 ± 0.654
0.0GlyXaa: 0.0 ± 0.0
His
0.722HisAla: 0.722 ± 0.342
0.413HisCys: 0.413 ± 0.218
0.413HisAsp: 0.413 ± 0.197
1.031HisGlu: 1.031 ± 0.3
0.516HisPhe: 0.516 ± 0.251
0.413HisGly: 0.413 ± 0.183
0.206HisHis: 0.206 ± 0.149
1.031HisIle: 1.031 ± 0.369
1.134HisLys: 1.134 ± 0.237
0.722HisLeu: 0.722 ± 0.316
0.206HisMet: 0.206 ± 0.17
0.928HisAsn: 0.928 ± 0.29
0.619HisPro: 0.619 ± 0.27
0.928HisGln: 0.928 ± 0.311
0.928HisArg: 0.928 ± 0.311
1.341HisSer: 1.341 ± 0.442
0.928HisThr: 0.928 ± 0.266
0.825HisVal: 0.825 ± 0.341
0.309HisTrp: 0.309 ± 0.176
0.722HisTyr: 0.722 ± 0.259
0.0HisXaa: 0.0 ± 0.0
Ile
5.363IleAla: 5.363 ± 0.966
0.619IleCys: 0.619 ± 0.25
4.229IleAsp: 4.229 ± 0.585
6.291IleGlu: 6.291 ± 0.653
2.785IlePhe: 2.785 ± 0.596
4.847IleGly: 4.847 ± 0.765
0.619IleHis: 0.619 ± 0.244
2.991IleIle: 2.991 ± 0.532
5.26IleLys: 5.26 ± 0.818
5.982IleLeu: 5.982 ± 0.925
0.928IleMet: 0.928 ± 0.314
3.3IleAsn: 3.3 ± 0.562
2.063IlePro: 2.063 ± 0.619
2.372IleGln: 2.372 ± 0.558
2.372IleArg: 2.372 ± 0.514
5.26IleSer: 5.26 ± 0.902
4.847IleThr: 4.847 ± 0.545
3.816IleVal: 3.816 ± 0.522
1.238IleTrp: 1.238 ± 0.76
2.063IleTyr: 2.063 ± 0.435
0.0IleXaa: 0.0 ± 0.0
Lys
5.569LysAla: 5.569 ± 0.709
0.103LysCys: 0.103 ± 0.107
5.672LysAsp: 5.672 ± 0.616
5.569LysGlu: 5.569 ± 0.901
2.063LysPhe: 2.063 ± 0.476
5.26LysGly: 5.26 ± 0.762
1.341LysHis: 1.341 ± 0.406
5.776LysIle: 5.776 ± 1.065
7.735LysLys: 7.735 ± 1.285
5.466LysLeu: 5.466 ± 0.701
2.475LysMet: 2.475 ± 0.415
5.054LysAsn: 5.054 ± 0.657
2.372LysPro: 2.372 ± 0.58
4.744LysGln: 4.744 ± 0.617
3.816LysArg: 3.816 ± 0.792
5.363LysSer: 5.363 ± 0.924
4.95LysThr: 4.95 ± 0.653
4.847LysVal: 4.847 ± 0.808
1.031LysTrp: 1.031 ± 0.319
3.61LysTyr: 3.61 ± 0.54
0.0LysXaa: 0.0 ± 0.0
Leu
5.776LeuAla: 5.776 ± 0.593
0.309LeuCys: 0.309 ± 0.243
6.188LeuAsp: 6.188 ± 0.691
5.363LeuGlu: 5.363 ± 0.838
3.3LeuPhe: 3.3 ± 0.703
6.601LeuGly: 6.601 ± 0.888
0.825LeuHis: 0.825 ± 0.347
4.95LeuIle: 4.95 ± 0.58
8.045LeuLys: 8.045 ± 0.858
7.323LeuLeu: 7.323 ± 1.041
1.856LeuMet: 1.856 ± 0.497
4.229LeuAsn: 4.229 ± 0.711
2.578LeuPro: 2.578 ± 0.671
2.475LeuGln: 2.475 ± 0.728
4.847LeuArg: 4.847 ± 0.81
6.085LeuSer: 6.085 ± 0.653
5.569LeuThr: 5.569 ± 0.718
4.641LeuVal: 4.641 ± 0.606
1.134LeuTrp: 1.134 ± 0.442
1.753LeuTyr: 1.753 ± 0.412
0.0LeuXaa: 0.0 ± 0.0
Met
1.547MetAla: 1.547 ± 0.415
0.413MetCys: 0.413 ± 0.171
0.825MetAsp: 0.825 ± 0.3
2.578MetGlu: 2.578 ± 0.592
0.619MetPhe: 0.619 ± 0.228
1.65MetGly: 1.65 ± 0.458
0.309MetHis: 0.309 ± 0.209
2.063MetIle: 2.063 ± 0.454
2.475MetLys: 2.475 ± 0.602
1.65MetLeu: 1.65 ± 0.443
0.619MetMet: 0.619 ± 0.221
1.444MetAsn: 1.444 ± 0.487
0.722MetPro: 0.722 ± 0.242
1.134MetGln: 1.134 ± 0.334
1.134MetArg: 1.134 ± 0.379
1.238MetSer: 1.238 ± 0.43
1.547MetThr: 1.547 ± 0.446
1.65MetVal: 1.65 ± 0.446
0.103MetTrp: 0.103 ± 0.107
0.722MetTyr: 0.722 ± 0.226
0.0MetXaa: 0.0 ± 0.0
Asn
3.61AsnAla: 3.61 ± 0.913
0.309AsnCys: 0.309 ± 0.216
2.475AsnAsp: 2.475 ± 0.566
3.3AsnGlu: 3.3 ± 0.53
2.063AsnPhe: 2.063 ± 0.361
4.95AsnGly: 4.95 ± 0.918
0.928AsnHis: 0.928 ± 0.32
3.713AsnIle: 3.713 ± 0.602
4.229AsnLys: 4.229 ± 0.641
5.569AsnLeu: 5.569 ± 0.9
1.341AsnMet: 1.341 ± 0.351
2.785AsnAsn: 2.785 ± 0.522
2.578AsnPro: 2.578 ± 0.521
3.094AsnGln: 3.094 ± 0.666
2.578AsnArg: 2.578 ± 0.455
3.61AsnSer: 3.61 ± 0.713
2.888AsnThr: 2.888 ± 0.669
3.507AsnVal: 3.507 ± 0.543
0.928AsnTrp: 0.928 ± 0.276
2.166AsnTyr: 2.166 ± 0.438
0.0AsnXaa: 0.0 ± 0.0
Pro
1.65ProAla: 1.65 ± 0.577
0.103ProCys: 0.103 ± 0.116
1.856ProAsp: 1.856 ± 0.402
2.578ProGlu: 2.578 ± 0.587
1.547ProPhe: 1.547 ± 0.425
0.825ProGly: 0.825 ± 0.309
0.825ProHis: 0.825 ± 0.314
1.856ProIle: 1.856 ± 0.452
3.3ProLys: 3.3 ± 0.491
2.269ProLeu: 2.269 ± 0.441
0.516ProMet: 0.516 ± 0.207
1.134ProAsn: 1.134 ± 0.298
0.928ProPro: 0.928 ± 0.357
1.341ProGln: 1.341 ± 0.401
1.238ProArg: 1.238 ± 0.3
1.856ProSer: 1.856 ± 0.496
1.444ProThr: 1.444 ± 0.432
1.444ProVal: 1.444 ± 0.436
0.103ProTrp: 0.103 ± 0.105
0.928ProTyr: 0.928 ± 0.297
0.0ProXaa: 0.0 ± 0.0
Gln
2.269GlnAla: 2.269 ± 0.605
0.103GlnCys: 0.103 ± 0.114
1.444GlnAsp: 1.444 ± 0.367
3.507GlnGlu: 3.507 ± 0.53
1.341GlnPhe: 1.341 ± 0.364
2.991GlnGly: 2.991 ± 0.732
0.516GlnHis: 0.516 ± 0.234
3.094GlnIle: 3.094 ± 0.583
4.125GlnLys: 4.125 ± 0.693
3.094GlnLeu: 3.094 ± 0.532
1.134GlnMet: 1.134 ± 0.33
2.475GlnAsn: 2.475 ± 0.403
1.341GlnPro: 1.341 ± 0.385
1.341GlnGln: 1.341 ± 0.395
2.166GlnArg: 2.166 ± 0.485
2.682GlnSer: 2.682 ± 0.554
1.547GlnThr: 1.547 ± 0.462
2.991GlnVal: 2.991 ± 0.573
0.309GlnTrp: 0.309 ± 0.133
1.031GlnTyr: 1.031 ± 0.412
0.0GlnXaa: 0.0 ± 0.0
Arg
3.507ArgAla: 3.507 ± 0.628
0.413ArgCys: 0.413 ± 0.244
1.96ArgAsp: 1.96 ± 0.463
2.682ArgGlu: 2.682 ± 0.53
1.65ArgPhe: 1.65 ± 0.506
1.134ArgGly: 1.134 ± 0.351
0.928ArgHis: 0.928 ± 0.356
3.403ArgIle: 3.403 ± 0.793
5.26ArgLys: 5.26 ± 0.821
3.919ArgLeu: 3.919 ± 0.704
1.134ArgMet: 1.134 ± 0.31
3.403ArgAsn: 3.403 ± 0.616
1.444ArgPro: 1.444 ± 0.446
2.166ArgGln: 2.166 ± 0.475
2.063ArgArg: 2.063 ± 0.492
1.856ArgSer: 1.856 ± 0.504
2.682ArgThr: 2.682 ± 0.629
2.475ArgVal: 2.475 ± 0.373
0.516ArgTrp: 0.516 ± 0.211
2.888ArgTyr: 2.888 ± 0.776
0.0ArgXaa: 0.0 ± 0.0
Ser
5.054SerAla: 5.054 ± 1.066
0.516SerCys: 0.516 ± 0.253
4.125SerAsp: 4.125 ± 0.685
4.435SerGlu: 4.435 ± 0.77
2.888SerPhe: 2.888 ± 0.443
5.776SerGly: 5.776 ± 0.767
1.753SerHis: 1.753 ± 0.42
3.919SerIle: 3.919 ± 0.709
4.125SerLys: 4.125 ± 0.854
5.672SerLeu: 5.672 ± 0.712
2.475SerMet: 2.475 ± 0.62
3.507SerAsn: 3.507 ± 0.531
1.96SerPro: 1.96 ± 0.38
1.65SerGln: 1.65 ± 0.448
1.96SerArg: 1.96 ± 0.561
4.538SerSer: 4.538 ± 0.847
4.332SerThr: 4.332 ± 0.599
3.919SerVal: 3.919 ± 0.743
1.134SerTrp: 1.134 ± 0.329
2.372SerTyr: 2.372 ± 0.438
0.0SerXaa: 0.0 ± 0.0
Thr
3.919ThrAla: 3.919 ± 1.119
0.206ThrCys: 0.206 ± 0.149
4.435ThrAsp: 4.435 ± 0.864
3.507ThrGlu: 3.507 ± 0.552
3.61ThrPhe: 3.61 ± 0.828
5.466ThrGly: 5.466 ± 0.808
0.722ThrHis: 0.722 ± 0.319
4.332ThrIle: 4.332 ± 0.572
2.888ThrLys: 2.888 ± 0.496
4.229ThrLeu: 4.229 ± 0.741
0.516ThrMet: 0.516 ± 0.284
3.094ThrAsn: 3.094 ± 0.481
1.341ThrPro: 1.341 ± 0.333
1.856ThrGln: 1.856 ± 0.513
2.063ThrArg: 2.063 ± 0.481
4.229ThrSer: 4.229 ± 0.599
4.022ThrThr: 4.022 ± 0.701
4.332ThrVal: 4.332 ± 0.633
0.825ThrTrp: 0.825 ± 0.298
1.96ThrTyr: 1.96 ± 0.369
0.0ThrXaa: 0.0 ± 0.0
Val
4.022ValAla: 4.022 ± 0.69
0.309ValCys: 0.309 ± 0.186
3.816ValAsp: 3.816 ± 0.721
4.744ValGlu: 4.744 ± 0.7
2.475ValPhe: 2.475 ± 0.558
4.229ValGly: 4.229 ± 0.737
1.031ValHis: 1.031 ± 0.46
3.713ValIle: 3.713 ± 0.489
5.26ValLys: 5.26 ± 0.6
4.847ValLeu: 4.847 ± 0.943
0.825ValMet: 0.825 ± 0.35
2.991ValAsn: 2.991 ± 0.503
2.063ValPro: 2.063 ± 0.593
2.475ValGln: 2.475 ± 0.53
3.094ValArg: 3.094 ± 0.614
4.641ValSer: 4.641 ± 0.662
4.229ValThr: 4.229 ± 0.782
4.125ValVal: 4.125 ± 0.516
0.413ValTrp: 0.413 ± 0.177
2.063ValTyr: 2.063 ± 0.431
0.0ValXaa: 0.0 ± 0.0
Trp
0.825TrpAla: 0.825 ± 0.291
0.103TrpCys: 0.103 ± 0.112
0.516TrpAsp: 0.516 ± 0.244
1.65TrpGlu: 1.65 ± 0.542
0.516TrpPhe: 0.516 ± 0.227
1.238TrpGly: 1.238 ± 0.357
0.0TrpHis: 0.0 ± 0.0
1.134TrpIle: 1.134 ± 0.32
1.134TrpLys: 1.134 ± 0.294
0.516TrpLeu: 0.516 ± 0.239
0.413TrpMet: 0.413 ± 0.182
1.341TrpAsn: 1.341 ± 0.422
0.103TrpPro: 0.103 ± 0.107
0.619TrpGln: 0.619 ± 0.219
0.516TrpArg: 0.516 ± 0.263
1.031TrpSer: 1.031 ± 0.347
0.413TrpThr: 0.413 ± 0.198
1.238TrpVal: 1.238 ± 0.255
0.206TrpTrp: 0.206 ± 0.117
0.928TrpTyr: 0.928 ± 0.503
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.578TyrAla: 2.578 ± 0.465
0.413TyrCys: 0.413 ± 0.176
2.475TyrAsp: 2.475 ± 0.469
2.475TyrGlu: 2.475 ± 0.544
1.753TyrPhe: 1.753 ± 0.423
2.063TyrGly: 2.063 ± 0.414
0.928TyrHis: 0.928 ± 0.283
2.578TyrIle: 2.578 ± 0.568
3.197TyrLys: 3.197 ± 0.578
3.713TyrLeu: 3.713 ± 0.615
1.444TyrMet: 1.444 ± 0.329
2.063TyrAsn: 2.063 ± 0.456
1.031TyrPro: 1.031 ± 0.361
1.856TyrGln: 1.856 ± 0.417
2.063TyrArg: 2.063 ± 0.448
3.094TyrSer: 3.094 ± 0.589
1.753TyrThr: 1.753 ± 0.38
1.96TyrVal: 1.96 ± 0.496
0.516TyrTrp: 0.516 ± 0.19
1.856TyrTyr: 1.856 ± 0.748
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (9697 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski