Amino acid dipepetide frequency for Streptococcus phage Javan452

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.564AlaAla: 4.564 ± 1.245
0.315AlaCys: 0.315 ± 0.15
4.092AlaAsp: 4.092 ± 0.522
4.721AlaGlu: 4.721 ± 0.69
3.305AlaPhe: 3.305 ± 0.64
4.092AlaGly: 4.092 ± 1.185
0.472AlaHis: 0.472 ± 0.16
6.374AlaIle: 6.374 ± 1.032
7.397AlaLys: 7.397 ± 0.91
5.194AlaLeu: 5.194 ± 0.622
2.518AlaMet: 2.518 ± 0.593
3.62AlaAsn: 3.62 ± 0.519
0.866AlaPro: 0.866 ± 0.225
3.305AlaGln: 3.305 ± 0.666
2.754AlaArg: 2.754 ± 0.387
4.721AlaSer: 4.721 ± 0.94
5.115AlaThr: 5.115 ± 0.851
5.351AlaVal: 5.351 ± 1.293
0.866AlaTrp: 0.866 ± 0.28
1.81AlaTyr: 1.81 ± 0.358
0.0AlaXaa: 0.0 ± 0.0
Cys
0.236CysAla: 0.236 ± 0.121
0.079CysCys: 0.079 ± 0.082
0.393CysAsp: 0.393 ± 0.206
0.63CysGlu: 0.63 ± 0.243
0.079CysPhe: 0.079 ± 0.089
0.315CysGly: 0.315 ± 0.179
0.0CysHis: 0.0 ± 0.0
0.157CysIle: 0.157 ± 0.111
0.393CysLys: 0.393 ± 0.181
0.472CysLeu: 0.472 ± 0.153
0.079CysMet: 0.079 ± 0.066
0.157CysAsn: 0.157 ± 0.104
0.0CysPro: 0.0 ± 0.0
0.393CysGln: 0.393 ± 0.189
0.393CysArg: 0.393 ± 0.181
0.315CysSer: 0.315 ± 0.161
0.079CysThr: 0.079 ± 0.076
0.472CysVal: 0.472 ± 0.2
0.079CysTrp: 0.079 ± 0.092
0.157CysTyr: 0.157 ± 0.092
0.0CysXaa: 0.0 ± 0.0
Asp
3.305AspAla: 3.305 ± 0.469
0.787AspCys: 0.787 ± 0.234
3.541AspAsp: 3.541 ± 0.599
4.721AspGlu: 4.721 ± 0.659
3.069AspPhe: 3.069 ± 0.478
4.092AspGly: 4.092 ± 0.715
0.708AspHis: 0.708 ± 0.227
5.036AspIle: 5.036 ± 0.606
5.823AspLys: 5.823 ± 0.737
5.587AspLeu: 5.587 ± 0.759
1.653AspMet: 1.653 ± 0.391
3.935AspAsn: 3.935 ± 0.538
1.416AspPro: 1.416 ± 0.337
1.259AspGln: 1.259 ± 0.248
2.361AspArg: 2.361 ± 0.436
4.013AspSer: 4.013 ± 0.533
2.754AspThr: 2.754 ± 0.517
3.384AspVal: 3.384 ± 0.505
1.102AspTrp: 1.102 ± 0.319
2.833AspTyr: 2.833 ± 0.535
0.0AspXaa: 0.0 ± 0.0
Glu
5.351GluAla: 5.351 ± 0.801
0.393GluCys: 0.393 ± 0.177
2.912GluAsp: 2.912 ± 0.595
6.217GluGlu: 6.217 ± 1.113
3.069GluPhe: 3.069 ± 0.419
4.171GluGly: 4.171 ± 0.423
1.967GluHis: 1.967 ± 0.359
6.217GluIle: 6.217 ± 0.74
7.003GluLys: 7.003 ± 0.925
8.263GluLeu: 8.263 ± 1.067
2.203GluMet: 2.203 ± 0.292
3.935GluAsn: 3.935 ± 0.75
2.046GluPro: 2.046 ± 0.463
3.856GluGln: 3.856 ± 0.55
2.99GluArg: 2.99 ± 0.463
4.8GluSer: 4.8 ± 0.578
4.171GluThr: 4.171 ± 0.789
4.8GluVal: 4.8 ± 0.915
0.63GluTrp: 0.63 ± 0.279
3.069GluTyr: 3.069 ± 0.544
0.0GluXaa: 0.0 ± 0.0
Phe
2.203PheAla: 2.203 ± 0.638
0.157PheCys: 0.157 ± 0.104
3.698PheAsp: 3.698 ± 0.508
3.226PheGlu: 3.226 ± 0.456
1.416PhePhe: 1.416 ± 0.395
3.384PheGly: 3.384 ± 0.667
0.787PheHis: 0.787 ± 0.33
2.754PheIle: 2.754 ± 0.492
3.856PheLys: 3.856 ± 0.528
2.125PheLeu: 2.125 ± 0.39
0.708PheMet: 0.708 ± 0.229
2.754PheAsn: 2.754 ± 0.552
0.866PhePro: 0.866 ± 0.284
0.787PheGln: 0.787 ± 0.212
2.282PheArg: 2.282 ± 0.439
2.675PheSer: 2.675 ± 0.522
2.282PheThr: 2.282 ± 0.322
2.754PheVal: 2.754 ± 0.425
0.236PheTrp: 0.236 ± 0.119
1.102PheTyr: 1.102 ± 0.29
0.0PheXaa: 0.0 ± 0.0
Gly
4.721GlyAla: 4.721 ± 1.385
0.079GlyCys: 0.079 ± 0.073
3.226GlyAsp: 3.226 ± 0.607
3.777GlyGlu: 3.777 ± 0.411
2.99GlyPhe: 2.99 ± 0.56
3.856GlyGly: 3.856 ± 0.775
0.708GlyHis: 0.708 ± 0.226
3.935GlyIle: 3.935 ± 0.773
6.374GlyLys: 6.374 ± 0.584
6.217GlyLeu: 6.217 ± 0.963
1.653GlyMet: 1.653 ± 0.364
2.439GlyAsn: 2.439 ± 0.47
0.944GlyPro: 0.944 ± 0.243
2.439GlyGln: 2.439 ± 0.529
2.518GlyArg: 2.518 ± 0.519
2.833GlySer: 2.833 ± 0.551
4.879GlyThr: 4.879 ± 0.609
3.935GlyVal: 3.935 ± 0.58
0.63GlyTrp: 0.63 ± 0.206
2.675GlyTyr: 2.675 ± 0.441
0.0GlyXaa: 0.0 ± 0.0
His
0.708HisAla: 0.708 ± 0.225
0.157HisCys: 0.157 ± 0.107
0.944HisAsp: 0.944 ± 0.281
1.416HisGlu: 1.416 ± 0.365
0.551HisPhe: 0.551 ± 0.162
0.787HisGly: 0.787 ± 0.216
0.157HisHis: 0.157 ± 0.102
0.708HisIle: 0.708 ± 0.256
0.866HisLys: 0.866 ± 0.298
0.787HisLeu: 0.787 ± 0.256
0.157HisMet: 0.157 ± 0.1
0.551HisAsn: 0.551 ± 0.246
0.63HisPro: 0.63 ± 0.275
0.708HisGln: 0.708 ± 0.245
0.63HisArg: 0.63 ± 0.179
1.416HisSer: 1.416 ± 0.369
0.944HisThr: 0.944 ± 0.223
0.866HisVal: 0.866 ± 0.227
0.157HisTrp: 0.157 ± 0.114
0.708HisTyr: 0.708 ± 0.274
0.0HisXaa: 0.0 ± 0.0
Ile
4.958IleAla: 4.958 ± 0.652
0.393IleCys: 0.393 ± 0.163
5.508IleAsp: 5.508 ± 0.724
6.138IleGlu: 6.138 ± 0.629
2.125IlePhe: 2.125 ± 0.5
3.935IleGly: 3.935 ± 0.793
1.18IleHis: 1.18 ± 0.275
4.013IleIle: 4.013 ± 0.467
6.61IleLys: 6.61 ± 0.781
5.115IleLeu: 5.115 ± 0.608
1.495IleMet: 1.495 ± 0.305
4.958IleAsn: 4.958 ± 0.664
1.81IlePro: 1.81 ± 0.393
2.518IleGln: 2.518 ± 0.369
2.912IleArg: 2.912 ± 0.477
5.036IleSer: 5.036 ± 0.976
5.508IleThr: 5.508 ± 0.505
3.226IleVal: 3.226 ± 0.499
0.315IleTrp: 0.315 ± 0.137
2.675IleTyr: 2.675 ± 0.46
0.0IleXaa: 0.0 ± 0.0
Lys
6.374LysAla: 6.374 ± 0.613
0.079LysCys: 0.079 ± 0.071
4.8LysAsp: 4.8 ± 0.586
6.689LysGlu: 6.689 ± 0.904
3.384LysPhe: 3.384 ± 0.491
4.485LysGly: 4.485 ± 0.645
1.338LysHis: 1.338 ± 0.307
6.531LysIle: 6.531 ± 0.784
8.577LysLys: 8.577 ± 1.271
7.869LysLeu: 7.869 ± 0.938
2.833LysMet: 2.833 ± 0.394
5.823LysAsn: 5.823 ± 0.676
2.361LysPro: 2.361 ± 0.46
4.564LysGln: 4.564 ± 0.517
4.407LysArg: 4.407 ± 0.701
5.194LysSer: 5.194 ± 0.601
5.98LysThr: 5.98 ± 0.545
6.295LysVal: 6.295 ± 0.814
0.63LysTrp: 0.63 ± 0.21
3.226LysTyr: 3.226 ± 0.539
0.0LysXaa: 0.0 ± 0.0
Leu
6.217LeuAla: 6.217 ± 0.844
0.079LeuCys: 0.079 ± 0.092
6.846LeuAsp: 6.846 ± 0.672
9.128LeuGlu: 9.128 ± 0.84
3.148LeuPhe: 3.148 ± 0.43
5.43LeuGly: 5.43 ± 0.862
1.102LeuHis: 1.102 ± 0.296
5.43LeuIle: 5.43 ± 0.761
9.128LeuLys: 9.128 ± 0.964
6.846LeuLeu: 6.846 ± 0.937
1.653LeuMet: 1.653 ± 0.368
4.407LeuAsn: 4.407 ± 0.646
1.889LeuPro: 1.889 ± 0.286
2.912LeuGln: 2.912 ± 0.421
3.462LeuArg: 3.462 ± 0.635
5.823LeuSer: 5.823 ± 0.628
5.272LeuThr: 5.272 ± 0.529
4.879LeuVal: 4.879 ± 0.482
0.472LeuTrp: 0.472 ± 0.192
1.967LeuTyr: 1.967 ± 0.428
0.0LeuXaa: 0.0 ± 0.0
Met
3.148MetAla: 3.148 ± 0.627
0.0MetCys: 0.0 ± 0.0
1.023MetAsp: 1.023 ± 0.244
1.81MetGlu: 1.81 ± 0.478
1.416MetPhe: 1.416 ± 0.364
0.944MetGly: 0.944 ± 0.294
0.236MetHis: 0.236 ± 0.145
1.967MetIle: 1.967 ± 0.373
1.416MetLys: 1.416 ± 0.414
2.361MetLeu: 2.361 ± 0.452
0.551MetMet: 0.551 ± 0.214
1.023MetAsn: 1.023 ± 0.256
0.708MetPro: 0.708 ± 0.217
1.495MetGln: 1.495 ± 0.309
1.18MetArg: 1.18 ± 0.32
1.259MetSer: 1.259 ± 0.281
2.597MetThr: 2.597 ± 0.446
0.866MetVal: 0.866 ± 0.209
0.236MetTrp: 0.236 ± 0.139
0.708MetTyr: 0.708 ± 0.208
0.0MetXaa: 0.0 ± 0.0
Asn
4.013AsnAla: 4.013 ± 0.641
0.315AsnCys: 0.315 ± 0.138
2.439AsnAsp: 2.439 ± 0.466
4.013AsnGlu: 4.013 ± 0.731
1.967AsnPhe: 1.967 ± 0.403
4.407AsnGly: 4.407 ± 0.839
1.023AsnHis: 1.023 ± 0.288
3.935AsnIle: 3.935 ± 0.703
4.643AsnLys: 4.643 ± 0.583
5.902AsnLeu: 5.902 ± 0.506
1.259AsnMet: 1.259 ± 0.351
2.597AsnAsn: 2.597 ± 0.537
1.731AsnPro: 1.731 ± 0.387
2.99AsnGln: 2.99 ± 0.529
1.731AsnArg: 1.731 ± 0.42
2.912AsnSer: 2.912 ± 0.555
2.597AsnThr: 2.597 ± 0.441
2.439AsnVal: 2.439 ± 0.406
0.866AsnTrp: 0.866 ± 0.231
2.203AsnTyr: 2.203 ± 0.499
0.0AsnXaa: 0.0 ± 0.0
Pro
1.653ProAla: 1.653 ± 0.388
0.157ProCys: 0.157 ± 0.118
1.889ProAsp: 1.889 ± 0.428
1.889ProGlu: 1.889 ± 0.409
0.944ProPhe: 0.944 ± 0.246
0.551ProGly: 0.551 ± 0.182
0.393ProHis: 0.393 ± 0.161
2.125ProIle: 2.125 ± 0.375
2.125ProLys: 2.125 ± 0.444
1.338ProLeu: 1.338 ± 0.399
0.393ProMet: 0.393 ± 0.159
1.023ProAsn: 1.023 ± 0.26
0.787ProPro: 0.787 ± 0.224
1.338ProGln: 1.338 ± 0.353
1.259ProArg: 1.259 ± 0.386
1.889ProSer: 1.889 ± 0.295
1.653ProThr: 1.653 ± 0.349
1.495ProVal: 1.495 ± 0.329
0.157ProTrp: 0.157 ± 0.119
0.866ProTyr: 0.866 ± 0.318
0.0ProXaa: 0.0 ± 0.0
Gln
3.777GlnAla: 3.777 ± 0.852
0.315GlnCys: 0.315 ± 0.169
2.361GlnAsp: 2.361 ± 0.356
3.305GlnGlu: 3.305 ± 0.588
1.338GlnPhe: 1.338 ± 0.369
2.754GlnGly: 2.754 ± 0.437
0.393GlnHis: 0.393 ± 0.162
3.541GlnIle: 3.541 ± 0.554
2.99GlnLys: 2.99 ± 0.439
3.935GlnLeu: 3.935 ± 0.644
1.338GlnMet: 1.338 ± 0.375
2.439GlnAsn: 2.439 ± 0.55
1.18GlnPro: 1.18 ± 0.351
2.282GlnGln: 2.282 ± 0.521
1.495GlnArg: 1.495 ± 0.31
4.485GlnSer: 4.485 ± 0.706
2.439GlnThr: 2.439 ± 0.555
2.046GlnVal: 2.046 ± 0.356
0.079GlnTrp: 0.079 ± 0.079
1.102GlnTyr: 1.102 ± 0.283
0.0GlnXaa: 0.0 ± 0.0
Arg
2.99ArgAla: 2.99 ± 0.45
0.157ArgCys: 0.157 ± 0.098
2.046ArgAsp: 2.046 ± 0.407
2.912ArgGlu: 2.912 ± 0.586
1.259ArgPhe: 1.259 ± 0.334
2.754ArgGly: 2.754 ± 0.393
0.393ArgHis: 0.393 ± 0.151
3.069ArgIle: 3.069 ± 0.523
3.935ArgLys: 3.935 ± 0.677
4.958ArgLeu: 4.958 ± 0.601
1.102ArgMet: 1.102 ± 0.262
1.889ArgAsn: 1.889 ± 0.381
0.708ArgPro: 0.708 ± 0.231
1.731ArgGln: 1.731 ± 0.329
1.81ArgArg: 1.81 ± 0.425
2.597ArgSer: 2.597 ± 0.469
1.574ArgThr: 1.574 ± 0.318
2.675ArgVal: 2.675 ± 0.436
0.393ArgTrp: 0.393 ± 0.183
1.731ArgTyr: 1.731 ± 0.443
0.0ArgXaa: 0.0 ± 0.0
Ser
5.194SerAla: 5.194 ± 1.479
0.315SerCys: 0.315 ± 0.162
3.305SerAsp: 3.305 ± 0.476
4.564SerGlu: 4.564 ± 0.62
2.754SerPhe: 2.754 ± 0.375
4.171SerGly: 4.171 ± 0.873
0.866SerHis: 0.866 ± 0.284
4.643SerIle: 4.643 ± 0.566
4.721SerLys: 4.721 ± 0.619
5.036SerLeu: 5.036 ± 0.793
2.046SerMet: 2.046 ± 0.571
3.935SerAsn: 3.935 ± 0.654
1.495SerPro: 1.495 ± 0.3
3.62SerGln: 3.62 ± 0.78
2.361SerArg: 2.361 ± 0.437
5.272SerSer: 5.272 ± 1.088
3.541SerThr: 3.541 ± 0.598
4.958SerVal: 4.958 ± 0.553
0.393SerTrp: 0.393 ± 0.161
2.833SerTyr: 2.833 ± 0.496
0.0SerXaa: 0.0 ± 0.0
Thr
4.643ThrAla: 4.643 ± 0.795
0.236ThrCys: 0.236 ± 0.173
4.171ThrAsp: 4.171 ± 0.556
4.8ThrGlu: 4.8 ± 0.686
2.754ThrPhe: 2.754 ± 0.576
4.958ThrGly: 4.958 ± 0.72
0.63ThrHis: 0.63 ± 0.218
3.62ThrIle: 3.62 ± 0.461
5.351ThrLys: 5.351 ± 0.86
5.351ThrLeu: 5.351 ± 0.754
0.944ThrMet: 0.944 ± 0.265
2.675ThrAsn: 2.675 ± 0.417
1.967ThrPro: 1.967 ± 0.389
2.518ThrGln: 2.518 ± 0.328
2.518ThrArg: 2.518 ± 0.391
4.328ThrSer: 4.328 ± 0.468
4.643ThrThr: 4.643 ± 0.761
4.328ThrVal: 4.328 ± 0.651
0.787ThrTrp: 0.787 ± 0.337
1.889ThrTyr: 1.889 ± 0.37
0.0ThrXaa: 0.0 ± 0.0
Val
4.564ValAla: 4.564 ± 1.04
0.236ValCys: 0.236 ± 0.118
4.092ValAsp: 4.092 ± 0.547
5.272ValGlu: 5.272 ± 0.83
2.675ValPhe: 2.675 ± 0.445
3.777ValGly: 3.777 ± 0.619
0.708ValHis: 0.708 ± 0.235
3.226ValIle: 3.226 ± 0.528
6.059ValLys: 6.059 ± 0.677
4.328ValLeu: 4.328 ± 0.674
1.102ValMet: 1.102 ± 0.287
2.99ValAsn: 2.99 ± 0.393
1.574ValPro: 1.574 ± 0.384
2.754ValGln: 2.754 ± 0.401
1.889ValArg: 1.889 ± 0.44
3.935ValSer: 3.935 ± 0.627
4.721ValThr: 4.721 ± 0.619
4.407ValVal: 4.407 ± 0.582
0.63ValTrp: 0.63 ± 0.204
1.653ValTyr: 1.653 ± 0.381
0.0ValXaa: 0.0 ± 0.0
Trp
0.63TrpAla: 0.63 ± 0.19
0.0TrpCys: 0.0 ± 0.0
0.551TrpAsp: 0.551 ± 0.183
0.472TrpGlu: 0.472 ± 0.2
0.393TrpPhe: 0.393 ± 0.18
0.787TrpGly: 0.787 ± 0.278
0.079TrpHis: 0.079 ± 0.087
0.551TrpIle: 0.551 ± 0.277
1.18TrpLys: 1.18 ± 0.286
1.18TrpLeu: 1.18 ± 0.4
0.315TrpMet: 0.315 ± 0.174
0.551TrpAsn: 0.551 ± 0.242
0.079TrpPro: 0.079 ± 0.072
0.315TrpGln: 0.315 ± 0.152
0.472TrpArg: 0.472 ± 0.236
0.63TrpSer: 0.63 ± 0.23
0.236TrpThr: 0.236 ± 0.173
0.157TrpVal: 0.157 ± 0.106
0.236TrpTrp: 0.236 ± 0.12
0.393TrpTyr: 0.393 ± 0.148
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.203TyrAla: 2.203 ± 0.501
0.63TyrCys: 0.63 ± 0.246
3.541TyrAsp: 3.541 ± 0.57
2.361TyrGlu: 2.361 ± 0.439
1.416TyrPhe: 1.416 ± 0.411
1.259TyrGly: 1.259 ± 0.308
0.708TyrHis: 0.708 ± 0.375
2.597TyrIle: 2.597 ± 0.39
2.99TyrLys: 2.99 ± 0.662
3.226TyrLeu: 3.226 ± 0.701
0.866TyrMet: 0.866 ± 0.236
2.282TyrAsn: 2.282 ± 0.435
0.866TyrPro: 0.866 ± 0.285
1.81TyrGln: 1.81 ± 0.467
1.259TyrArg: 1.259 ± 0.441
1.889TyrSer: 1.889 ± 0.473
2.125TyrThr: 2.125 ± 0.389
1.259TyrVal: 1.259 ± 0.3
0.315TyrTrp: 0.315 ± 0.152
1.889TyrTyr: 1.889 ± 0.458
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (12709 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski