Amino acid dipepetide frequency for Streptococcus phage Javan115

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.931AlaAla: 5.931 ± 1.69
0.698AlaCys: 0.698 ± 0.303
5.233AlaAsp: 5.233 ± 0.779
6.105AlaGlu: 6.105 ± 0.796
3.314AlaPhe: 3.314 ± 1.151
6.192AlaGly: 6.192 ± 1.089
0.959AlaHis: 0.959 ± 0.338
6.89AlaIle: 6.89 ± 1.024
5.582AlaLys: 5.582 ± 0.635
7.413AlaLeu: 7.413 ± 1.006
2.18AlaMet: 2.18 ± 0.529
5.058AlaAsn: 5.058 ± 0.73
2.529AlaPro: 2.529 ± 0.479
2.704AlaGln: 2.704 ± 0.727
2.616AlaArg: 2.616 ± 0.457
4.361AlaSer: 4.361 ± 1.229
4.448AlaThr: 4.448 ± 0.782
5.756AlaVal: 5.756 ± 1.326
1.047AlaTrp: 1.047 ± 0.298
2.616AlaTyr: 2.616 ± 0.576
0.0AlaXaa: 0.0 ± 0.0
Cys
0.262CysAla: 0.262 ± 0.15
0.174CysCys: 0.174 ± 0.129
0.349CysAsp: 0.349 ± 0.199
0.436CysGlu: 0.436 ± 0.176
0.262CysPhe: 0.262 ± 0.163
0.436CysGly: 0.436 ± 0.227
0.262CysHis: 0.262 ± 0.185
0.087CysIle: 0.087 ± 0.091
0.436CysLys: 0.436 ± 0.197
0.349CysLeu: 0.349 ± 0.154
0.174CysMet: 0.174 ± 0.132
0.174CysAsn: 0.174 ± 0.154
0.087CysPro: 0.087 ± 0.082
0.262CysGln: 0.262 ± 0.163
0.174CysArg: 0.174 ± 0.138
0.349CysSer: 0.349 ± 0.19
0.087CysThr: 0.087 ± 0.093
0.436CysVal: 0.436 ± 0.177
0.087CysTrp: 0.087 ± 0.092
0.087CysTyr: 0.087 ± 0.077
0.0CysXaa: 0.0 ± 0.0
Asp
4.71AspAla: 4.71 ± 0.777
0.262AspCys: 0.262 ± 0.17
4.186AspAsp: 4.186 ± 0.679
4.622AspGlu: 4.622 ± 0.984
3.663AspPhe: 3.663 ± 0.517
6.279AspGly: 6.279 ± 1.07
0.785AspHis: 0.785 ± 0.271
4.535AspIle: 4.535 ± 0.752
5.931AspLys: 5.931 ± 0.959
4.361AspLeu: 4.361 ± 0.675
1.57AspMet: 1.57 ± 0.291
4.099AspAsn: 4.099 ± 0.503
2.093AspPro: 2.093 ± 0.491
1.919AspGln: 1.919 ± 0.421
2.006AspArg: 2.006 ± 0.396
4.012AspSer: 4.012 ± 0.648
4.622AspThr: 4.622 ± 0.645
3.314AspVal: 3.314 ± 0.58
0.262AspTrp: 0.262 ± 0.166
3.489AspTyr: 3.489 ± 0.686
0.0AspXaa: 0.0 ± 0.0
Glu
4.884GluAla: 4.884 ± 0.748
0.174GluCys: 0.174 ± 0.133
3.925GluAsp: 3.925 ± 0.609
4.535GluGlu: 4.535 ± 0.779
2.616GluPhe: 2.616 ± 0.593
2.18GluGly: 2.18 ± 0.273
0.698GluHis: 0.698 ± 0.25
5.32GluIle: 5.32 ± 0.861
6.454GluLys: 6.454 ± 1.356
7.064GluLeu: 7.064 ± 1.289
1.657GluMet: 1.657 ± 0.38
4.099GluAsn: 4.099 ± 0.768
1.483GluPro: 1.483 ± 0.473
3.227GluGln: 3.227 ± 0.689
4.186GluArg: 4.186 ± 0.61
2.791GluSer: 2.791 ± 0.466
3.489GluThr: 3.489 ± 0.488
3.489GluVal: 3.489 ± 0.667
0.785GluTrp: 0.785 ± 0.227
2.704GluTyr: 2.704 ± 0.611
0.0GluXaa: 0.0 ± 0.0
Phe
2.704PheAla: 2.704 ± 0.502
0.174PheCys: 0.174 ± 0.127
4.274PheAsp: 4.274 ± 0.813
3.314PheGlu: 3.314 ± 0.771
1.134PhePhe: 1.134 ± 0.442
2.616PheGly: 2.616 ± 0.666
0.523PheHis: 0.523 ± 0.237
2.093PheIle: 2.093 ± 0.404
3.314PheLys: 3.314 ± 0.608
2.878PheLeu: 2.878 ± 0.716
0.785PheMet: 0.785 ± 0.238
2.355PheAsn: 2.355 ± 0.414
0.698PhePro: 0.698 ± 0.32
1.047PheGln: 1.047 ± 0.383
1.047PheArg: 1.047 ± 0.315
2.878PheSer: 2.878 ± 0.49
2.704PheThr: 2.704 ± 0.427
3.14PheVal: 3.14 ± 0.628
0.611PheTrp: 0.611 ± 0.268
1.395PheTyr: 1.395 ± 0.409
0.0PheXaa: 0.0 ± 0.0
Gly
5.756GlyAla: 5.756 ± 1.227
0.087GlyCys: 0.087 ± 0.086
3.75GlyAsp: 3.75 ± 0.622
3.227GlyGlu: 3.227 ± 0.587
2.18GlyPhe: 2.18 ± 0.406
3.489GlyGly: 3.489 ± 0.553
1.134GlyHis: 1.134 ± 0.313
5.233GlyIle: 5.233 ± 0.693
5.582GlyLys: 5.582 ± 0.844
5.495GlyLeu: 5.495 ± 0.657
2.006GlyMet: 2.006 ± 0.587
4.274GlyAsn: 4.274 ± 0.684
0.698GlyPro: 0.698 ± 0.38
3.75GlyGln: 3.75 ± 0.627
3.053GlyArg: 3.053 ± 0.627
3.489GlySer: 3.489 ± 0.523
4.274GlyThr: 4.274 ± 0.795
5.233GlyVal: 5.233 ± 0.766
0.785GlyTrp: 0.785 ± 0.364
1.744GlyTyr: 1.744 ± 0.499
0.0GlyXaa: 0.0 ± 0.0
His
0.785HisAla: 0.785 ± 0.311
0.087HisCys: 0.087 ± 0.086
0.785HisAsp: 0.785 ± 0.259
0.872HisGlu: 0.872 ± 0.241
0.959HisPhe: 0.959 ± 0.295
0.436HisGly: 0.436 ± 0.189
0.349HisHis: 0.349 ± 0.244
1.395HisIle: 1.395 ± 0.426
1.57HisLys: 1.57 ± 0.41
1.134HisLeu: 1.134 ± 0.286
0.262HisMet: 0.262 ± 0.162
0.959HisAsn: 0.959 ± 0.384
0.611HisPro: 0.611 ± 0.297
0.436HisGln: 0.436 ± 0.175
0.872HisArg: 0.872 ± 0.268
1.047HisSer: 1.047 ± 0.258
0.611HisThr: 0.611 ± 0.274
0.523HisVal: 0.523 ± 0.189
0.174HisTrp: 0.174 ± 0.123
0.698HisTyr: 0.698 ± 0.257
0.0HisXaa: 0.0 ± 0.0
Ile
5.495IleAla: 5.495 ± 0.785
0.087IleCys: 0.087 ± 0.092
5.146IleAsp: 5.146 ± 0.663
5.146IleGlu: 5.146 ± 0.915
1.919IlePhe: 1.919 ± 0.444
5.407IleGly: 5.407 ± 0.809
0.872IleHis: 0.872 ± 0.227
3.663IleIle: 3.663 ± 0.746
5.146IleLys: 5.146 ± 0.782
5.058IleLeu: 5.058 ± 0.839
1.657IleMet: 1.657 ± 0.377
3.837IleAsn: 3.837 ± 0.472
2.878IlePro: 2.878 ± 0.755
2.529IleGln: 2.529 ± 0.502
2.878IleArg: 2.878 ± 0.427
5.32IleSer: 5.32 ± 0.66
4.448IleThr: 4.448 ± 0.742
4.186IleVal: 4.186 ± 0.56
0.698IleTrp: 0.698 ± 0.245
2.442IleTyr: 2.442 ± 0.388
0.0IleXaa: 0.0 ± 0.0
Lys
7.152LysAla: 7.152 ± 0.921
0.523LysCys: 0.523 ± 0.296
3.663LysAsp: 3.663 ± 0.712
5.495LysGlu: 5.495 ± 1.059
2.965LysPhe: 2.965 ± 0.575
4.71LysGly: 4.71 ± 1.292
0.959LysHis: 0.959 ± 0.348
4.884LysIle: 4.884 ± 0.685
6.716LysLys: 6.716 ± 1.309
6.628LysLeu: 6.628 ± 0.931
1.832LysMet: 1.832 ± 0.465
3.663LysAsn: 3.663 ± 0.567
2.965LysPro: 2.965 ± 0.587
3.576LysGln: 3.576 ± 0.875
3.401LysArg: 3.401 ± 0.717
5.931LysSer: 5.931 ± 0.94
4.797LysThr: 4.797 ± 0.825
5.233LysVal: 5.233 ± 0.674
1.395LysTrp: 1.395 ± 0.408
2.878LysTyr: 2.878 ± 0.643
0.0LysXaa: 0.0 ± 0.0
Leu
6.977LeuAla: 6.977 ± 1.035
0.872LeuCys: 0.872 ± 0.31
5.931LeuAsp: 5.931 ± 0.975
5.582LeuGlu: 5.582 ± 0.747
2.529LeuPhe: 2.529 ± 0.398
4.71LeuGly: 4.71 ± 0.868
0.872LeuHis: 0.872 ± 0.278
3.576LeuIle: 3.576 ± 0.66
7.849LeuLys: 7.849 ± 1.023
4.448LeuLeu: 4.448 ± 0.642
1.221LeuMet: 1.221 ± 0.366
4.274LeuAsn: 4.274 ± 0.615
2.965LeuPro: 2.965 ± 0.571
4.099LeuGln: 4.099 ± 0.6
3.576LeuArg: 3.576 ± 0.76
6.018LeuSer: 6.018 ± 1.004
4.971LeuThr: 4.971 ± 0.683
5.756LeuVal: 5.756 ± 0.604
0.698LeuTrp: 0.698 ± 0.248
2.878LeuTyr: 2.878 ± 0.604
0.0LeuXaa: 0.0 ± 0.0
Met
1.657MetAla: 1.657 ± 0.615
0.087MetCys: 0.087 ± 0.082
1.57MetAsp: 1.57 ± 0.396
0.872MetGlu: 0.872 ± 0.342
0.523MetPhe: 0.523 ± 0.212
0.959MetGly: 0.959 ± 0.404
0.611MetHis: 0.611 ± 0.294
1.308MetIle: 1.308 ± 0.292
2.355MetLys: 2.355 ± 0.468
1.395MetLeu: 1.395 ± 0.346
0.436MetMet: 0.436 ± 0.17
0.872MetAsn: 0.872 ± 0.207
1.221MetPro: 1.221 ± 0.415
2.093MetGln: 2.093 ± 0.409
1.221MetArg: 1.221 ± 0.332
1.308MetSer: 1.308 ± 0.31
1.657MetThr: 1.657 ± 0.353
1.483MetVal: 1.483 ± 0.419
0.262MetTrp: 0.262 ± 0.153
1.047MetTyr: 1.047 ± 0.325
0.0MetXaa: 0.0 ± 0.0
Asn
5.146AsnAla: 5.146 ± 0.734
0.174AsnCys: 0.174 ± 0.165
2.878AsnAsp: 2.878 ± 0.705
3.14AsnGlu: 3.14 ± 0.484
2.268AsnPhe: 2.268 ± 0.42
3.663AsnGly: 3.663 ± 0.71
0.959AsnHis: 0.959 ± 0.316
3.401AsnIle: 3.401 ± 0.501
3.925AsnLys: 3.925 ± 0.678
5.582AsnLeu: 5.582 ± 0.786
1.221AsnMet: 1.221 ± 0.288
3.663AsnAsn: 3.663 ± 0.815
2.268AsnPro: 2.268 ± 0.609
2.006AsnGln: 2.006 ± 0.59
1.221AsnArg: 1.221 ± 0.278
3.227AsnSer: 3.227 ± 0.523
3.489AsnThr: 3.489 ± 0.56
4.099AsnVal: 4.099 ± 0.585
0.523AsnTrp: 0.523 ± 0.198
1.832AsnTyr: 1.832 ± 0.506
0.0AsnXaa: 0.0 ± 0.0
Pro
2.616ProAla: 2.616 ± 0.482
0.0ProCys: 0.0 ± 0.0
2.616ProAsp: 2.616 ± 0.603
2.093ProGlu: 2.093 ± 0.465
2.006ProPhe: 2.006 ± 0.409
1.395ProGly: 1.395 ± 0.493
0.611ProHis: 0.611 ± 0.282
1.832ProIle: 1.832 ± 0.355
2.442ProLys: 2.442 ± 0.447
2.093ProLeu: 2.093 ± 0.423
0.698ProMet: 0.698 ± 0.361
1.308ProAsn: 1.308 ± 0.391
0.436ProPro: 0.436 ± 0.164
1.134ProGln: 1.134 ± 0.328
0.785ProArg: 0.785 ± 0.3
1.395ProSer: 1.395 ± 0.325
2.18ProThr: 2.18 ± 0.499
2.268ProVal: 2.268 ± 0.409
0.087ProTrp: 0.087 ± 0.079
1.395ProTyr: 1.395 ± 0.389
0.0ProXaa: 0.0 ± 0.0
Gln
4.274GlnAla: 4.274 ± 1.094
0.174GlnCys: 0.174 ± 0.132
2.878GlnAsp: 2.878 ± 0.651
3.314GlnGlu: 3.314 ± 0.712
0.611GlnPhe: 0.611 ± 0.221
3.75GlnGly: 3.75 ± 0.897
0.698GlnHis: 0.698 ± 0.228
2.965GlnIle: 2.965 ± 0.591
2.704GlnLys: 2.704 ± 0.53
3.314GlnLeu: 3.314 ± 0.485
1.483GlnMet: 1.483 ± 0.342
2.18GlnAsn: 2.18 ± 0.589
0.959GlnPro: 0.959 ± 0.33
2.006GlnGln: 2.006 ± 0.833
2.006GlnArg: 2.006 ± 0.403
3.663GlnSer: 3.663 ± 0.629
2.355GlnThr: 2.355 ± 0.515
2.18GlnVal: 2.18 ± 0.446
0.349GlnTrp: 0.349 ± 0.173
1.657GlnTyr: 1.657 ± 0.42
0.0GlnXaa: 0.0 ± 0.0
Arg
2.268ArgAla: 2.268 ± 0.581
0.262ArgCys: 0.262 ± 0.137
2.093ArgAsp: 2.093 ± 0.469
1.483ArgGlu: 1.483 ± 0.381
1.832ArgPhe: 1.832 ± 0.366
2.18ArgGly: 2.18 ± 0.606
0.785ArgHis: 0.785 ± 0.359
2.355ArgIle: 2.355 ± 0.41
3.75ArgLys: 3.75 ± 0.59
2.965ArgLeu: 2.965 ± 0.548
1.134ArgMet: 1.134 ± 0.322
2.442ArgAsn: 2.442 ± 0.583
1.047ArgPro: 1.047 ± 0.265
1.657ArgGln: 1.657 ± 0.329
1.657ArgArg: 1.657 ± 0.345
2.006ArgSer: 2.006 ± 0.315
2.878ArgThr: 2.878 ± 0.651
2.355ArgVal: 2.355 ± 0.527
0.959ArgTrp: 0.959 ± 0.423
2.442ArgTyr: 2.442 ± 0.503
0.0ArgXaa: 0.0 ± 0.0
Ser
5.843SerAla: 5.843 ± 1.204
0.087SerCys: 0.087 ± 0.089
4.71SerAsp: 4.71 ± 0.73
4.186SerGlu: 4.186 ± 0.673
2.268SerPhe: 2.268 ± 0.411
4.012SerGly: 4.012 ± 0.791
0.611SerHis: 0.611 ± 0.281
4.71SerIle: 4.71 ± 0.691
3.314SerLys: 3.314 ± 0.543
6.367SerLeu: 6.367 ± 0.964
1.134SerMet: 1.134 ± 0.34
3.401SerAsn: 3.401 ± 0.544
1.744SerPro: 1.744 ± 0.387
3.227SerGln: 3.227 ± 0.605
1.57SerArg: 1.57 ± 0.397
4.884SerSer: 4.884 ± 0.821
3.925SerThr: 3.925 ± 0.734
3.925SerVal: 3.925 ± 0.718
0.785SerTrp: 0.785 ± 0.23
2.791SerTyr: 2.791 ± 0.562
0.0SerXaa: 0.0 ± 0.0
Thr
4.884ThrAla: 4.884 ± 0.675
0.262ThrCys: 0.262 ± 0.155
4.274ThrAsp: 4.274 ± 0.839
3.489ThrGlu: 3.489 ± 0.528
3.227ThrPhe: 3.227 ± 0.589
5.233ThrGly: 5.233 ± 0.783
1.308ThrHis: 1.308 ± 0.384
6.105ThrIle: 6.105 ± 0.83
4.535ThrLys: 4.535 ± 0.572
4.012ThrLeu: 4.012 ± 0.621
1.483ThrMet: 1.483 ± 0.382
2.442ThrAsn: 2.442 ± 0.356
1.832ThrPro: 1.832 ± 0.403
3.14ThrGln: 3.14 ± 0.575
1.657ThrArg: 1.657 ± 0.398
2.529ThrSer: 2.529 ± 0.554
4.274ThrThr: 4.274 ± 0.852
5.407ThrVal: 5.407 ± 0.685
0.611ThrTrp: 0.611 ± 0.238
3.227ThrTyr: 3.227 ± 0.64
0.0ThrXaa: 0.0 ± 0.0
Val
6.541ValAla: 6.541 ± 1.13
0.349ValCys: 0.349 ± 0.179
5.146ValAsp: 5.146 ± 0.788
5.058ValGlu: 5.058 ± 0.721
3.227ValPhe: 3.227 ± 0.481
4.186ValGly: 4.186 ± 0.95
0.698ValHis: 0.698 ± 0.32
4.448ValIle: 4.448 ± 0.548
4.361ValLys: 4.361 ± 0.508
4.448ValLeu: 4.448 ± 0.568
1.047ValMet: 1.047 ± 0.322
2.704ValAsn: 2.704 ± 0.457
1.483ValPro: 1.483 ± 0.369
1.57ValGln: 1.57 ± 0.422
2.093ValArg: 2.093 ± 0.425
5.233ValSer: 5.233 ± 0.654
5.669ValThr: 5.669 ± 0.74
4.186ValVal: 4.186 ± 0.606
0.785ValTrp: 0.785 ± 0.316
2.616ValTyr: 2.616 ± 0.517
0.0ValXaa: 0.0 ± 0.0
Trp
0.785TrpAla: 0.785 ± 0.28
0.174TrpCys: 0.174 ± 0.134
0.436TrpAsp: 0.436 ± 0.255
0.611TrpGlu: 0.611 ± 0.204
0.611TrpPhe: 0.611 ± 0.308
0.872TrpGly: 0.872 ± 0.364
0.262TrpHis: 0.262 ± 0.142
0.523TrpIle: 0.523 ± 0.212
0.349TrpLys: 0.349 ± 0.147
1.134TrpLeu: 1.134 ± 0.264
0.174TrpMet: 0.174 ± 0.109
0.698TrpAsn: 0.698 ± 0.234
0.0TrpPro: 0.0 ± 0.0
0.523TrpGln: 0.523 ± 0.243
0.611TrpArg: 0.611 ± 0.214
1.047TrpSer: 1.047 ± 0.349
0.436TrpThr: 0.436 ± 0.225
1.047TrpVal: 1.047 ± 0.26
0.174TrpTrp: 0.174 ± 0.183
0.698TrpTyr: 0.698 ± 0.288
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.965TyrAla: 2.965 ± 0.662
0.262TyrCys: 0.262 ± 0.185
2.878TyrAsp: 2.878 ± 0.62
2.18TyrGlu: 2.18 ± 0.522
1.57TyrPhe: 1.57 ± 0.437
2.529TyrGly: 2.529 ± 0.616
0.611TyrHis: 0.611 ± 0.344
3.489TyrIle: 3.489 ± 0.618
3.053TyrLys: 3.053 ± 0.544
3.663TyrLeu: 3.663 ± 0.613
0.698TyrMet: 0.698 ± 0.199
2.18TyrAsn: 2.18 ± 0.462
1.395TyrPro: 1.395 ± 0.339
2.704TyrGln: 2.704 ± 0.42
1.919TyrArg: 1.919 ± 0.475
2.006TyrSer: 2.006 ± 0.398
2.616TyrThr: 2.616 ± 0.656
1.832TyrVal: 1.832 ± 0.467
0.087TyrTrp: 0.087 ± 0.082
1.308TyrTyr: 1.308 ± 0.41
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (11467 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski