Amino acid dipeptide frequency for Streptococcus phage Javan330

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
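The calculation described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the database's own code: the function names, the mapping of every non-standard letter to X (Xaa), and the normalization by the total number of dipeptides are assumptions, and the database may use a different denominator.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation from the text: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of dipeptides counted.
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille):
    """Label a per mille value relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"
```

With this sketch, classify(3.82) labels the AlaAla value from the table as overrepresented and classify(0.477) labels AlaCys as underrepresented. Multiplying a per mille value by 10^-3 gives the corresponding fraction.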

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.82 ± 0.951
AlaCys: 0.477 ± 0.193
AlaAsp: 4.488 ± 0.699
AlaGlu: 4.966 ± 0.753
AlaPhe: 2.483 ± 0.378
AlaGly: 4.202 ± 0.903
AlaHis: 0.859 ± 0.274
AlaIle: 5.157 ± 0.894
AlaLys: 6.684 ± 0.597
AlaLeu: 5.73 ± 0.683
AlaMet: 1.528 ± 0.305
AlaAsn: 3.342 ± 0.482
AlaPro: 1.719 ± 0.492
AlaGln: 2.483 ± 0.512
AlaArg: 3.342 ± 0.505
AlaSer: 4.584 ± 0.879
AlaThr: 3.724 ± 0.536
AlaVal: 4.488 ± 0.85
AlaTrp: 2.005 ± 0.681
AlaTyr: 2.769 ± 0.379
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.095 ± 0.093
CysCys: 0.191 ± 0.129
CysAsp: 0.191 ± 0.117
CysGlu: 0.573 ± 0.272
CysPhe: 0.382 ± 0.17
CysGly: 0.286 ± 0.145
CysHis: 0.286 ± 0.165
CysIle: 0.286 ± 0.151
CysLys: 0.573 ± 0.213
CysLeu: 0.668 ± 0.258
CysMet: 0.0 ± 0.0
CysAsn: 0.382 ± 0.223
CysPro: 0.382 ± 0.232
CysGln: 0.286 ± 0.141
CysArg: 0.286 ± 0.163
CysSer: 0.0 ± 0.0
CysThr: 0.191 ± 0.118
CysVal: 0.382 ± 0.205
CysTrp: 0.095 ± 0.089
CysTyr: 0.286 ± 0.155
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.915 ± 0.668
AspCys: 0.573 ± 0.206
AspAsp: 3.247 ± 0.58
AspGlu: 4.011 ± 0.647
AspPhe: 3.342 ± 0.505
AspGly: 5.348 ± 0.722
AspHis: 0.668 ± 0.204
AspIle: 5.157 ± 0.783
AspLys: 3.915 ± 0.528
AspLeu: 5.157 ± 0.74
AspMet: 2.005 ± 0.344
AspAsn: 2.769 ± 0.485
AspPro: 2.005 ± 0.606
AspGln: 0.955 ± 0.309
AspArg: 2.865 ± 0.454
AspSer: 2.865 ± 0.486
AspThr: 2.96 ± 0.591
AspVal: 4.106 ± 0.77
AspTrp: 0.573 ± 0.194
AspTyr: 3.247 ± 0.581
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.157 ± 0.673
GluCys: 0.286 ± 0.161
GluAsp: 3.533 ± 0.663
GluGlu: 5.061 ± 0.858
GluPhe: 2.96 ± 0.522
GluGly: 3.724 ± 0.586
GluHis: 1.05 ± 0.311
GluIle: 6.398 ± 0.725
GluLys: 6.112 ± 0.838
GluLeu: 8.785 ± 1.095
GluMet: 1.814 ± 0.327
GluAsn: 3.629 ± 0.502
GluPro: 2.674 ± 0.562
GluGln: 4.297 ± 0.653
GluArg: 3.915 ± 0.789
GluSer: 4.87 ± 0.6
GluThr: 3.724 ± 0.486
GluVal: 4.775 ± 0.908
GluTrp: 1.146 ± 0.363
GluTyr: 2.483 ± 0.502
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.724 ± 0.506
PheCys: 0.095 ± 0.089
PheAsp: 3.438 ± 0.546
PheGlu: 3.342 ± 0.493
PhePhe: 1.623 ± 0.379
PheGly: 2.865 ± 0.563
PheHis: 0.573 ± 0.264
PheIle: 2.196 ± 0.397
PheLys: 3.629 ± 0.607
PheLeu: 2.96 ± 0.581
PheMet: 1.146 ± 0.295
PheAsn: 1.623 ± 0.416
PhePro: 1.05 ± 0.34
PheGln: 1.146 ± 0.354
PheArg: 1.528 ± 0.403
PheSer: 3.056 ± 0.6
PheThr: 2.292 ± 0.428
PheVal: 2.769 ± 0.54
PheTrp: 0.477 ± 0.177
PheTyr: 1.623 ± 0.362
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.775 ± 0.685
GlyCys: 0.191 ± 0.12
GlyAsp: 3.533 ± 0.584
GlyGlu: 5.348 ± 0.668
GlyPhe: 3.82 ± 0.441
GlyGly: 5.539 ± 0.763
GlyHis: 0.764 ± 0.293
GlyIle: 4.775 ± 1.127
GlyLys: 5.539 ± 0.626
GlyLeu: 5.252 ± 1.091
GlyMet: 2.005 ± 0.397
GlyAsn: 4.011 ± 0.688
GlyPro: 1.05 ± 0.639
GlyGln: 2.483 ± 0.591
GlyArg: 3.342 ± 0.541
GlySer: 3.629 ± 0.475
GlyThr: 3.438 ± 0.574
GlyVal: 3.82 ± 0.847
GlyTrp: 1.623 ± 0.488
GlyTyr: 4.202 ± 0.595
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.146 ± 0.284
HisCys: 0.286 ± 0.219
HisAsp: 1.05 ± 0.39
HisGlu: 0.955 ± 0.452
HisPhe: 0.859 ± 0.247
HisGly: 1.05 ± 0.305
HisHis: 0.382 ± 0.182
HisIle: 0.955 ± 0.308
HisLys: 1.146 ± 0.312
HisLeu: 1.241 ± 0.352
HisMet: 0.382 ± 0.159
HisAsn: 0.477 ± 0.22
HisPro: 0.191 ± 0.143
HisGln: 0.764 ± 0.264
HisArg: 0.573 ± 0.205
HisSer: 1.146 ± 0.323
HisThr: 0.764 ± 0.235
HisVal: 0.859 ± 0.32
HisTrp: 0.191 ± 0.113
HisTyr: 0.764 ± 0.291
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.679 ± 0.723
IleCys: 0.191 ± 0.136
IleAsp: 4.106 ± 0.492
IleGlu: 6.494 ± 0.79
IlePhe: 3.629 ± 0.59
IleGly: 4.679 ± 0.781
IleHis: 0.859 ± 0.238
IleIle: 4.011 ± 0.674
IleLys: 5.157 ± 0.711
IleLeu: 6.207 ± 0.894
IleMet: 1.05 ± 0.296
IleAsn: 2.674 ± 0.45
IlePro: 2.674 ± 0.488
IleGln: 2.387 ± 0.476
IleArg: 2.578 ± 0.426
IleSer: 5.73 ± 0.819
IleThr: 4.393 ± 0.567
IleVal: 4.584 ± 0.661
IleTrp: 0.955 ± 0.669
IleTyr: 1.91 ± 0.414
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.966 ± 0.585
LysCys: 0.286 ± 0.206
LysAsp: 4.584 ± 0.685
LysGlu: 6.78 ± 0.741
LysPhe: 1.91 ± 0.379
LysGly: 3.915 ± 0.46
LysHis: 0.859 ± 0.285
LysIle: 5.825 ± 0.672
LysLys: 6.971 ± 1.041
LysLeu: 6.112 ± 0.575
LysMet: 1.719 ± 0.4
LysAsn: 5.634 ± 0.635
LysPro: 2.483 ± 0.497
LysGln: 3.82 ± 0.54
LysArg: 3.724 ± 0.714
LysSer: 5.73 ± 0.79
LysThr: 5.252 ± 0.826
LysVal: 5.443 ± 0.953
LysTrp: 1.623 ± 0.431
LysTyr: 3.915 ± 0.564
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.303 ± 0.766
LeuCys: 0.286 ± 0.163
LeuAsp: 6.016 ± 0.644
LeuGlu: 6.398 ± 0.658
LeuPhe: 3.82 ± 0.614
LeuGly: 5.921 ± 1.319
LeuHis: 1.05 ± 0.249
LeuIle: 5.061 ± 0.663
LeuLys: 7.639 ± 0.841
LeuLeu: 5.252 ± 0.599
LeuMet: 1.623 ± 0.431
LeuAsn: 4.297 ± 0.597
LeuPro: 2.865 ± 0.487
LeuGln: 2.769 ± 0.53
LeuArg: 3.247 ± 0.573
LeuSer: 6.398 ± 0.99
LeuThr: 4.87 ± 0.733
LeuVal: 4.011 ± 0.535
LeuTrp: 0.955 ± 0.366
LeuTyr: 2.292 ± 0.495
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.814 ± 0.394
MetCys: 0.286 ± 0.146
MetAsp: 1.241 ± 0.313
MetGlu: 2.292 ± 0.363
MetPhe: 0.573 ± 0.213
MetGly: 1.814 ± 0.376
MetHis: 0.286 ± 0.168
MetIle: 2.292 ± 0.525
MetLys: 1.91 ± 0.447
MetLeu: 1.623 ± 0.42
MetMet: 0.286 ± 0.148
MetAsn: 1.432 ± 0.332
MetPro: 0.764 ± 0.268
MetGln: 1.241 ± 0.315
MetArg: 0.764 ± 0.341
MetSer: 1.337 ± 0.312
MetThr: 1.337 ± 0.356
MetVal: 1.432 ± 0.336
MetTrp: 0.0 ± 0.0
MetTyr: 0.764 ± 0.218
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.533 ± 0.435
AsnCys: 0.477 ± 0.187
AsnAsp: 2.96 ± 0.434
AsnGlu: 3.915 ± 0.755
AsnPhe: 2.196 ± 0.412
AsnGly: 4.297 ± 0.599
AsnHis: 1.146 ± 0.397
AsnIle: 3.915 ± 0.895
AsnLys: 4.393 ± 0.671
AsnLeu: 4.775 ± 0.558
AsnMet: 0.477 ± 0.208
AsnAsn: 2.578 ± 0.527
AsnPro: 2.387 ± 0.469
AsnGln: 2.578 ± 0.498
AsnArg: 2.769 ± 0.479
AsnSer: 2.483 ± 0.504
AsnThr: 3.533 ± 0.597
AsnVal: 3.533 ± 0.442
AsnTrp: 0.764 ± 0.269
AsnTyr: 2.101 ± 0.376
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.578 ± 0.45
ProCys: 0.286 ± 0.155
ProAsp: 1.814 ± 0.482
ProGlu: 2.483 ± 0.507
ProPhe: 1.241 ± 0.286
ProGly: 1.337 ± 0.409
ProHis: 0.191 ± 0.134
ProIle: 2.101 ± 0.417
ProLys: 2.674 ± 0.485
ProLeu: 2.101 ± 0.391
ProMet: 0.764 ± 0.186
ProAsn: 1.528 ± 0.405
ProPro: 0.286 ± 0.146
ProGln: 1.241 ± 0.281
ProArg: 1.719 ± 0.405
ProSer: 2.578 ± 0.414
ProThr: 2.387 ± 0.561
ProVal: 1.91 ± 0.365
ProTrp: 0.095 ± 0.101
ProTyr: 1.146 ± 0.378
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.96 ± 0.699
GlnCys: 0.191 ± 0.138
GlnAsp: 1.814 ± 0.293
GlnGlu: 3.342 ± 0.584
GlnPhe: 1.432 ± 0.462
GlnGly: 3.629 ± 0.87
GlnHis: 0.668 ± 0.274
GlnIle: 2.101 ± 0.563
GlnLys: 3.438 ± 0.648
GlnLeu: 2.96 ± 0.439
GlnMet: 1.05 ± 0.278
GlnAsn: 3.056 ± 0.418
GlnPro: 1.241 ± 0.367
GlnGln: 0.764 ± 0.23
GlnArg: 1.814 ± 0.414
GlnSer: 2.101 ± 0.45
GlnThr: 2.387 ± 0.432
GlnVal: 1.814 ± 0.416
GlnTrp: 0.382 ± 0.177
GlnTyr: 1.241 ± 0.464
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.578 ± 0.502
ArgCys: 0.382 ± 0.19
ArgAsp: 2.483 ± 0.511
ArgGlu: 2.674 ± 0.502
ArgPhe: 2.196 ± 0.456
ArgGly: 2.196 ± 0.518
ArgHis: 1.337 ± 0.435
ArgIle: 2.865 ± 0.468
ArgLys: 4.106 ± 0.896
ArgLeu: 3.915 ± 0.679
ArgMet: 0.955 ± 0.306
ArgAsn: 3.342 ± 0.648
ArgPro: 1.623 ± 0.423
ArgGln: 1.146 ± 0.335
ArgArg: 1.814 ± 0.528
ArgSer: 1.623 ± 0.481
ArgThr: 3.533 ± 0.534
ArgVal: 2.865 ± 0.512
ArgTrp: 0.573 ± 0.224
ArgTyr: 2.483 ± 0.491
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.584 ± 1.095
SerCys: 0.191 ± 0.123
SerAsp: 4.297 ± 0.546
SerGlu: 5.539 ± 0.739
SerPhe: 1.91 ± 0.396
SerGly: 4.966 ± 0.597
SerHis: 0.955 ± 0.311
SerIle: 4.488 ± 0.761
SerLys: 3.915 ± 0.629
SerLeu: 5.825 ± 0.696
SerMet: 2.292 ± 0.586
SerAsn: 3.247 ± 0.614
SerPro: 1.623 ± 0.405
SerGln: 2.101 ± 0.443
SerArg: 2.005 ± 0.499
SerSer: 4.202 ± 0.63
SerThr: 4.106 ± 0.561
SerVal: 4.202 ± 0.707
SerTrp: 0.859 ± 0.268
SerTyr: 2.578 ± 0.492
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.011 ± 0.677
ThrCys: 0.382 ± 0.152
ThrAsp: 3.342 ± 0.47
ThrGlu: 3.247 ± 0.685
ThrPhe: 2.674 ± 0.734
ThrGly: 5.825 ± 0.747
ThrHis: 1.05 ± 0.346
ThrIle: 4.297 ± 0.58
ThrLys: 3.629 ± 0.645
ThrLeu: 4.775 ± 0.612
ThrMet: 1.814 ± 0.424
ThrAsn: 3.247 ± 0.496
ThrPro: 2.005 ± 0.445
ThrGln: 2.674 ± 0.587
ThrArg: 1.719 ± 0.456
ThrSer: 2.865 ± 0.434
ThrThr: 3.247 ± 0.495
ThrVal: 4.966 ± 0.653
ThrTrp: 0.477 ± 0.227
ThrTyr: 2.674 ± 0.479
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.348 ± 0.748
ValCys: 0.191 ± 0.127
ValAsp: 4.297 ± 0.535
ValGlu: 5.157 ± 0.696
ValPhe: 1.528 ± 0.338
ValGly: 3.82 ± 0.727
ValHis: 1.146 ± 0.428
ValIle: 3.438 ± 0.551
ValLys: 5.252 ± 0.739
ValLeu: 3.533 ± 0.518
ValMet: 1.146 ± 0.307
ValAsn: 3.247 ± 0.513
ValPro: 1.814 ± 0.466
ValGln: 2.483 ± 0.575
ValArg: 2.96 ± 0.573
ValSer: 5.634 ± 0.665
ValThr: 5.061 ± 0.786
ValVal: 4.106 ± 0.677
ValTrp: 0.191 ± 0.128
ValTyr: 2.101 ± 0.422
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.859 ± 0.291
TrpCys: 0.191 ± 0.138
TrpAsp: 0.573 ± 0.327
TrpGlu: 1.146 ± 0.378
TrpPhe: 0.668 ± 0.263
TrpGly: 1.241 ± 0.367
TrpHis: 0.191 ± 0.126
TrpIle: 0.573 ± 0.237
TrpLys: 1.432 ± 0.303
TrpLeu: 0.764 ± 0.301
TrpMet: 0.477 ± 0.21
TrpAsn: 1.241 ± 0.315
TrpPro: 0.382 ± 0.165
TrpGln: 0.286 ± 0.14
TrpArg: 0.573 ± 0.222
TrpSer: 0.955 ± 0.311
TrpThr: 0.382 ± 0.172
TrpVal: 0.668 ± 0.22
TrpTrp: 0.286 ± 0.181
TrpTyr: 0.764 ± 0.404
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.101 ± 0.428
TyrCys: 0.382 ± 0.193
TyrAsp: 2.674 ± 0.5
TyrGlu: 2.674 ± 0.548
TyrPhe: 1.91 ± 0.549
TyrGly: 2.483 ± 0.51
TyrHis: 0.859 ± 0.243
TyrIle: 3.056 ± 0.643
TyrLys: 3.438 ± 0.576
TyrLeu: 3.438 ± 0.649
TyrMet: 0.859 ± 0.251
TyrAsn: 3.056 ± 0.617
TyrPro: 1.241 ± 0.315
TyrGln: 2.483 ± 0.396
TyrArg: 3.151 ± 0.45
TyrSer: 2.101 ± 0.361
TyrThr: 1.337 ± 0.259
TyrVal: 1.719 ± 0.404
TyrTrp: 0.382 ± 0.21
TyrTyr: 2.101 ± 0.437
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (10473 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
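A protein-level bootstrap of this kind can be sketched as follows. This is a hedged illustration rather than the database's actual procedure: the function names are hypothetical, the frequency is normalized by the number of dipeptides in each resample, and the reported ± value is assumed here to be the standard deviation across replicates.

```python
import random
import statistics


def dipeptide_freq_permille(proteins, dipeptide):
    """Per mille frequency of one dipeptide over a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling proteins.

    Whole proteins are drawn with replacement, so residues within a
    protein always stay together (resampling "at the protein level").
    Assumption: the error is the standard deviation over the replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq_permille(sample, dipeptide))
    return statistics.stdev(estimates)
```

For the proteome above, proteins would be the list of 53 sequences and n_rounds=100 matches the number of replicates stated in the note.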

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski