Amino acid dipeptide frequency for Streptococcus phage Javan284

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
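As a rough illustration, dipeptide frequencies can be obtained by counting overlapping residue pairs within each protein. The sketch below is an assumption about the procedure, not the Proteome-pI code: the function name `dipeptide_per_mille` and the toy sequences are hypothetical, it uses one-letter codes rather than the three-letter codes of the table, and it normalises by the total number of counted dipeptides, which may differ from the normalisation used for the table.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences, not taken from the Javan284 proteome.
toy_proteins = ["MKKLA", "MKA"]
freqs = dipeptide_per_mille(toy_proteins)
```

Here `freqs["MK"]` is two occurrences out of six counted dipeptides, expressed in per mille.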

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
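The expected value and the red/blue marking can be sketched as follows. This is only an illustration: it assumes the marking compares each value directly with the uniform expectation of 1/400, whereas the page may apply a different rule (for example, one that takes the error estimate into account). The function name `classify` is hypothetical.

```python
N_STANDARD = 20
# 1/400 = 0.25% = 2.5 per mille
expected_per_mille = 1000.0 / (N_STANDARD ** 2)

def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"    # would be marked in red
    if observed_per_mille < expected:
        return "under"   # would be marked in blue
    return "as expected"

# Values copied from the table (in per mille).
examples = {"AlaLeu": 7.684, "CysCys": 0.0, "GlyGln": 2.59}
labels = {pair: classify(value) for pair, value in examples.items()}
```

Under this assumed rule, AlaLeu and GlyGln are overrepresented and CysCys is underrepresented.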

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
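A minimal sketch of the unit conversion, using the AlaAla value from the table. The helper name `per_mille_to_fraction` is hypothetical. The count estimate additionally assumes that frequencies are normalised by the total number of amino acids (11583, from the statistics line of this page); the smallest non-zero table entry, 0.086‰, is close to 1/11583, which is consistent with that assumption but does not prove it.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3

TOTAL_RESIDUES = 11583  # "51 proteins (11583 amino acids)"

ala_ala_fraction = per_mille_to_fraction(4.058)   # about 0.004058
ala_ala_percent = ala_ala_fraction * 100          # about 0.4058 %

# Assumed normalisation: occurrences divided by total residue count.
approx_count = round(ala_ala_fraction * TOTAL_RESIDUES)
```

Under the stated assumption, the AlaAla entry corresponds to roughly 47 occurrences in the proteome.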

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.058 ± 1.164
AlaCys: 0.345 ± 0.173
AlaAsp: 4.749 ± 0.802
AlaGlu: 6.476 ± 0.869
AlaPhe: 3.626 ± 0.831
AlaGly: 4.662 ± 1.209
AlaHis: 0.604 ± 0.265
AlaIle: 6.13 ± 1.085
AlaLys: 5.958 ± 0.838
AlaLeu: 7.684 ± 1.354
AlaMet: 2.763 ± 0.876
AlaAsn: 4.49 ± 0.554
AlaPro: 1.986 ± 0.308
AlaGln: 3.454 ± 0.865
AlaArg: 1.899 ± 0.34
AlaSer: 4.576 ± 0.986
AlaThr: 4.835 ± 1.061
AlaVal: 5.008 ± 1.388
AlaTrp: 0.691 ± 0.237
AlaTyr: 3.195 ± 0.634
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.259 ± 0.147
CysCys: 0.0 ± 0.0
CysAsp: 0.518 ± 0.257
CysGlu: 0.777 ± 0.294
CysPhe: 0.086 ± 0.099
CysGly: 0.777 ± 0.226
CysHis: 0.173 ± 0.12
CysIle: 0.173 ± 0.106
CysLys: 0.345 ± 0.174
CysLeu: 0.259 ± 0.149
CysMet: 0.0 ± 0.0
CysAsn: 0.259 ± 0.15
CysPro: 0.173 ± 0.199
CysGln: 0.173 ± 0.134
CysArg: 0.173 ± 0.114
CysSer: 0.173 ± 0.135
CysThr: 0.518 ± 0.211
CysVal: 0.432 ± 0.217
CysTrp: 0.086 ± 0.088
CysTyr: 0.173 ± 0.129
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.317 ± 0.533
AspCys: 0.432 ± 0.194
AspAsp: 4.403 ± 0.845
AspGlu: 4.835 ± 0.897
AspPhe: 3.626 ± 0.496
AspGly: 6.821 ± 0.685
AspHis: 0.691 ± 0.2
AspIle: 4.49 ± 0.828
AspLys: 4.749 ± 0.826
AspLeu: 4.749 ± 0.701
AspMet: 1.64 ± 0.384
AspAsn: 3.799 ± 0.51
AspPro: 0.95 ± 0.34
AspGln: 1.64 ± 0.322
AspArg: 2.418 ± 0.433
AspSer: 3.367 ± 0.462
AspThr: 4.317 ± 0.714
AspVal: 3.799 ± 0.633
AspTrp: 0.259 ± 0.141
AspTyr: 2.936 ± 0.593
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.403 ± 0.672
GluCys: 0.345 ± 0.195
GluAsp: 4.49 ± 0.757
GluGlu: 5.785 ± 0.991
GluPhe: 3.367 ± 0.572
GluGly: 3.022 ± 0.486
GluHis: 1.209 ± 0.347
GluIle: 5.008 ± 1.068
GluLys: 4.403 ± 0.776
GluLeu: 6.217 ± 0.959
GluMet: 2.849 ± 0.472
GluAsn: 3.713 ± 0.677
GluPro: 1.727 ± 0.524
GluGln: 4.403 ± 0.745
GluArg: 3.713 ± 0.718
GluSer: 2.331 ± 0.433
GluThr: 3.713 ± 0.764
GluVal: 4.576 ± 0.766
GluTrp: 0.777 ± 0.243
GluTyr: 2.677 ± 0.465
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.59 ± 0.558
PheCys: 0.259 ± 0.126
PheAsp: 3.799 ± 0.573
PheGlu: 3.54 ± 0.753
PhePhe: 1.468 ± 0.265
PheGly: 2.59 ± 0.608
PheHis: 0.259 ± 0.143
PheIle: 2.418 ± 0.487
PheLys: 5.094 ± 0.574
PheLeu: 1.986 ± 0.499
PheMet: 1.122 ± 0.334
PheAsn: 3.367 ± 0.446
PhePro: 1.209 ± 0.336
PheGln: 1.468 ± 0.42
PheArg: 1.295 ± 0.291
PheSer: 3.54 ± 0.532
PheThr: 2.072 ± 0.344
PheVal: 2.504 ± 0.457
PheTrp: 0.863 ± 0.346
PheTyr: 1.122 ± 0.304
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.18 ± 1.745
GlyCys: 0.432 ± 0.177
GlyAsp: 2.504 ± 0.498
GlyGlu: 3.108 ± 0.421
GlyPhe: 2.936 ± 0.527
GlyGly: 4.403 ± 0.619
GlyHis: 1.036 ± 0.384
GlyIle: 4.921 ± 0.892
GlyLys: 5.526 ± 0.634
GlyLeu: 5.008 ± 0.772
GlyMet: 1.381 ± 0.388
GlyAsn: 5.353 ± 1.128
GlyPro: 0.345 ± 0.15
GlyGln: 2.59 ± 0.469
GlyArg: 3.022 ± 0.449
GlySer: 4.662 ± 0.854
GlyThr: 4.403 ± 0.827
GlyVal: 4.835 ± 0.709
GlyTrp: 0.863 ± 0.269
GlyTyr: 3.022 ± 0.655
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.777 ± 0.195
HisCys: 0.259 ± 0.143
HisAsp: 0.777 ± 0.254
HisGlu: 0.691 ± 0.244
HisPhe: 0.604 ± 0.235
HisGly: 0.863 ± 0.246
HisHis: 0.259 ± 0.13
HisIle: 0.518 ± 0.254
HisLys: 0.863 ± 0.262
HisLeu: 0.863 ± 0.249
HisMet: 0.518 ± 0.234
HisAsn: 0.777 ± 0.258
HisPro: 0.432 ± 0.225
HisGln: 0.863 ± 0.305
HisArg: 0.432 ± 0.174
HisSer: 0.777 ± 0.239
HisThr: 0.95 ± 0.355
HisVal: 0.95 ± 0.307
HisTrp: 0.173 ± 0.127
HisTyr: 0.432 ± 0.193
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.562 ± 1.036
IleCys: 0.173 ± 0.124
IleAsp: 5.094 ± 0.507
IleGlu: 3.885 ± 0.692
IlePhe: 2.418 ± 0.498
IleGly: 3.799 ± 1.02
IleHis: 0.777 ± 0.251
IleIle: 3.626 ± 0.759
IleLys: 6.13 ± 0.672
IleLeu: 3.626 ± 0.515
IleMet: 0.95 ± 0.283
IleAsn: 5.526 ± 0.781
IlePro: 1.899 ± 0.428
IleGln: 2.245 ± 0.391
IleArg: 2.159 ± 0.474
IleSer: 4.835 ± 0.879
IleThr: 5.18 ± 0.73
IleVal: 3.713 ± 0.659
IleTrp: 1.295 ± 0.387
IleTyr: 2.418 ± 0.479
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.735 ± 0.91
LysCys: 0.863 ± 0.25
LysAsp: 4.921 ± 0.664
LysGlu: 5.353 ± 0.916
LysPhe: 2.331 ± 0.445
LysGly: 4.231 ± 0.587
LysHis: 1.468 ± 0.349
LysIle: 5.698 ± 0.719
LysLys: 6.821 ± 0.923
LysLeu: 6.217 ± 0.932
LysMet: 1.64 ± 0.414
LysAsn: 3.972 ± 0.675
LysPro: 2.677 ± 0.518
LysGln: 3.626 ± 0.718
LysArg: 3.454 ± 0.815
LysSer: 5.698 ± 0.811
LysThr: 5.008 ± 0.56
LysVal: 5.18 ± 0.481
LysTrp: 0.691 ± 0.217
LysTyr: 3.626 ± 0.778
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.612 ± 0.712
LeuCys: 0.432 ± 0.209
LeuAsp: 4.835 ± 0.811
LeuGlu: 5.612 ± 0.853
LeuPhe: 3.195 ± 0.544
LeuGly: 5.267 ± 1.075
LeuHis: 1.122 ± 0.329
LeuIle: 3.713 ± 0.527
LeuLys: 6.13 ± 0.696
LeuLeu: 4.058 ± 0.791
LeuMet: 1.899 ± 0.555
LeuAsn: 4.662 ± 0.774
LeuPro: 3.022 ± 0.425
LeuGln: 3.022 ± 0.539
LeuArg: 2.418 ± 0.495
LeuSer: 5.785 ± 0.821
LeuThr: 6.562 ± 0.921
LeuVal: 4.058 ± 0.63
LeuTrp: 0.777 ± 0.296
LeuTyr: 2.504 ± 0.586
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.899 ± 0.691
MetCys: 0.173 ± 0.113
MetAsp: 1.381 ± 0.315
MetGlu: 1.209 ± 0.284
MetPhe: 1.468 ± 0.345
MetGly: 1.813 ± 0.411
MetHis: 0.259 ± 0.186
MetIle: 1.209 ± 0.277
MetLys: 2.331 ± 0.438
MetLeu: 1.554 ± 0.345
MetMet: 0.691 ± 0.2
MetAsn: 1.209 ± 0.406
MetPro: 0.777 ± 0.248
MetGln: 1.64 ± 0.436
MetArg: 1.122 ± 0.4
MetSer: 1.813 ± 0.474
MetThr: 2.331 ± 0.443
MetVal: 1.381 ± 0.343
MetTrp: 0.345 ± 0.139
MetTyr: 0.863 ± 0.272
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.698 ± 0.761
AsnCys: 0.086 ± 0.099
AsnAsp: 3.972 ± 0.622
AsnGlu: 5.008 ± 0.797
AsnPhe: 2.936 ± 0.788
AsnGly: 5.18 ± 1.104
AsnHis: 0.604 ± 0.192
AsnIle: 3.713 ± 0.549
AsnLys: 3.799 ± 0.743
AsnLeu: 4.749 ± 0.482
AsnMet: 1.64 ± 0.403
AsnAsn: 4.144 ± 0.65
AsnPro: 2.504 ± 0.531
AsnGln: 2.59 ± 0.794
AsnArg: 1.986 ± 0.373
AsnSer: 3.108 ± 0.627
AsnThr: 3.54 ± 0.44
AsnVal: 4.921 ± 0.75
AsnTrp: 0.604 ± 0.263
AsnTyr: 1.554 ± 0.426
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.504 ± 0.523
ProCys: 0.173 ± 0.118
ProAsp: 1.899 ± 0.45
ProGlu: 2.245 ± 0.452
ProPhe: 1.468 ± 0.254
ProGly: 1.209 ± 0.405
ProHis: 0.259 ± 0.144
ProIle: 1.468 ± 0.436
ProLys: 1.899 ± 0.497
ProLeu: 2.331 ± 0.415
ProMet: 0.604 ± 0.196
ProAsn: 1.295 ± 0.324
ProPro: 0.691 ± 0.22
ProGln: 1.295 ± 0.442
ProArg: 1.036 ± 0.308
ProSer: 1.554 ± 0.36
ProThr: 2.159 ± 0.361
ProVal: 1.899 ± 0.388
ProTrp: 0.0 ± 0.0
ProTyr: 1.036 ± 0.244
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.885 ± 0.789
GlnCys: 0.173 ± 0.131
GlnAsp: 2.159 ± 0.478
GlnGlu: 3.195 ± 0.606
GlnPhe: 1.554 ± 0.292
GlnGly: 3.108 ± 0.664
GlnHis: 0.518 ± 0.193
GlnIle: 3.972 ± 0.712
GlnLys: 3.367 ± 0.739
GlnLeu: 3.885 ± 0.575
GlnMet: 1.727 ± 0.494
GlnAsn: 2.677 ± 0.941
GlnPro: 0.691 ± 0.262
GlnGln: 2.072 ± 0.627
GlnArg: 1.295 ± 0.415
GlnSer: 2.245 ± 0.56
GlnThr: 2.59 ± 0.6
GlnVal: 2.763 ± 0.579
GlnTrp: 0.518 ± 0.171
GlnTyr: 1.295 ± 0.337
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.367 ± 0.622
ArgCys: 0.0 ± 0.0
ArgAsp: 2.418 ± 0.503
ArgGlu: 2.331 ± 0.531
ArgPhe: 1.209 ± 0.305
ArgGly: 1.209 ± 0.312
ArgHis: 0.259 ± 0.228
ArgIle: 2.331 ± 0.456
ArgLys: 3.281 ± 0.627
ArgLeu: 3.108 ± 0.411
ArgMet: 1.036 ± 0.306
ArgAsn: 2.072 ± 0.349
ArgPro: 1.468 ± 0.414
ArgGln: 1.036 ± 0.316
ArgArg: 1.209 ± 0.422
ArgSer: 1.554 ± 0.393
ArgThr: 2.504 ± 0.503
ArgVal: 2.418 ± 0.439
ArgTrp: 1.122 ± 0.342
ArgTyr: 1.986 ± 0.412
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.353 ± 1.367
SerCys: 0.086 ± 0.091
SerAsp: 5.267 ± 0.62
SerGlu: 3.454 ± 0.609
SerPhe: 2.072 ± 0.502
SerGly: 5.18 ± 0.807
SerHis: 1.036 ± 0.285
SerIle: 4.576 ± 0.813
SerLys: 4.749 ± 0.544
SerLeu: 4.317 ± 0.504
SerMet: 1.554 ± 0.518
SerAsn: 4.835 ± 0.499
SerPro: 1.64 ± 0.296
SerGln: 2.677 ± 0.633
SerArg: 1.468 ± 0.27
SerSer: 4.144 ± 0.606
SerThr: 3.885 ± 0.59
SerVal: 3.885 ± 0.665
SerTrp: 1.209 ± 0.28
SerTyr: 2.849 ± 0.398
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.044 ± 1.002
ThrCys: 0.345 ± 0.164
ThrAsp: 3.454 ± 0.53
ThrGlu: 3.713 ± 0.589
ThrPhe: 3.367 ± 0.561
ThrGly: 4.058 ± 0.702
ThrHis: 0.604 ± 0.236
ThrIle: 4.576 ± 0.66
ThrLys: 4.921 ± 0.733
ThrLeu: 6.303 ± 0.925
ThrMet: 0.95 ± 0.316
ThrAsn: 3.454 ± 0.534
ThrPro: 2.331 ± 0.403
ThrGln: 3.626 ± 0.721
ThrArg: 1.986 ± 0.41
ThrSer: 4.317 ± 0.684
ThrThr: 3.281 ± 0.643
ThrVal: 5.353 ± 0.798
ThrTrp: 0.777 ± 0.29
ThrTyr: 2.849 ± 0.521
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.749 ± 1.297
ValCys: 0.259 ± 0.152
ValAsp: 3.799 ± 0.556
ValGlu: 4.403 ± 0.836
ValPhe: 2.677 ± 0.483
ValGly: 4.749 ± 1.221
ValHis: 0.691 ± 0.242
ValIle: 4.231 ± 0.715
ValLys: 5.094 ± 0.665
ValLeu: 4.317 ± 0.654
ValMet: 1.381 ± 0.343
ValAsn: 3.195 ± 0.519
ValPro: 1.122 ± 0.377
ValGln: 2.849 ± 0.584
ValArg: 2.072 ± 0.381
ValSer: 5.871 ± 0.712
ValThr: 4.576 ± 0.722
ValVal: 3.972 ± 0.456
ValTrp: 1.209 ± 0.312
ValTyr: 3.022 ± 0.503
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.518 ± 0.176
TrpCys: 0.173 ± 0.11
TrpAsp: 0.691 ± 0.233
TrpGlu: 0.95 ± 0.368
TrpPhe: 0.259 ± 0.155
TrpGly: 0.691 ± 0.24
TrpHis: 0.432 ± 0.189
TrpIle: 0.604 ± 0.261
TrpLys: 1.64 ± 0.515
TrpLeu: 0.691 ± 0.269
TrpMet: 0.518 ± 0.199
TrpAsn: 0.518 ± 0.237
TrpPro: 0.0 ± 0.0
TrpGln: 0.604 ± 0.26
TrpArg: 0.604 ± 0.254
TrpSer: 1.036 ± 0.227
TrpThr: 1.122 ± 0.446
TrpVal: 0.691 ± 0.187
TrpTrp: 0.173 ± 0.117
TrpTyr: 0.777 ± 0.319
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.677 ± 0.588
TyrCys: 0.518 ± 0.26
TyrAsp: 3.195 ± 0.709
TyrGlu: 2.159 ± 0.538
TyrPhe: 1.899 ± 0.512
TyrGly: 1.899 ± 0.454
TyrHis: 0.518 ± 0.255
TyrIle: 2.936 ± 0.485
TyrLys: 3.108 ± 0.575
TyrLeu: 2.763 ± 0.616
TyrMet: 0.518 ± 0.189
TyrAsn: 3.022 ± 0.596
TyrPro: 1.381 ± 0.393
TyrGln: 1.813 ± 0.356
TyrArg: 2.159 ± 0.428
TyrSer: 2.849 ± 0.47
TyrThr: 2.763 ± 0.551
TyrVal: 1.899 ± 0.431
TyrTrp: 0.259 ± 0.139
TyrTyr: 1.986 ± 0.575
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (11583 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
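Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequencies are recomputed for each resample, and the spread across resamples is taken as the error. This is a sketch under stated assumptions, not the Proteome-pI implementation: the function names, the toy sequences, the fixed seed, the normalisation by dipeptide count, and the use of the population standard deviation as the error measure are all hypothetical choices.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins):
    """Dipeptide frequencies in per mille for one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def bootstrap_error(proteins, n_rounds=100, seed=0):
    """Standard deviation of each frequency over protein-level resamples."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n_rounds):
        # Resample whole proteins (not residues) with replacement.
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(dipeptide_per_mille(resampled))
    pairs = set().union(*samples)
    # A dipeptide absent from a resample counts as frequency 0 there.
    return {
        pair: statistics.pstdev([s.get(pair, 0.0) for s in samples])
        for pair in pairs
    }

# Hypothetical toy sequences, not taken from the Javan284 proteome.
toy_proteins = ["MKKLA", "MKA", "MAKLLK"]
errors = bootstrap_error(toy_proteins)
```

Resampling whole proteins rather than individual residues keeps the dipeptides of one protein together, so the estimate reflects variation between proteins.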

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski