Amino acid dipepetide frequency for Streptococcus phage Javan233

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.953AlaAla: 3.953 ± 1.192
0.439AlaCys: 0.439 ± 0.19
4.568AlaAsp: 4.568 ± 0.533
5.535AlaGlu: 5.535 ± 0.844
2.021AlaPhe: 2.021 ± 0.335
4.568AlaGly: 4.568 ± 0.932
0.966AlaHis: 0.966 ± 0.243
5.798AlaIle: 5.798 ± 0.9
5.886AlaLys: 5.886 ± 0.691
5.447AlaLeu: 5.447 ± 0.685
1.933AlaMet: 1.933 ± 0.398
4.217AlaAsn: 4.217 ± 0.675
1.493AlaPro: 1.493 ± 0.435
3.338AlaGln: 3.338 ± 0.626
3.426AlaArg: 3.426 ± 0.573
5.535AlaSer: 5.535 ± 0.844
4.568AlaThr: 4.568 ± 0.561
5.183AlaVal: 5.183 ± 0.695
1.406AlaTrp: 1.406 ± 0.529
2.46AlaTyr: 2.46 ± 0.548
0.0AlaXaa: 0.0 ± 0.0
Cys
0.439CysAla: 0.439 ± 0.195
0.0CysCys: 0.0 ± 0.0
0.791CysAsp: 0.791 ± 0.243
0.176CysGlu: 0.176 ± 0.131
0.088CysPhe: 0.088 ± 0.089
0.615CysGly: 0.615 ± 0.255
0.176CysHis: 0.176 ± 0.102
0.176CysIle: 0.176 ± 0.111
0.351CysLys: 0.351 ± 0.136
0.264CysLeu: 0.264 ± 0.136
0.088CysMet: 0.088 ± 0.094
0.0CysAsn: 0.0 ± 0.0
0.176CysPro: 0.176 ± 0.14
0.264CysGln: 0.264 ± 0.26
0.176CysArg: 0.176 ± 0.114
0.615CysSer: 0.615 ± 0.208
0.264CysThr: 0.264 ± 0.175
0.351CysVal: 0.351 ± 0.161
0.176CysTrp: 0.176 ± 0.137
0.351CysTyr: 0.351 ± 0.17
0.0CysXaa: 0.0 ± 0.0
Asp
2.636AspAla: 2.636 ± 0.523
0.439AspCys: 0.439 ± 0.183
4.832AspAsp: 4.832 ± 0.727
6.062AspGlu: 6.062 ± 0.943
3.25AspPhe: 3.25 ± 0.613
4.305AspGly: 4.305 ± 0.84
0.527AspHis: 0.527 ± 0.208
3.426AspIle: 3.426 ± 0.574
3.953AspLys: 3.953 ± 0.633
6.062AspLeu: 6.062 ± 0.648
2.46AspMet: 2.46 ± 0.459
3.865AspAsn: 3.865 ± 0.521
1.142AspPro: 1.142 ± 0.295
1.318AspGln: 1.318 ± 0.345
1.757AspArg: 1.757 ± 0.303
3.778AspSer: 3.778 ± 0.643
4.305AspThr: 4.305 ± 0.619
3.953AspVal: 3.953 ± 0.662
1.318AspTrp: 1.318 ± 0.314
3.514AspTyr: 3.514 ± 0.592
0.0AspXaa: 0.0 ± 0.0
Glu
6.764GluAla: 6.764 ± 0.835
0.088GluCys: 0.088 ± 0.081
3.865GluAsp: 3.865 ± 0.685
4.656GluGlu: 4.656 ± 0.802
3.338GluPhe: 3.338 ± 0.53
3.25GluGly: 3.25 ± 0.673
0.966GluHis: 0.966 ± 0.229
6.413GluIle: 6.413 ± 0.891
5.007GluLys: 5.007 ± 0.855
8.697GluLeu: 8.697 ± 1.215
2.108GluMet: 2.108 ± 0.553
3.953GluAsn: 3.953 ± 0.742
2.284GluPro: 2.284 ± 0.353
2.636GluGln: 2.636 ± 0.613
2.284GluArg: 2.284 ± 0.435
3.163GluSer: 3.163 ± 0.583
4.48GluThr: 4.48 ± 0.525
5.007GluVal: 5.007 ± 0.697
0.879GluTrp: 0.879 ± 0.41
2.548GluTyr: 2.548 ± 0.478
0.0GluXaa: 0.0 ± 0.0
Phe
2.372PheAla: 2.372 ± 0.509
0.088PheCys: 0.088 ± 0.082
3.338PheAsp: 3.338 ± 0.524
4.48PheGlu: 4.48 ± 0.911
1.669PhePhe: 1.669 ± 0.386
2.548PheGly: 2.548 ± 0.436
0.439PheHis: 0.439 ± 0.217
2.372PheIle: 2.372 ± 0.365
3.514PheLys: 3.514 ± 0.45
2.899PheLeu: 2.899 ± 0.596
1.23PheMet: 1.23 ± 0.327
2.636PheAsn: 2.636 ± 0.504
1.318PhePro: 1.318 ± 0.367
0.527PheGln: 0.527 ± 0.235
1.142PheArg: 1.142 ± 0.279
2.987PheSer: 2.987 ± 0.426
2.021PheThr: 2.021 ± 0.321
2.46PheVal: 2.46 ± 0.538
0.527PheTrp: 0.527 ± 0.243
1.581PheTyr: 1.581 ± 0.433
0.0PheXaa: 0.0 ± 0.0
Gly
4.305GlyAla: 4.305 ± 0.835
0.439GlyCys: 0.439 ± 0.201
2.723GlyAsp: 2.723 ± 0.392
3.338GlyGlu: 3.338 ± 0.489
2.636GlyPhe: 2.636 ± 0.502
3.163GlyGly: 3.163 ± 0.646
0.966GlyHis: 0.966 ± 0.248
5.622GlyIle: 5.622 ± 0.89
4.744GlyLys: 4.744 ± 0.667
5.535GlyLeu: 5.535 ± 0.856
1.23GlyMet: 1.23 ± 0.331
4.041GlyAsn: 4.041 ± 0.65
0.439GlyPro: 0.439 ± 0.24
2.196GlyGln: 2.196 ± 0.482
2.108GlyArg: 2.108 ± 0.502
4.129GlySer: 4.129 ± 0.861
3.69GlyThr: 3.69 ± 0.681
3.602GlyVal: 3.602 ± 0.512
0.791GlyTrp: 0.791 ± 0.256
2.811GlyTyr: 2.811 ± 0.487
0.0GlyXaa: 0.0 ± 0.0
His
0.791HisAla: 0.791 ± 0.292
0.176HisCys: 0.176 ± 0.168
0.439HisAsp: 0.439 ± 0.218
1.054HisGlu: 1.054 ± 0.408
0.966HisPhe: 0.966 ± 0.268
0.966HisGly: 0.966 ± 0.295
0.176HisHis: 0.176 ± 0.113
0.703HisIle: 0.703 ± 0.273
0.966HisLys: 0.966 ± 0.271
0.703HisLeu: 0.703 ± 0.254
0.351HisMet: 0.351 ± 0.198
0.351HisAsn: 0.351 ± 0.136
0.439HisPro: 0.439 ± 0.214
0.791HisGln: 0.791 ± 0.254
0.176HisArg: 0.176 ± 0.128
1.142HisSer: 1.142 ± 0.267
0.703HisThr: 0.703 ± 0.225
0.879HisVal: 0.879 ± 0.308
0.264HisTrp: 0.264 ± 0.162
0.527HisTyr: 0.527 ± 0.182
0.0HisXaa: 0.0 ± 0.0
Ile
6.677IleAla: 6.677 ± 0.954
0.615IleCys: 0.615 ± 0.182
4.305IleAsp: 4.305 ± 0.573
5.007IleGlu: 5.007 ± 0.823
2.284IlePhe: 2.284 ± 0.57
2.899IleGly: 2.899 ± 0.457
0.615IleHis: 0.615 ± 0.23
3.338IleIle: 3.338 ± 0.735
7.204IleLys: 7.204 ± 0.857
5.095IleLeu: 5.095 ± 0.738
1.581IleMet: 1.581 ± 0.351
3.953IleAsn: 3.953 ± 0.591
1.669IlePro: 1.669 ± 0.306
2.46IleGln: 2.46 ± 0.359
2.021IleArg: 2.021 ± 0.354
4.568IleSer: 4.568 ± 0.666
4.48IleThr: 4.48 ± 1.033
5.007IleVal: 5.007 ± 0.824
1.142IleTrp: 1.142 ± 0.344
2.811IleTyr: 2.811 ± 0.539
0.0IleXaa: 0.0 ± 0.0
Lys
6.062LysAla: 6.062 ± 0.762
0.351LysCys: 0.351 ± 0.179
4.92LysAsp: 4.92 ± 0.681
7.028LysGlu: 7.028 ± 1.043
2.987LysPhe: 2.987 ± 0.519
3.426LysGly: 3.426 ± 0.612
0.527LysHis: 0.527 ± 0.239
5.535LysIle: 5.535 ± 0.678
6.15LysLys: 6.15 ± 1.105
6.589LysLeu: 6.589 ± 0.924
1.845LysMet: 1.845 ± 0.485
6.062LysAsn: 6.062 ± 0.713
1.933LysPro: 1.933 ± 0.502
4.217LysGln: 4.217 ± 0.666
4.48LysArg: 4.48 ± 0.833
4.393LysSer: 4.393 ± 0.585
4.832LysThr: 4.832 ± 0.623
5.71LysVal: 5.71 ± 0.784
0.966LysTrp: 0.966 ± 0.286
3.514LysTyr: 3.514 ± 0.544
0.0LysXaa: 0.0 ± 0.0
Leu
6.237LeuAla: 6.237 ± 0.883
0.264LeuCys: 0.264 ± 0.178
5.622LeuAsp: 5.622 ± 0.719
6.413LeuGlu: 6.413 ± 1.106
2.372LeuPhe: 2.372 ± 0.481
5.095LeuGly: 5.095 ± 0.8
1.406LeuHis: 1.406 ± 0.383
4.832LeuIle: 4.832 ± 0.737
8.521LeuLys: 8.521 ± 0.842
5.622LeuLeu: 5.622 ± 0.986
1.581LeuMet: 1.581 ± 0.442
5.798LeuAsn: 5.798 ± 1.284
2.548LeuPro: 2.548 ± 0.484
2.811LeuGln: 2.811 ± 0.583
2.987LeuArg: 2.987 ± 0.465
6.237LeuSer: 6.237 ± 0.774
4.305LeuThr: 4.305 ± 0.801
4.656LeuVal: 4.656 ± 0.703
0.615LeuTrp: 0.615 ± 0.273
2.372LeuTyr: 2.372 ± 0.461
0.0LeuXaa: 0.0 ± 0.0
Met
2.46MetAla: 2.46 ± 0.573
0.088MetCys: 0.088 ± 0.082
1.23MetAsp: 1.23 ± 0.411
1.142MetGlu: 1.142 ± 0.326
0.966MetPhe: 0.966 ± 0.291
0.703MetGly: 0.703 ± 0.309
0.703MetHis: 0.703 ± 0.34
1.054MetIle: 1.054 ± 0.302
2.196MetLys: 2.196 ± 0.416
1.493MetLeu: 1.493 ± 0.275
0.615MetMet: 0.615 ± 0.24
1.406MetAsn: 1.406 ± 0.298
0.879MetPro: 0.879 ± 0.27
1.581MetGln: 1.581 ± 0.415
1.054MetArg: 1.054 ± 0.354
1.406MetSer: 1.406 ± 0.351
2.723MetThr: 2.723 ± 0.507
0.703MetVal: 0.703 ± 0.208
0.176MetTrp: 0.176 ± 0.124
0.791MetTyr: 0.791 ± 0.329
0.0MetXaa: 0.0 ± 0.0
Asn
3.338AsnAla: 3.338 ± 0.515
0.264AsnCys: 0.264 ± 0.156
2.548AsnAsp: 2.548 ± 0.573
2.723AsnGlu: 2.723 ± 0.562
2.196AsnPhe: 2.196 ± 0.409
6.325AsnGly: 6.325 ± 0.681
0.966AsnHis: 0.966 ± 0.293
3.338AsnIle: 3.338 ± 0.681
5.007AsnLys: 5.007 ± 0.938
4.744AsnLeu: 4.744 ± 0.703
0.703AsnMet: 0.703 ± 0.278
4.129AsnAsn: 4.129 ± 0.895
2.372AsnPro: 2.372 ± 0.441
2.987AsnGln: 2.987 ± 0.509
2.636AsnArg: 2.636 ± 0.444
3.953AsnSer: 3.953 ± 0.88
4.656AsnThr: 4.656 ± 0.98
3.865AsnVal: 3.865 ± 0.748
1.23AsnTrp: 1.23 ± 0.347
2.899AsnTyr: 2.899 ± 0.583
0.0AsnXaa: 0.0 ± 0.0
Pro
2.108ProAla: 2.108 ± 0.415
0.088ProCys: 0.088 ± 0.089
1.933ProAsp: 1.933 ± 0.33
2.196ProGlu: 2.196 ± 0.403
0.966ProPhe: 0.966 ± 0.269
0.615ProGly: 0.615 ± 0.239
0.439ProHis: 0.439 ± 0.231
1.933ProIle: 1.933 ± 0.411
2.108ProLys: 2.108 ± 0.615
1.318ProLeu: 1.318 ± 0.313
0.439ProMet: 0.439 ± 0.156
2.021ProAsn: 2.021 ± 0.708
0.351ProPro: 0.351 ± 0.178
1.054ProGln: 1.054 ± 0.345
1.493ProArg: 1.493 ± 0.48
2.021ProSer: 2.021 ± 0.339
2.021ProThr: 2.021 ± 0.368
1.757ProVal: 1.757 ± 0.309
0.264ProTrp: 0.264 ± 0.126
1.23ProTyr: 1.23 ± 0.313
0.0ProXaa: 0.0 ± 0.0
Gln
3.69GlnAla: 3.69 ± 0.59
0.176GlnCys: 0.176 ± 0.124
1.757GlnAsp: 1.757 ± 0.373
3.338GlnGlu: 3.338 ± 0.702
1.581GlnPhe: 1.581 ± 0.429
2.021GlnGly: 2.021 ± 0.46
0.527GlnHis: 0.527 ± 0.236
2.899GlnIle: 2.899 ± 0.584
3.426GlnLys: 3.426 ± 0.708
2.899GlnLeu: 2.899 ± 0.512
0.351GlnMet: 0.351 ± 0.197
2.284GlnAsn: 2.284 ± 0.422
1.054GlnPro: 1.054 ± 0.384
2.021GlnGln: 2.021 ± 0.551
1.757GlnArg: 1.757 ± 0.48
2.372GlnSer: 2.372 ± 0.412
3.514GlnThr: 3.514 ± 0.614
3.338GlnVal: 3.338 ± 0.811
0.176GlnTrp: 0.176 ± 0.111
1.493GlnTyr: 1.493 ± 0.348
0.0GlnXaa: 0.0 ± 0.0
Arg
3.602ArgAla: 3.602 ± 0.793
0.264ArgCys: 0.264 ± 0.159
1.757ArgAsp: 1.757 ± 0.44
2.548ArgGlu: 2.548 ± 0.519
1.845ArgPhe: 1.845 ± 0.391
1.23ArgGly: 1.23 ± 0.368
0.264ArgHis: 0.264 ± 0.128
2.723ArgIle: 2.723 ± 0.472
3.953ArgLys: 3.953 ± 0.726
2.811ArgLeu: 2.811 ± 0.458
1.493ArgMet: 1.493 ± 0.337
1.581ArgAsn: 1.581 ± 0.397
0.879ArgPro: 0.879 ± 0.202
2.46ArgGln: 2.46 ± 0.555
1.318ArgArg: 1.318 ± 0.428
1.669ArgSer: 1.669 ± 0.456
1.933ArgThr: 1.933 ± 0.349
2.636ArgVal: 2.636 ± 0.57
0.176ArgTrp: 0.176 ± 0.138
2.548ArgTyr: 2.548 ± 0.515
0.0ArgXaa: 0.0 ± 0.0
Ser
4.744SerAla: 4.744 ± 0.69
0.176SerCys: 0.176 ± 0.129
4.744SerAsp: 4.744 ± 0.609
5.095SerGlu: 5.095 ± 0.53
3.338SerPhe: 3.338 ± 0.438
4.92SerGly: 4.92 ± 0.575
0.703SerHis: 0.703 ± 0.226
5.007SerIle: 5.007 ± 0.799
4.656SerLys: 4.656 ± 0.649
5.183SerLeu: 5.183 ± 0.606
1.933SerMet: 1.933 ± 0.393
4.041SerAsn: 4.041 ± 0.922
1.318SerPro: 1.318 ± 0.318
2.899SerGln: 2.899 ± 0.532
2.987SerArg: 2.987 ± 0.561
4.92SerSer: 4.92 ± 0.84
5.095SerThr: 5.095 ± 1.256
3.69SerVal: 3.69 ± 0.54
0.791SerTrp: 0.791 ± 0.281
2.196SerTyr: 2.196 ± 0.477
0.0SerXaa: 0.0 ± 0.0
Thr
5.71ThrAla: 5.71 ± 1.583
0.527ThrCys: 0.527 ± 0.231
3.953ThrAsp: 3.953 ± 0.596
4.041ThrGlu: 4.041 ± 0.689
3.25ThrPhe: 3.25 ± 0.434
4.48ThrGly: 4.48 ± 0.82
0.879ThrHis: 0.879 ± 0.265
4.656ThrIle: 4.656 ± 0.864
5.095ThrLys: 5.095 ± 0.812
5.622ThrLeu: 5.622 ± 0.597
0.966ThrMet: 0.966 ± 0.265
3.514ThrAsn: 3.514 ± 0.738
2.636ThrPro: 2.636 ± 0.663
2.811ThrGln: 2.811 ± 0.49
1.318ThrArg: 1.318 ± 0.337
4.744ThrSer: 4.744 ± 0.956
4.744ThrThr: 4.744 ± 0.916
5.447ThrVal: 5.447 ± 0.826
0.439ThrTrp: 0.439 ± 0.156
2.548ThrTyr: 2.548 ± 0.545
0.0ThrXaa: 0.0 ± 0.0
Val
4.656ValAla: 4.656 ± 0.612
0.439ValCys: 0.439 ± 0.222
4.744ValAsp: 4.744 ± 0.834
4.305ValGlu: 4.305 ± 0.718
1.669ValPhe: 1.669 ± 0.44
4.129ValGly: 4.129 ± 0.575
0.879ValHis: 0.879 ± 0.242
4.129ValIle: 4.129 ± 0.576
5.183ValLys: 5.183 ± 0.857
4.92ValLeu: 4.92 ± 0.753
1.054ValMet: 1.054 ± 0.289
3.25ValAsn: 3.25 ± 0.432
2.108ValPro: 2.108 ± 0.493
1.757ValGln: 1.757 ± 0.331
2.548ValArg: 2.548 ± 0.557
6.413ValSer: 6.413 ± 0.729
4.92ValThr: 4.92 ± 0.804
4.832ValVal: 4.832 ± 0.61
1.318ValTrp: 1.318 ± 0.38
2.46ValTyr: 2.46 ± 0.45
0.0ValXaa: 0.0 ± 0.0
Trp
0.351TrpAla: 0.351 ± 0.179
0.351TrpCys: 0.351 ± 0.13
0.439TrpAsp: 0.439 ± 0.165
1.054TrpGlu: 1.054 ± 0.339
0.879TrpPhe: 0.879 ± 0.253
0.703TrpGly: 0.703 ± 0.227
0.088TrpHis: 0.088 ± 0.082
0.351TrpIle: 0.351 ± 0.161
1.054TrpLys: 1.054 ± 0.289
1.318TrpLeu: 1.318 ± 0.354
0.088TrpMet: 0.088 ± 0.097
1.318TrpAsn: 1.318 ± 0.706
0.088TrpPro: 0.088 ± 0.101
0.791TrpGln: 0.791 ± 0.226
0.351TrpArg: 0.351 ± 0.213
1.142TrpSer: 1.142 ± 0.306
1.318TrpThr: 1.318 ± 0.402
0.703TrpVal: 0.703 ± 0.209
0.439TrpTrp: 0.439 ± 0.193
0.879TrpTyr: 0.879 ± 0.272
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.845TyrAla: 1.845 ± 0.488
0.264TyrCys: 0.264 ± 0.144
4.393TyrAsp: 4.393 ± 0.683
2.196TyrGlu: 2.196 ± 0.47
1.845TyrPhe: 1.845 ± 0.293
2.636TyrGly: 2.636 ± 0.55
0.351TyrHis: 0.351 ± 0.196
3.426TyrIle: 3.426 ± 0.672
2.548TyrLys: 2.548 ± 0.479
3.075TyrLeu: 3.075 ± 0.536
1.23TyrMet: 1.23 ± 0.302
2.46TyrAsn: 2.46 ± 0.415
1.23TyrPro: 1.23 ± 0.331
1.757TyrGln: 1.757 ± 0.568
1.669TyrArg: 1.669 ± 0.44
3.25TyrSer: 3.25 ± 0.533
2.811TyrThr: 2.811 ± 0.553
1.933TyrVal: 1.933 ± 0.325
0.615TyrTrp: 0.615 ± 0.229
1.933TyrTyr: 1.933 ± 0.464
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (11384 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski