Amino acid dipepetide frequency for Streptococcus phage Javan516

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.333AlaAla: 3.333 ± 1.088
0.435AlaCys: 0.435 ± 0.173
4.275AlaAsp: 4.275 ± 0.671
4.565AlaGlu: 4.565 ± 0.615
3.261AlaPhe: 3.261 ± 0.605
4.42AlaGly: 4.42 ± 1.022
0.217AlaHis: 0.217 ± 0.119
5.87AlaIle: 5.87 ± 0.938
6.739AlaLys: 6.739 ± 0.802
5.797AlaLeu: 5.797 ± 0.76
2.754AlaMet: 2.754 ± 0.548
3.696AlaAsn: 3.696 ± 0.617
0.87AlaPro: 0.87 ± 0.204
2.609AlaGln: 2.609 ± 0.484
2.174AlaArg: 2.174 ± 0.315
4.275AlaSer: 4.275 ± 0.869
4.348AlaThr: 4.348 ± 0.771
4.348AlaVal: 4.348 ± 1.131
0.87AlaTrp: 0.87 ± 0.182
2.029AlaTyr: 2.029 ± 0.333
0.0AlaXaa: 0.0 ± 0.0
Cys
0.145CysAla: 0.145 ± 0.109
0.0CysCys: 0.0 ± 0.0
0.362CysAsp: 0.362 ± 0.151
0.435CysGlu: 0.435 ± 0.195
0.072CysPhe: 0.072 ± 0.092
0.362CysGly: 0.362 ± 0.158
0.0CysHis: 0.0 ± 0.0
0.58CysIle: 0.58 ± 0.184
0.362CysLys: 0.362 ± 0.16
0.362CysLeu: 0.362 ± 0.182
0.072CysMet: 0.072 ± 0.059
0.362CysAsn: 0.362 ± 0.2
0.0CysPro: 0.0 ± 0.0
0.145CysGln: 0.145 ± 0.102
0.217CysArg: 0.217 ± 0.116
0.145CysSer: 0.145 ± 0.155
0.145CysThr: 0.145 ± 0.106
0.217CysVal: 0.217 ± 0.147
0.29CysTrp: 0.29 ± 0.136
0.145CysTyr: 0.145 ± 0.1
0.0CysXaa: 0.0 ± 0.0
Asp
3.551AspAla: 3.551 ± 0.441
0.435AspCys: 0.435 ± 0.176
3.986AspAsp: 3.986 ± 0.493
4.13AspGlu: 4.13 ± 0.518
3.406AspPhe: 3.406 ± 0.536
4.275AspGly: 4.275 ± 0.579
0.87AspHis: 0.87 ± 0.251
5.145AspIle: 5.145 ± 0.707
6.159AspLys: 6.159 ± 0.656
6.522AspLeu: 6.522 ± 0.687
1.594AspMet: 1.594 ± 0.278
3.043AspAsn: 3.043 ± 0.504
1.159AspPro: 1.159 ± 0.281
1.304AspGln: 1.304 ± 0.34
2.246AspArg: 2.246 ± 0.362
4.058AspSer: 4.058 ± 0.567
3.551AspThr: 3.551 ± 0.534
4.493AspVal: 4.493 ± 0.358
0.58AspTrp: 0.58 ± 0.198
3.768AspTyr: 3.768 ± 0.777
0.0AspXaa: 0.0 ± 0.0
Glu
5.0GluAla: 5.0 ± 0.809
0.217GluCys: 0.217 ± 0.127
3.116GluAsp: 3.116 ± 0.585
5.87GluGlu: 5.87 ± 0.902
3.478GluPhe: 3.478 ± 0.505
3.623GluGly: 3.623 ± 0.502
1.014GluHis: 1.014 ± 0.291
5.652GluIle: 5.652 ± 0.723
5.435GluLys: 5.435 ± 0.68
7.899GluLeu: 7.899 ± 0.752
2.609GluMet: 2.609 ± 0.445
3.188GluAsn: 3.188 ± 0.543
1.667GluPro: 1.667 ± 0.357
3.043GluGln: 3.043 ± 0.485
3.043GluArg: 3.043 ± 0.445
4.855GluSer: 4.855 ± 0.655
3.406GluThr: 3.406 ± 0.561
5.0GluVal: 5.0 ± 0.731
0.87GluTrp: 0.87 ± 0.261
2.899GluTyr: 2.899 ± 0.43
0.0GluXaa: 0.0 ± 0.0
Phe
2.464PheAla: 2.464 ± 0.59
0.435PheCys: 0.435 ± 0.196
3.768PheAsp: 3.768 ± 0.51
2.899PheGlu: 2.899 ± 0.42
1.667PhePhe: 1.667 ± 0.417
3.116PheGly: 3.116 ± 0.559
0.435PheHis: 0.435 ± 0.175
2.174PheIle: 2.174 ± 0.482
4.928PheLys: 4.928 ± 0.642
2.971PheLeu: 2.971 ± 0.566
1.087PheMet: 1.087 ± 0.307
2.464PheAsn: 2.464 ± 0.436
0.87PhePro: 0.87 ± 0.233
1.087PheGln: 1.087 ± 0.216
2.174PheArg: 2.174 ± 0.398
2.754PheSer: 2.754 ± 0.466
2.464PheThr: 2.464 ± 0.385
2.464PheVal: 2.464 ± 0.463
0.217PheTrp: 0.217 ± 0.099
1.304PheTyr: 1.304 ± 0.275
0.0PheXaa: 0.0 ± 0.0
Gly
3.913GlyAla: 3.913 ± 0.969
0.072GlyCys: 0.072 ± 0.066
4.42GlyAsp: 4.42 ± 0.925
3.551GlyGlu: 3.551 ± 0.52
3.116GlyPhe: 3.116 ± 0.607
3.333GlyGly: 3.333 ± 0.492
1.014GlyHis: 1.014 ± 0.258
4.42GlyIle: 4.42 ± 0.729
6.594GlyLys: 6.594 ± 0.819
6.449GlyLeu: 6.449 ± 0.946
1.739GlyMet: 1.739 ± 0.317
3.116GlyAsn: 3.116 ± 0.354
0.58GlyPro: 0.58 ± 0.2
2.029GlyGln: 2.029 ± 0.386
2.754GlyArg: 2.754 ± 0.446
2.754GlySer: 2.754 ± 0.566
4.493GlyThr: 4.493 ± 0.6
3.406GlyVal: 3.406 ± 0.488
0.507GlyTrp: 0.507 ± 0.191
3.261GlyTyr: 3.261 ± 0.619
0.0GlyXaa: 0.0 ± 0.0
His
0.87HisAla: 0.87 ± 0.247
0.0HisCys: 0.0 ± 0.0
0.725HisAsp: 0.725 ± 0.275
1.159HisGlu: 1.159 ± 0.267
0.797HisPhe: 0.797 ± 0.248
0.942HisGly: 0.942 ± 0.324
0.29HisHis: 0.29 ± 0.18
1.087HisIle: 1.087 ± 0.319
1.304HisLys: 1.304 ± 0.376
0.87HisLeu: 0.87 ± 0.278
0.29HisMet: 0.29 ± 0.166
1.087HisAsn: 1.087 ± 0.333
0.507HisPro: 0.507 ± 0.18
0.652HisGln: 0.652 ± 0.22
0.507HisArg: 0.507 ± 0.175
0.942HisSer: 0.942 ± 0.315
0.58HisThr: 0.58 ± 0.199
1.159HisVal: 1.159 ± 0.288
0.217HisTrp: 0.217 ± 0.13
0.58HisTyr: 0.58 ± 0.212
0.0HisXaa: 0.0 ± 0.0
Ile
4.855IleAla: 4.855 ± 0.636
0.362IleCys: 0.362 ± 0.163
6.812IleAsp: 6.812 ± 0.721
5.435IleGlu: 5.435 ± 0.583
2.319IlePhe: 2.319 ± 0.462
5.217IleGly: 5.217 ± 0.735
1.594IleHis: 1.594 ± 0.361
5.217IleIle: 5.217 ± 0.637
6.884IleLys: 6.884 ± 0.762
5.507IleLeu: 5.507 ± 0.747
0.942IleMet: 0.942 ± 0.273
5.58IleAsn: 5.58 ± 0.605
1.594IlePro: 1.594 ± 0.36
2.101IleGln: 2.101 ± 0.388
3.188IleArg: 3.188 ± 0.56
5.072IleSer: 5.072 ± 0.838
5.435IleThr: 5.435 ± 0.69
3.913IleVal: 3.913 ± 0.461
0.58IleTrp: 0.58 ± 0.224
2.754IleTyr: 2.754 ± 0.431
0.0IleXaa: 0.0 ± 0.0
Lys
6.304LysAla: 6.304 ± 0.617
0.145LysCys: 0.145 ± 0.084
5.145LysAsp: 5.145 ± 0.728
6.739LysGlu: 6.739 ± 0.817
2.826LysPhe: 2.826 ± 0.465
4.855LysGly: 4.855 ± 0.712
1.014LysHis: 1.014 ± 0.236
7.319LysIle: 7.319 ± 0.814
7.391LysLys: 7.391 ± 0.919
8.333LysLeu: 8.333 ± 0.845
2.536LysMet: 2.536 ± 0.444
5.652LysAsn: 5.652 ± 0.557
2.464LysPro: 2.464 ± 0.396
4.928LysGln: 4.928 ± 0.557
4.203LysArg: 4.203 ± 0.673
5.217LysSer: 5.217 ± 0.551
5.87LysThr: 5.87 ± 0.804
6.159LysVal: 6.159 ± 0.708
0.942LysTrp: 0.942 ± 0.226
3.406LysTyr: 3.406 ± 0.589
0.0LysXaa: 0.0 ± 0.0
Leu
5.942LeuAla: 5.942 ± 0.961
0.435LeuCys: 0.435 ± 0.252
6.304LeuAsp: 6.304 ± 0.592
7.391LeuGlu: 7.391 ± 0.744
3.696LeuPhe: 3.696 ± 0.485
4.928LeuGly: 4.928 ± 0.769
0.652LeuHis: 0.652 ± 0.224
6.232LeuIle: 6.232 ± 0.764
8.116LeuLys: 8.116 ± 0.958
7.174LeuLeu: 7.174 ± 0.872
1.957LeuMet: 1.957 ± 0.382
5.072LeuAsn: 5.072 ± 0.737
2.826LeuPro: 2.826 ± 0.421
3.551LeuGln: 3.551 ± 0.536
2.536LeuArg: 2.536 ± 0.44
7.101LeuSer: 7.101 ± 0.697
6.014LeuThr: 6.014 ± 0.71
3.841LeuVal: 3.841 ± 0.533
0.362LeuTrp: 0.362 ± 0.17
2.246LeuTyr: 2.246 ± 0.515
0.0LeuXaa: 0.0 ± 0.0
Met
2.464MetAla: 2.464 ± 0.553
0.0MetCys: 0.0 ± 0.0
1.739MetAsp: 1.739 ± 0.343
1.667MetGlu: 1.667 ± 0.375
1.087MetPhe: 1.087 ± 0.31
0.942MetGly: 0.942 ± 0.271
0.435MetHis: 0.435 ± 0.184
1.812MetIle: 1.812 ± 0.274
1.594MetLys: 1.594 ± 0.346
1.884MetLeu: 1.884 ± 0.513
0.217MetMet: 0.217 ± 0.121
1.594MetAsn: 1.594 ± 0.322
0.58MetPro: 0.58 ± 0.241
1.449MetGln: 1.449 ± 0.3
1.522MetArg: 1.522 ± 0.331
1.304MetSer: 1.304 ± 0.309
2.101MetThr: 2.101 ± 0.337
1.159MetVal: 1.159 ± 0.296
0.145MetTrp: 0.145 ± 0.107
0.725MetTyr: 0.725 ± 0.215
0.0MetXaa: 0.0 ± 0.0
Asn
4.13AsnAla: 4.13 ± 0.647
0.29AsnCys: 0.29 ± 0.135
2.609AsnAsp: 2.609 ± 0.422
3.478AsnGlu: 3.478 ± 0.576
2.391AsnPhe: 2.391 ± 0.439
4.928AsnGly: 4.928 ± 0.602
1.014AsnHis: 1.014 ± 0.253
3.696AsnIle: 3.696 ± 0.608
4.42AsnLys: 4.42 ± 0.644
4.783AsnLeu: 4.783 ± 0.523
1.087AsnMet: 1.087 ± 0.243
3.116AsnAsn: 3.116 ± 0.553
1.377AsnPro: 1.377 ± 0.296
2.464AsnGln: 2.464 ± 0.431
2.319AsnArg: 2.319 ± 0.465
3.188AsnSer: 3.188 ± 0.559
2.681AsnThr: 2.681 ± 0.494
2.754AsnVal: 2.754 ± 0.38
1.014AsnTrp: 1.014 ± 0.303
2.319AsnTyr: 2.319 ± 0.383
0.0AsnXaa: 0.0 ± 0.0
Pro
1.087ProAla: 1.087 ± 0.315
0.0ProCys: 0.0 ± 0.0
1.304ProAsp: 1.304 ± 0.333
1.522ProGlu: 1.522 ± 0.305
1.594ProPhe: 1.594 ± 0.352
0.652ProGly: 0.652 ± 0.193
0.507ProHis: 0.507 ± 0.166
1.884ProIle: 1.884 ± 0.317
3.261ProLys: 3.261 ± 0.489
1.594ProLeu: 1.594 ± 0.401
0.507ProMet: 0.507 ± 0.215
1.667ProAsn: 1.667 ± 0.315
0.87ProPro: 0.87 ± 0.24
0.725ProGln: 0.725 ± 0.209
1.159ProArg: 1.159 ± 0.24
1.377ProSer: 1.377 ± 0.258
1.812ProThr: 1.812 ± 0.337
1.377ProVal: 1.377 ± 0.361
0.507ProTrp: 0.507 ± 0.192
1.449ProTyr: 1.449 ± 0.41
0.0ProXaa: 0.0 ± 0.0
Gln
4.058GlnAla: 4.058 ± 0.78
0.29GlnCys: 0.29 ± 0.162
1.594GlnAsp: 1.594 ± 0.282
2.899GlnGlu: 2.899 ± 0.449
1.449GlnPhe: 1.449 ± 0.318
2.681GlnGly: 2.681 ± 0.447
0.362GlnHis: 0.362 ± 0.174
3.478GlnIle: 3.478 ± 0.441
3.696GlnLys: 3.696 ± 0.518
3.406GlnLeu: 3.406 ± 0.519
0.725GlnMet: 0.725 ± 0.235
2.319GlnAsn: 2.319 ± 0.501
1.014GlnPro: 1.014 ± 0.239
2.391GlnGln: 2.391 ± 0.55
1.739GlnArg: 1.739 ± 0.394
4.058GlnSer: 4.058 ± 0.582
2.391GlnThr: 2.391 ± 0.424
2.101GlnVal: 2.101 ± 0.373
0.58GlnTrp: 0.58 ± 0.209
1.232GlnTyr: 1.232 ± 0.264
0.0GlnXaa: 0.0 ± 0.0
Arg
2.319ArgAla: 2.319 ± 0.465
0.145ArgCys: 0.145 ± 0.108
1.884ArgAsp: 1.884 ± 0.351
2.754ArgGlu: 2.754 ± 0.529
1.522ArgPhe: 1.522 ± 0.356
2.319ArgGly: 2.319 ± 0.348
0.87ArgHis: 0.87 ± 0.239
3.261ArgIle: 3.261 ± 0.67
4.783ArgLys: 4.783 ± 0.68
3.913ArgLeu: 3.913 ± 0.498
1.014ArgMet: 1.014 ± 0.282
1.667ArgAsn: 1.667 ± 0.376
1.014ArgPro: 1.014 ± 0.3
1.884ArgGln: 1.884 ± 0.333
2.029ArgArg: 2.029 ± 0.414
2.029ArgSer: 2.029 ± 0.343
2.174ArgThr: 2.174 ± 0.388
2.609ArgVal: 2.609 ± 0.521
0.217ArgTrp: 0.217 ± 0.103
1.739ArgTyr: 1.739 ± 0.385
0.0ArgXaa: 0.0 ± 0.0
Ser
5.072SerAla: 5.072 ± 1.253
0.145SerCys: 0.145 ± 0.094
4.203SerAsp: 4.203 ± 0.581
4.638SerGlu: 4.638 ± 0.683
2.319SerPhe: 2.319 ± 0.395
4.13SerGly: 4.13 ± 0.751
1.304SerHis: 1.304 ± 0.346
4.565SerIle: 4.565 ± 0.421
5.58SerLys: 5.58 ± 0.698
4.565SerLeu: 4.565 ± 0.586
2.174SerMet: 2.174 ± 0.532
3.188SerAsn: 3.188 ± 0.504
1.304SerPro: 1.304 ± 0.353
3.913SerGln: 3.913 ± 0.658
2.246SerArg: 2.246 ± 0.388
4.71SerSer: 4.71 ± 0.844
3.261SerThr: 3.261 ± 0.51
4.565SerVal: 4.565 ± 0.546
0.797SerTrp: 0.797 ± 0.231
2.464SerTyr: 2.464 ± 0.566
0.0SerXaa: 0.0 ± 0.0
Thr
4.42ThrAla: 4.42 ± 0.771
0.29ThrCys: 0.29 ± 0.154
4.42ThrAsp: 4.42 ± 0.517
3.261ThrGlu: 3.261 ± 0.585
2.536ThrPhe: 2.536 ± 0.467
4.275ThrGly: 4.275 ± 0.638
1.159ThrHis: 1.159 ± 0.27
4.783ThrIle: 4.783 ± 0.562
5.29ThrLys: 5.29 ± 0.924
5.725ThrLeu: 5.725 ± 0.722
1.014ThrMet: 1.014 ± 0.287
2.971ThrAsn: 2.971 ± 0.573
2.246ThrPro: 2.246 ± 0.394
2.826ThrGln: 2.826 ± 0.508
1.667ThrArg: 1.667 ± 0.267
3.913ThrSer: 3.913 ± 0.564
4.42ThrThr: 4.42 ± 0.685
4.203ThrVal: 4.203 ± 0.594
0.435ThrTrp: 0.435 ± 0.179
2.174ThrTyr: 2.174 ± 0.325
0.0ThrXaa: 0.0 ± 0.0
Val
4.348ValAla: 4.348 ± 0.852
0.29ValCys: 0.29 ± 0.134
3.406ValAsp: 3.406 ± 0.457
5.362ValGlu: 5.362 ± 0.674
2.246ValPhe: 2.246 ± 0.362
3.986ValGly: 3.986 ± 0.57
0.797ValHis: 0.797 ± 0.263
4.565ValIle: 4.565 ± 0.623
5.217ValLys: 5.217 ± 0.704
4.058ValLeu: 4.058 ± 0.503
1.159ValMet: 1.159 ± 0.295
2.464ValAsn: 2.464 ± 0.369
2.246ValPro: 2.246 ± 0.359
2.246ValGln: 2.246 ± 0.327
2.464ValArg: 2.464 ± 0.512
4.42ValSer: 4.42 ± 0.587
4.275ValThr: 4.275 ± 0.497
3.986ValVal: 3.986 ± 0.513
0.507ValTrp: 0.507 ± 0.14
1.522ValTyr: 1.522 ± 0.343
0.0ValXaa: 0.0 ± 0.0
Trp
0.58TrpAla: 0.58 ± 0.266
0.145TrpCys: 0.145 ± 0.101
0.797TrpAsp: 0.797 ± 0.217
0.725TrpGlu: 0.725 ± 0.245
0.507TrpPhe: 0.507 ± 0.196
0.797TrpGly: 0.797 ± 0.224
0.145TrpHis: 0.145 ± 0.098
0.58TrpIle: 0.58 ± 0.189
0.942TrpLys: 0.942 ± 0.205
1.594TrpLeu: 1.594 ± 0.405
0.435TrpMet: 0.435 ± 0.153
0.145TrpAsn: 0.145 ± 0.1
0.217TrpPro: 0.217 ± 0.126
0.507TrpGln: 0.507 ± 0.162
0.797TrpArg: 0.797 ± 0.214
0.507TrpSer: 0.507 ± 0.169
0.072TrpThr: 0.072 ± 0.077
0.217TrpVal: 0.217 ± 0.123
0.072TrpTrp: 0.072 ± 0.07
0.435TrpTyr: 0.435 ± 0.154
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.029TyrAla: 2.029 ± 0.372
0.29TyrCys: 0.29 ± 0.148
3.406TyrAsp: 3.406 ± 0.479
3.406TyrGlu: 3.406 ± 0.495
1.449TyrPhe: 1.449 ± 0.41
1.739TyrGly: 1.739 ± 0.329
0.87TyrHis: 0.87 ± 0.336
3.043TyrIle: 3.043 ± 0.434
2.826TyrLys: 2.826 ± 0.558
2.971TyrLeu: 2.971 ± 0.629
0.435TyrMet: 0.435 ± 0.165
1.667TyrAsn: 1.667 ± 0.321
1.449TyrPro: 1.449 ± 0.45
2.609TyrGln: 2.609 ± 0.53
1.159TyrArg: 1.159 ± 0.316
2.464TyrSer: 2.464 ± 0.423
2.536TyrThr: 2.536 ± 0.371
1.594TyrVal: 1.594 ± 0.46
0.507TyrTrp: 0.507 ± 0.201
2.174TyrTyr: 2.174 ± 0.514
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (13801 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski