Amino acid dipeptide frequency for Streptococcus phage Javan271

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
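The counting described above can be sketched in Python. This is only an illustration under stated assumptions, not the database's actual pipeline: the function names, the use of one-letter codes, counting overlapping pairs within each protein, and normalising by the total number of pairs counted are all assumptions made here.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (non-standard or unknown)
UNIFORM_PERMILLE = 1000.0 / 400       # 0.25% expressed in per mille


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping pairs are counted inside each protein and the
    counts are divided by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard symbol to X before pairing.
        cleaned = [aa if aa in AMINO_ACIDS else "X" for aa in seq.upper()]
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(ALPHABET, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = dipeptide_frequencies(["AAAC", "ACB"])
print(len(toy), toy["AA"], classify(toy["AA"]), classify(toy["WW"]))
```

With this convention, a table entry such as LysLys (7.668‰) would be classified as "over" and CysCys (0.0‰) as "under".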

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
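As a small worked example of the unit conversion, the following Python snippet converts a table value from per mille to a fraction and to percent. The helper names are hypothetical; the GluLeu value is taken from the table on this page.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


glu_leu = 7.927  # GluLeu entry from the table, in per mille
uniform_permille = 1000.0 / 400  # uniform expectation, 0.25% in per mille

print(permille_to_fraction(glu_leu))
print(permille_to_percent(glu_leu))
print(uniform_permille)
```

The uniform expectation of 0.25% corresponds to 2.5‰ in the table's units, which is the value to compare each entry against.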

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.948AlaAla: 0.948 ± 0.327
0.345AlaCys: 0.345 ± 0.163
3.791AlaAsp: 3.791 ± 0.643
5.859AlaGlu: 5.859 ± 0.661
2.843AlaPhe: 2.843 ± 0.446
3.705AlaGly: 3.705 ± 0.786
1.034AlaHis: 1.034 ± 0.313
5.945AlaIle: 5.945 ± 0.723
6.118AlaLys: 6.118 ± 0.897
3.877AlaLeu: 3.877 ± 0.506
1.982AlaMet: 1.982 ± 0.357
4.997AlaAsn: 4.997 ± 0.866
1.551AlaPro: 1.551 ± 0.388
1.723AlaGln: 1.723 ± 0.374
2.326AlaArg: 2.326 ± 0.516
4.136AlaSer: 4.136 ± 0.644
4.136AlaThr: 4.136 ± 0.736
4.05AlaVal: 4.05 ± 0.768
1.034AlaTrp: 1.034 ± 0.333
2.326AlaTyr: 2.326 ± 0.515
0.0AlaXaa: 0.0 ± 0.0
Cys
0.086CysAla: 0.086 ± 0.083
0.0CysCys: 0.0 ± 0.0
0.775CysAsp: 0.775 ± 0.265
0.517CysGlu: 0.517 ± 0.164
0.0CysPhe: 0.0 ± 0.0
0.345CysGly: 0.345 ± 0.361
0.086CysHis: 0.086 ± 0.082
0.517CysIle: 0.517 ± 0.213
0.345CysLys: 0.345 ± 0.158
0.258CysLeu: 0.258 ± 0.199
0.086CysMet: 0.086 ± 0.09
0.431CysAsn: 0.431 ± 0.161
0.172CysPro: 0.172 ± 0.134
0.258CysGln: 0.258 ± 0.16
0.258CysArg: 0.258 ± 0.147
0.345CysSer: 0.345 ± 0.157
0.172CysThr: 0.172 ± 0.119
0.0CysVal: 0.0 ± 0.0
0.086CysTrp: 0.086 ± 0.094
0.517CysTyr: 0.517 ± 0.248
0.0CysXaa: 0.0 ± 0.0
Asp
3.446AspAla: 3.446 ± 0.505
0.775AspCys: 0.775 ± 0.257
3.619AspAsp: 3.619 ± 0.683
5.773AspGlu: 5.773 ± 1.042
2.499AspPhe: 2.499 ± 0.403
4.567AspGly: 4.567 ± 0.611
0.345AspHis: 0.345 ± 0.201
4.739AspIle: 4.739 ± 0.576
4.308AspLys: 4.308 ± 0.574
6.204AspLeu: 6.204 ± 0.748
1.637AspMet: 1.637 ± 0.352
5.342AspAsn: 5.342 ± 0.681
0.948AspPro: 0.948 ± 0.289
0.948AspGln: 0.948 ± 0.239
1.982AspArg: 1.982 ± 0.478
3.274AspSer: 3.274 ± 0.622
3.36AspThr: 3.36 ± 0.539
3.963AspVal: 3.963 ± 0.525
0.689AspTrp: 0.689 ± 0.255
3.446AspTyr: 3.446 ± 0.492
0.0AspXaa: 0.0 ± 0.0
Glu
4.567GluAla: 4.567 ± 0.749
0.086GluCys: 0.086 ± 0.09
4.308GluAsp: 4.308 ± 0.811
6.031GluGlu: 6.031 ± 0.832
3.016GluPhe: 3.016 ± 0.473
2.326GluGly: 2.326 ± 0.444
0.603GluHis: 0.603 ± 0.261
5.859GluIle: 5.859 ± 0.694
5.773GluLys: 5.773 ± 0.849
7.927GluLeu: 7.927 ± 0.859
1.809GluMet: 1.809 ± 0.467
4.911GluAsn: 4.911 ± 0.647
1.896GluPro: 1.896 ± 0.413
3.188GluGln: 3.188 ± 0.613
3.016GluArg: 3.016 ± 0.583
3.705GluSer: 3.705 ± 0.699
4.911GluThr: 4.911 ± 0.662
5.514GluVal: 5.514 ± 0.844
1.206GluTrp: 1.206 ± 0.351
2.585GluTyr: 2.585 ± 0.526
0.0GluXaa: 0.0 ± 0.0
Phe
2.757PheAla: 2.757 ± 0.443
0.086PheCys: 0.086 ± 0.087
3.102PheAsp: 3.102 ± 0.647
2.843PheGlu: 2.843 ± 0.546
0.862PhePhe: 0.862 ± 0.338
2.499PheGly: 2.499 ± 0.497
0.345PheHis: 0.345 ± 0.166
2.843PheIle: 2.843 ± 0.433
3.963PheLys: 3.963 ± 0.463
1.982PheLeu: 1.982 ± 0.361
1.034PheMet: 1.034 ± 0.382
3.963PheAsn: 3.963 ± 0.61
0.862PhePro: 0.862 ± 0.272
1.292PheGln: 1.292 ± 0.324
1.465PheArg: 1.465 ± 0.326
2.585PheSer: 2.585 ± 0.437
2.499PheThr: 2.499 ± 0.4
2.843PheVal: 2.843 ± 0.375
0.345PheTrp: 0.345 ± 0.155
1.292PheTyr: 1.292 ± 0.328
0.0PheXaa: 0.0 ± 0.0
Gly
3.877GlyAla: 3.877 ± 0.603
0.172GlyCys: 0.172 ± 0.18
3.446GlyAsp: 3.446 ± 0.508
2.585GlyGlu: 2.585 ± 0.398
2.24GlyPhe: 2.24 ± 0.483
4.308GlyGly: 4.308 ± 1.175
1.465GlyHis: 1.465 ± 0.351
4.911GlyIle: 4.911 ± 0.823
5.859GlyLys: 5.859 ± 0.743
4.05GlyLeu: 4.05 ± 0.554
2.499GlyMet: 2.499 ± 0.419
4.308GlyAsn: 4.308 ± 0.517
0.775GlyPro: 0.775 ± 0.301
2.499GlyGln: 2.499 ± 0.441
2.154GlyArg: 2.154 ± 0.485
4.05GlySer: 4.05 ± 0.879
5.084GlyThr: 5.084 ± 0.981
4.48GlyVal: 4.48 ± 0.737
1.637GlyTrp: 1.637 ± 0.382
3.016GlyTyr: 3.016 ± 0.374
0.0GlyXaa: 0.0 ± 0.0
His
1.206HisAla: 1.206 ± 0.378
0.345HisCys: 0.345 ± 0.204
1.034HisAsp: 1.034 ± 0.312
0.689HisGlu: 0.689 ± 0.193
0.431HisPhe: 0.431 ± 0.178
0.775HisGly: 0.775 ± 0.25
0.345HisHis: 0.345 ± 0.268
0.948HisIle: 0.948 ± 0.308
1.12HisLys: 1.12 ± 0.247
1.206HisLeu: 1.206 ± 0.283
0.431HisMet: 0.431 ± 0.173
1.034HisAsn: 1.034 ± 0.288
0.258HisPro: 0.258 ± 0.173
0.431HisGln: 0.431 ± 0.197
0.431HisArg: 0.431 ± 0.173
0.689HisSer: 0.689 ± 0.215
0.775HisThr: 0.775 ± 0.246
0.775HisVal: 0.775 ± 0.255
0.172HisTrp: 0.172 ± 0.123
0.689HisTyr: 0.689 ± 0.281
0.0HisXaa: 0.0 ± 0.0
Ile
4.997IleAla: 4.997 ± 0.73
0.431IleCys: 0.431 ± 0.177
5.514IleAsp: 5.514 ± 0.668
5.687IleGlu: 5.687 ± 0.716
2.757IlePhe: 2.757 ± 0.474
4.136IleGly: 4.136 ± 0.62
0.948IleHis: 0.948 ± 0.252
5.256IleIle: 5.256 ± 0.676
6.634IleLys: 6.634 ± 0.784
4.567IleLeu: 4.567 ± 0.495
1.12IleMet: 1.12 ± 0.277
4.653IleAsn: 4.653 ± 0.535
2.326IlePro: 2.326 ± 0.438
1.637IleGln: 1.637 ± 0.372
2.068IleArg: 2.068 ± 0.47
6.548IleSer: 6.548 ± 0.763
5.859IleThr: 5.859 ± 0.767
4.05IleVal: 4.05 ± 0.548
1.206IleTrp: 1.206 ± 0.308
2.413IleTyr: 2.413 ± 0.394
0.0IleXaa: 0.0 ± 0.0
Lys
7.324LysAla: 7.324 ± 0.963
0.431LysCys: 0.431 ± 0.233
4.825LysAsp: 4.825 ± 0.658
7.151LysGlu: 7.151 ± 0.947
3.274LysPhe: 3.274 ± 0.498
5.428LysGly: 5.428 ± 0.477
1.206LysHis: 1.206 ± 0.378
5.256LysIle: 5.256 ± 0.717
7.668LysLys: 7.668 ± 0.975
6.031LysLeu: 6.031 ± 0.878
2.24LysMet: 2.24 ± 0.536
4.308LysAsn: 4.308 ± 0.82
2.068LysPro: 2.068 ± 0.389
3.963LysGln: 3.963 ± 0.723
4.308LysArg: 4.308 ± 0.663
6.031LysSer: 6.031 ± 0.737
6.118LysThr: 6.118 ± 0.563
5.428LysVal: 5.428 ± 0.686
1.206LysTrp: 1.206 ± 0.341
4.911LysTyr: 4.911 ± 0.715
0.0LysXaa: 0.0 ± 0.0
Leu
6.031LeuAla: 6.031 ± 0.642
0.603LeuCys: 0.603 ± 0.183
4.48LeuAsp: 4.48 ± 0.606
6.204LeuGlu: 6.204 ± 0.8
2.93LeuPhe: 2.93 ± 0.535
4.911LeuGly: 4.911 ± 0.554
1.379LeuHis: 1.379 ± 0.347
6.031LeuIle: 6.031 ± 0.622
7.324LeuLys: 7.324 ± 0.774
7.238LeuLeu: 7.238 ± 0.917
2.068LeuMet: 2.068 ± 0.485
5.17LeuAsn: 5.17 ± 0.763
2.413LeuPro: 2.413 ± 0.602
2.326LeuGln: 2.326 ± 0.534
2.93LeuArg: 2.93 ± 0.478
6.634LeuSer: 6.634 ± 0.668
6.462LeuThr: 6.462 ± 0.637
3.446LeuVal: 3.446 ± 0.503
0.862LeuTrp: 0.862 ± 0.245
2.585LeuTyr: 2.585 ± 0.503
0.0LeuXaa: 0.0 ± 0.0
Met
1.465MetAla: 1.465 ± 0.397
0.172MetCys: 0.172 ± 0.129
1.465MetAsp: 1.465 ± 0.349
1.206MetGlu: 1.206 ± 0.345
0.862MetPhe: 0.862 ± 0.283
1.292MetGly: 1.292 ± 0.292
0.517MetHis: 0.517 ± 0.223
1.379MetIle: 1.379 ± 0.397
2.154MetLys: 2.154 ± 0.487
1.637MetLeu: 1.637 ± 0.435
0.775MetMet: 0.775 ± 0.266
1.982MetAsn: 1.982 ± 0.361
0.431MetPro: 0.431 ± 0.224
1.637MetGln: 1.637 ± 0.389
0.948MetArg: 0.948 ± 0.287
1.637MetSer: 1.637 ± 0.53
2.499MetThr: 2.499 ± 0.552
1.292MetVal: 1.292 ± 0.24
0.345MetTrp: 0.345 ± 0.165
0.603MetTyr: 0.603 ± 0.187
0.0MetXaa: 0.0 ± 0.0
Asn
4.911AsnAla: 4.911 ± 0.776
0.086AsnCys: 0.086 ± 0.072
3.705AsnAsp: 3.705 ± 0.799
4.911AsnGlu: 4.911 ± 0.81
2.585AsnPhe: 2.585 ± 0.417
5.17AsnGly: 5.17 ± 0.662
1.206AsnHis: 1.206 ± 0.364
3.877AsnIle: 3.877 ± 0.593
5.773AsnLys: 5.773 ± 0.745
5.601AsnLeu: 5.601 ± 0.815
1.465AsnMet: 1.465 ± 0.381
4.136AsnAsn: 4.136 ± 0.544
2.068AsnPro: 2.068 ± 0.393
2.585AsnGln: 2.585 ± 0.394
2.326AsnArg: 2.326 ± 0.49
3.533AsnSer: 3.533 ± 0.438
3.619AsnThr: 3.619 ± 0.677
4.394AsnVal: 4.394 ± 0.677
1.034AsnTrp: 1.034 ± 0.318
2.413AsnTyr: 2.413 ± 0.514
0.0AsnXaa: 0.0 ± 0.0
Pro
0.862ProAla: 0.862 ± 0.209
0.0ProCys: 0.0 ± 0.0
1.637ProAsp: 1.637 ± 0.322
1.551ProGlu: 1.551 ± 0.436
1.465ProPhe: 1.465 ± 0.307
0.689ProGly: 0.689 ± 0.256
0.345ProHis: 0.345 ± 0.192
2.843ProIle: 2.843 ± 0.436
2.413ProLys: 2.413 ± 0.544
1.896ProLeu: 1.896 ± 0.317
0.862ProMet: 0.862 ± 0.299
1.206ProAsn: 1.206 ± 0.267
0.862ProPro: 0.862 ± 0.273
1.206ProGln: 1.206 ± 0.334
0.948ProArg: 0.948 ± 0.307
1.465ProSer: 1.465 ± 0.476
1.465ProThr: 1.465 ± 0.352
1.379ProVal: 1.379 ± 0.381
0.258ProTrp: 0.258 ± 0.137
1.12ProTyr: 1.12 ± 0.277
0.0ProXaa: 0.0 ± 0.0
Gln
2.843GlnAla: 2.843 ± 0.428
0.086GlnCys: 0.086 ± 0.079
1.379GlnAsp: 1.379 ± 0.36
2.24GlnGlu: 2.24 ± 0.477
1.292GlnPhe: 1.292 ± 0.357
2.671GlnGly: 2.671 ± 0.466
0.775GlnHis: 0.775 ± 0.276
2.068GlnIle: 2.068 ± 0.42
3.877GlnLys: 3.877 ± 0.609
3.188GlnLeu: 3.188 ± 0.545
0.775GlnMet: 0.775 ± 0.259
1.551GlnAsn: 1.551 ± 0.303
0.431GlnPro: 0.431 ± 0.18
0.948GlnGln: 0.948 ± 0.307
1.551GlnArg: 1.551 ± 0.362
1.896GlnSer: 1.896 ± 0.457
2.499GlnThr: 2.499 ± 0.498
2.24GlnVal: 2.24 ± 0.442
0.517GlnTrp: 0.517 ± 0.265
1.379GlnTyr: 1.379 ± 0.355
0.0GlnXaa: 0.0 ± 0.0
Arg
1.206ArgAla: 1.206 ± 0.383
0.086ArgCys: 0.086 ± 0.077
2.413ArgAsp: 2.413 ± 0.443
1.637ArgGlu: 1.637 ± 0.345
1.551ArgPhe: 1.551 ± 0.334
1.896ArgGly: 1.896 ± 0.467
0.258ArgHis: 0.258 ± 0.196
2.499ArgIle: 2.499 ± 0.425
4.222ArgLys: 4.222 ± 0.727
3.705ArgLeu: 3.705 ± 0.631
1.206ArgMet: 1.206 ± 0.298
2.326ArgAsn: 2.326 ± 0.453
0.862ArgPro: 0.862 ± 0.257
1.809ArgGln: 1.809 ± 0.364
1.551ArgArg: 1.551 ± 0.415
2.154ArgSer: 2.154 ± 0.402
2.154ArgThr: 2.154 ± 0.361
1.379ArgVal: 1.379 ± 0.357
0.603ArgTrp: 0.603 ± 0.229
1.982ArgTyr: 1.982 ± 0.448
0.0ArgXaa: 0.0 ± 0.0
Ser
4.222SerAla: 4.222 ± 0.697
0.258SerCys: 0.258 ± 0.207
3.877SerAsp: 3.877 ± 0.555
6.376SerGlu: 6.376 ± 0.757
2.93SerPhe: 2.93 ± 0.569
5.17SerGly: 5.17 ± 1.127
0.775SerHis: 0.775 ± 0.259
5.514SerIle: 5.514 ± 0.708
6.462SerLys: 6.462 ± 0.848
5.256SerLeu: 5.256 ± 0.722
0.862SerMet: 0.862 ± 0.248
3.705SerAsn: 3.705 ± 0.587
1.637SerPro: 1.637 ± 0.406
1.982SerGln: 1.982 ± 0.405
1.379SerArg: 1.379 ± 0.301
4.911SerSer: 4.911 ± 0.717
4.653SerThr: 4.653 ± 0.745
3.36SerVal: 3.36 ± 0.592
0.948SerTrp: 0.948 ± 0.288
1.809SerTyr: 1.809 ± 0.433
0.0SerXaa: 0.0 ± 0.0
Thr
3.446ThrAla: 3.446 ± 0.676
0.345ThrCys: 0.345 ± 0.175
4.136ThrAsp: 4.136 ± 0.629
3.36ThrGlu: 3.36 ± 0.422
3.533ThrPhe: 3.533 ± 0.51
5.773ThrGly: 5.773 ± 1.055
1.034ThrHis: 1.034 ± 0.323
6.204ThrIle: 6.204 ± 0.65
5.514ThrLys: 5.514 ± 1.009
6.204ThrLeu: 6.204 ± 0.592
1.206ThrMet: 1.206 ± 0.327
4.05ThrAsn: 4.05 ± 0.551
1.637ThrPro: 1.637 ± 0.378
2.326ThrGln: 2.326 ± 0.407
1.982ThrArg: 1.982 ± 0.461
4.825ThrSer: 4.825 ± 0.874
4.825ThrThr: 4.825 ± 0.765
5.084ThrVal: 5.084 ± 1.069
1.034ThrTrp: 1.034 ± 0.309
3.963ThrTyr: 3.963 ± 0.419
0.0ThrXaa: 0.0 ± 0.0
Val
4.653ValAla: 4.653 ± 0.711
0.345ValCys: 0.345 ± 0.145
4.739ValAsp: 4.739 ± 0.514
4.739ValGlu: 4.739 ± 0.746
1.982ValPhe: 1.982 ± 0.516
4.136ValGly: 4.136 ± 0.847
0.172ValHis: 0.172 ± 0.104
3.188ValIle: 3.188 ± 0.448
3.877ValLys: 3.877 ± 0.448
4.739ValLeu: 4.739 ± 0.685
1.637ValMet: 1.637 ± 0.435
3.877ValAsn: 3.877 ± 0.62
1.809ValPro: 1.809 ± 0.414
2.068ValGln: 2.068 ± 0.396
1.896ValArg: 1.896 ± 0.459
4.136ValSer: 4.136 ± 0.487
5.773ValThr: 5.773 ± 0.691
3.533ValVal: 3.533 ± 0.667
0.775ValTrp: 0.775 ± 0.22
2.24ValTyr: 2.24 ± 0.369
0.0ValXaa: 0.0 ± 0.0
Trp
0.603TrpAla: 0.603 ± 0.204
0.258TrpCys: 0.258 ± 0.143
0.603TrpAsp: 0.603 ± 0.178
1.12TrpGlu: 1.12 ± 0.262
0.172TrpPhe: 0.172 ± 0.108
1.034TrpGly: 1.034 ± 0.308
0.517TrpHis: 0.517 ± 0.206
0.862TrpIle: 0.862 ± 0.221
1.465TrpLys: 1.465 ± 0.315
1.551TrpLeu: 1.551 ± 0.324
0.086TrpMet: 0.086 ± 0.091
0.948TrpAsn: 0.948 ± 0.221
0.258TrpPro: 0.258 ± 0.152
0.517TrpGln: 0.517 ± 0.251
0.517TrpArg: 0.517 ± 0.166
1.12TrpSer: 1.12 ± 0.323
1.12TrpThr: 1.12 ± 0.301
1.034TrpVal: 1.034 ± 0.279
0.172TrpTrp: 0.172 ± 0.109
0.431TrpTyr: 0.431 ± 0.17
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.274TyrAla: 3.274 ± 0.665
0.345TyrCys: 0.345 ± 0.181
3.016TyrAsp: 3.016 ± 0.438
2.757TyrGlu: 2.757 ± 0.482
2.24TyrPhe: 2.24 ± 0.373
2.585TyrGly: 2.585 ± 0.373
0.431TyrHis: 0.431 ± 0.202
2.068TyrIle: 2.068 ± 0.466
4.308TyrLys: 4.308 ± 0.623
4.911TyrLeu: 4.911 ± 0.713
0.517TyrMet: 0.517 ± 0.208
2.499TyrAsn: 2.499 ± 0.472
1.206TyrPro: 1.206 ± 0.339
0.862TyrGln: 0.862 ± 0.242
1.465TyrArg: 1.465 ± 0.367
2.413TyrSer: 2.413 ± 0.392
2.499TyrThr: 2.499 ± 0.45
2.068TyrVal: 2.068 ± 0.469
0.258TyrTrp: 0.258 ± 0.15
1.379TyrTyr: 1.379 ± 0.344
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (11607 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
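A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, assuming that whole proteins are resampled with replacement and that the reported error is the standard deviation across replicates; the function names, the seed, and the choice of standard deviation are assumptions, not details confirmed by the database.

```python
import random
import statistics


def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, counting overlapping pairs."""
    count = 0
    total = 0
    for seq in proteins:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        count += pairs.count(dipeptide)
        total += len(pairs)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement.

    Assumption: the error is the standard deviation of the frequency
    over the bootstrap replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Each replicate draws as many proteins as the original set.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not taken from the phage proteome).
toy_proteins = ["MKKLAE", "MAEKLK", "MKLLSE", "MSEEKK"]
print(dipeptide_permille(toy_proteins, "KK"))
print(bootstrap_error(toy_proteins, "KK"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins are in the set. A dipeptide that never occurs gives 0.0 in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table.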

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski