Amino acid dipepetide frequency for Streptococcus phage Javan427

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.508AlaAla: 3.508 ± 1.048
0.734AlaCys: 0.734 ± 0.23
5.221AlaAsp: 5.221 ± 0.74
5.955AlaGlu: 5.955 ± 0.604
2.855AlaPhe: 2.855 ± 0.42
4.324AlaGly: 4.324 ± 0.797
1.142AlaHis: 1.142 ± 0.343
5.955AlaIle: 5.955 ± 0.809
6.363AlaLys: 6.363 ± 1.228
7.097AlaLeu: 7.097 ± 0.867
2.692AlaMet: 2.692 ± 0.524
4.487AlaAsn: 4.487 ± 0.786
1.142AlaPro: 1.142 ± 0.24
2.121AlaGln: 2.121 ± 0.476
3.916AlaArg: 3.916 ± 0.638
4.161AlaSer: 4.161 ± 0.732
3.508AlaThr: 3.508 ± 0.511
4.895AlaVal: 4.895 ± 0.696
0.897AlaTrp: 0.897 ± 0.265
1.713AlaTyr: 1.713 ± 0.481
0.0AlaXaa: 0.0 ± 0.0
Cys
0.163CysAla: 0.163 ± 0.103
0.0CysCys: 0.0 ± 0.0
0.245CysAsp: 0.245 ± 0.142
0.489CysGlu: 0.489 ± 0.186
0.082CysPhe: 0.082 ± 0.088
0.326CysGly: 0.326 ± 0.161
0.163CysHis: 0.163 ± 0.124
0.082CysIle: 0.082 ± 0.08
0.979CysLys: 0.979 ± 0.311
0.653CysLeu: 0.653 ± 0.225
0.082CysMet: 0.082 ± 0.094
0.082CysAsn: 0.082 ± 0.078
0.163CysPro: 0.163 ± 0.11
0.245CysGln: 0.245 ± 0.136
0.489CysArg: 0.489 ± 0.208
0.163CysSer: 0.163 ± 0.116
0.163CysThr: 0.163 ± 0.129
0.245CysVal: 0.245 ± 0.141
0.0CysTrp: 0.0 ± 0.0
0.163CysTyr: 0.163 ± 0.116
0.0CysXaa: 0.0 ± 0.0
Asp
4.568AspAla: 4.568 ± 0.739
0.408AspCys: 0.408 ± 0.156
3.508AspAsp: 3.508 ± 0.712
4.976AspGlu: 4.976 ± 0.665
3.426AspPhe: 3.426 ± 0.355
4.161AspGly: 4.161 ± 0.689
0.816AspHis: 0.816 ± 0.226
4.487AspIle: 4.487 ± 0.656
6.282AspLys: 6.282 ± 0.809
5.384AspLeu: 5.384 ± 0.702
1.876AspMet: 1.876 ± 0.317
3.182AspAsn: 3.182 ± 0.446
1.468AspPro: 1.468 ± 0.278
1.713AspGln: 1.713 ± 0.359
2.039AspArg: 2.039 ± 0.441
2.611AspSer: 2.611 ± 0.426
3.508AspThr: 3.508 ± 0.627
4.487AspVal: 4.487 ± 0.58
0.897AspTrp: 0.897 ± 0.202
3.345AspTyr: 3.345 ± 0.547
0.0AspXaa: 0.0 ± 0.0
Glu
5.792GluAla: 5.792 ± 0.724
0.326GluCys: 0.326 ± 0.161
5.221GluAsp: 5.221 ± 0.739
6.771GluGlu: 6.771 ± 0.731
3.834GluPhe: 3.834 ± 0.58
3.916GluGly: 3.916 ± 0.524
0.897GluHis: 0.897 ± 0.319
5.221GluIle: 5.221 ± 0.655
8.24GluLys: 8.24 ± 0.967
8.892GluLeu: 8.892 ± 1.008
2.203GluMet: 2.203 ± 0.393
5.058GluAsn: 5.058 ± 0.584
1.713GluPro: 1.713 ± 0.514
3.426GluGln: 3.426 ± 0.585
3.916GluArg: 3.916 ± 0.596
4.732GluSer: 4.732 ± 0.723
3.426GluThr: 3.426 ± 0.461
5.629GluVal: 5.629 ± 0.594
0.897GluTrp: 0.897 ± 0.286
1.958GluTyr: 1.958 ± 0.452
0.0GluXaa: 0.0 ± 0.0
Phe
3.426PheAla: 3.426 ± 0.508
0.163PheCys: 0.163 ± 0.115
3.182PheAsp: 3.182 ± 0.515
3.182PheGlu: 3.182 ± 0.449
1.142PhePhe: 1.142 ± 0.232
2.937PheGly: 2.937 ± 0.415
0.816PheHis: 0.816 ± 0.3
2.774PheIle: 2.774 ± 0.475
3.1PheLys: 3.1 ± 0.47
3.426PheLeu: 3.426 ± 0.519
1.224PheMet: 1.224 ± 0.331
2.039PheAsn: 2.039 ± 0.413
1.061PhePro: 1.061 ± 0.307
1.142PheGln: 1.142 ± 0.255
2.284PheArg: 2.284 ± 0.359
2.774PheSer: 2.774 ± 0.409
2.692PheThr: 2.692 ± 0.43
2.039PheVal: 2.039 ± 0.414
0.163PheTrp: 0.163 ± 0.106
2.039PheTyr: 2.039 ± 0.463
0.0PheXaa: 0.0 ± 0.0
Gly
3.753GlyAla: 3.753 ± 0.856
0.326GlyCys: 0.326 ± 0.188
3.589GlyAsp: 3.589 ± 0.491
4.405GlyGlu: 4.405 ± 0.549
3.997GlyPhe: 3.997 ± 0.554
3.997GlyGly: 3.997 ± 0.484
0.734GlyHis: 0.734 ± 0.216
3.345GlyIle: 3.345 ± 0.43
3.916GlyLys: 3.916 ± 0.48
5.466GlyLeu: 5.466 ± 0.813
1.795GlyMet: 1.795 ± 0.405
2.937GlyAsn: 2.937 ± 0.519
0.734GlyPro: 0.734 ± 0.216
3.508GlyGln: 3.508 ± 0.499
3.589GlyArg: 3.589 ± 0.518
3.1GlySer: 3.1 ± 0.76
3.1GlyThr: 3.1 ± 0.554
3.018GlyVal: 3.018 ± 0.381
0.897GlyTrp: 0.897 ± 0.449
3.263GlyTyr: 3.263 ± 0.549
0.0GlyXaa: 0.0 ± 0.0
His
1.142HisAla: 1.142 ± 0.216
0.245HisCys: 0.245 ± 0.123
0.734HisAsp: 0.734 ± 0.215
1.305HisGlu: 1.305 ± 0.31
0.979HisPhe: 0.979 ± 0.285
1.387HisGly: 1.387 ± 0.293
0.408HisHis: 0.408 ± 0.2
0.979HisIle: 0.979 ± 0.26
1.061HisLys: 1.061 ± 0.304
1.713HisLeu: 1.713 ± 0.413
0.245HisMet: 0.245 ± 0.126
0.979HisAsn: 0.979 ± 0.252
0.816HisPro: 0.816 ± 0.205
0.816HisGln: 0.816 ± 0.198
0.326HisArg: 0.326 ± 0.165
1.713HisSer: 1.713 ± 0.473
0.653HisThr: 0.653 ± 0.188
0.571HisVal: 0.571 ± 0.204
0.082HisTrp: 0.082 ± 0.068
0.653HisTyr: 0.653 ± 0.252
0.0HisXaa: 0.0 ± 0.0
Ile
4.895IleAla: 4.895 ± 0.745
0.489IleCys: 0.489 ± 0.216
4.242IleAsp: 4.242 ± 0.437
6.2IleGlu: 6.2 ± 0.825
2.529IlePhe: 2.529 ± 0.398
2.937IleGly: 2.937 ± 0.422
0.734IleHis: 0.734 ± 0.241
2.774IleIle: 2.774 ± 0.642
5.384IleLys: 5.384 ± 0.779
5.547IleLeu: 5.547 ± 0.635
1.142IleMet: 1.142 ± 0.303
3.671IleAsn: 3.671 ± 0.49
1.55IlePro: 1.55 ± 0.419
3.1IleGln: 3.1 ± 0.531
3.345IleArg: 3.345 ± 0.63
4.242IleSer: 4.242 ± 0.677
3.916IleThr: 3.916 ± 0.614
2.937IleVal: 2.937 ± 0.416
0.979IleTrp: 0.979 ± 0.282
2.121IleTyr: 2.121 ± 0.409
0.0IleXaa: 0.0 ± 0.0
Lys
6.608LysAla: 6.608 ± 0.901
0.0LysCys: 0.0 ± 0.0
4.976LysAsp: 4.976 ± 0.674
6.771LysGlu: 6.771 ± 0.916
2.121LysPhe: 2.121 ± 0.494
3.753LysGly: 3.753 ± 0.509
1.713LysHis: 1.713 ± 0.353
6.037LysIle: 6.037 ± 1.001
8.403LysLys: 8.403 ± 0.945
7.587LysLeu: 7.587 ± 0.909
2.529LysMet: 2.529 ± 0.353
5.874LysAsn: 5.874 ± 0.597
2.039LysPro: 2.039 ± 0.44
4.895LysGln: 4.895 ± 0.67
4.161LysArg: 4.161 ± 0.578
4.242LysSer: 4.242 ± 0.655
3.426LysThr: 3.426 ± 0.551
5.221LysVal: 5.221 ± 0.622
1.142LysTrp: 1.142 ± 0.291
3.1LysTyr: 3.1 ± 0.472
0.0LysXaa: 0.0 ± 0.0
Leu
7.995LeuAla: 7.995 ± 0.902
0.163LeuCys: 0.163 ± 0.134
7.097LeuAsp: 7.097 ± 0.793
8.647LeuGlu: 8.647 ± 0.783
2.855LeuPhe: 2.855 ± 0.684
5.384LeuGly: 5.384 ± 0.633
0.979LeuHis: 0.979 ± 0.255
5.629LeuIle: 5.629 ± 0.719
7.342LeuLys: 7.342 ± 0.745
8.321LeuLeu: 8.321 ± 1.269
2.284LeuMet: 2.284 ± 0.607
4.65LeuAsn: 4.65 ± 0.507
2.447LeuPro: 2.447 ± 0.493
2.855LeuGln: 2.855 ± 0.656
5.058LeuArg: 5.058 ± 0.539
5.547LeuSer: 5.547 ± 0.72
5.547LeuThr: 5.547 ± 0.781
5.14LeuVal: 5.14 ± 0.669
0.163LeuTrp: 0.163 ± 0.114
2.774LeuTyr: 2.774 ± 0.485
0.0LeuXaa: 0.0 ± 0.0
Met
1.795MetAla: 1.795 ± 0.312
0.163MetCys: 0.163 ± 0.107
1.468MetAsp: 1.468 ± 0.284
2.937MetGlu: 2.937 ± 0.503
0.816MetPhe: 0.816 ± 0.267
1.224MetGly: 1.224 ± 0.283
0.408MetHis: 0.408 ± 0.204
1.713MetIle: 1.713 ± 0.37
2.284MetLys: 2.284 ± 0.477
1.713MetLeu: 1.713 ± 0.46
0.734MetMet: 0.734 ± 0.224
1.958MetAsn: 1.958 ± 0.402
0.489MetPro: 0.489 ± 0.185
1.387MetGln: 1.387 ± 0.297
0.897MetArg: 0.897 ± 0.255
1.958MetSer: 1.958 ± 0.395
2.203MetThr: 2.203 ± 0.488
1.795MetVal: 1.795 ± 0.435
0.245MetTrp: 0.245 ± 0.151
0.408MetTyr: 0.408 ± 0.14
0.0MetXaa: 0.0 ± 0.0
Asn
4.161AsnAla: 4.161 ± 0.757
0.326AsnCys: 0.326 ± 0.167
3.263AsnAsp: 3.263 ± 0.431
3.753AsnGlu: 3.753 ± 0.505
2.039AsnPhe: 2.039 ± 0.324
4.813AsnGly: 4.813 ± 0.685
1.305AsnHis: 1.305 ± 0.365
3.426AsnIle: 3.426 ± 0.464
4.161AsnLys: 4.161 ± 0.617
5.221AsnLeu: 5.221 ± 0.726
0.897AsnMet: 0.897 ± 0.297
3.1AsnAsn: 3.1 ± 0.545
1.713AsnPro: 1.713 ± 0.354
3.1AsnGln: 3.1 ± 0.614
2.692AsnArg: 2.692 ± 0.536
3.1AsnSer: 3.1 ± 0.614
2.121AsnThr: 2.121 ± 0.361
3.263AsnVal: 3.263 ± 0.615
0.734AsnTrp: 0.734 ± 0.201
1.795AsnTyr: 1.795 ± 0.359
0.0AsnXaa: 0.0 ± 0.0
Pro
1.305ProAla: 1.305 ± 0.315
0.082ProCys: 0.082 ± 0.099
1.795ProAsp: 1.795 ± 0.417
1.632ProGlu: 1.632 ± 0.409
1.224ProPhe: 1.224 ± 0.27
0.979ProGly: 0.979 ± 0.244
0.897ProHis: 0.897 ± 0.315
1.55ProIle: 1.55 ± 0.445
2.121ProLys: 2.121 ± 0.345
2.203ProLeu: 2.203 ± 0.475
0.489ProMet: 0.489 ± 0.186
0.979ProAsn: 0.979 ± 0.316
0.571ProPro: 0.571 ± 0.244
0.734ProGln: 0.734 ± 0.248
1.142ProArg: 1.142 ± 0.347
1.632ProSer: 1.632 ± 0.339
1.387ProThr: 1.387 ± 0.369
2.203ProVal: 2.203 ± 0.391
0.163ProTrp: 0.163 ± 0.096
1.061ProTyr: 1.061 ± 0.353
0.0ProXaa: 0.0 ± 0.0
Gln
3.671GlnAla: 3.671 ± 0.564
0.082GlnCys: 0.082 ± 0.074
2.284GlnAsp: 2.284 ± 0.345
4.161GlnGlu: 4.161 ± 0.545
1.387GlnPhe: 1.387 ± 0.365
2.529GlnGly: 2.529 ± 0.358
0.653GlnHis: 0.653 ± 0.185
2.774GlnIle: 2.774 ± 0.528
3.345GlnLys: 3.345 ± 0.455
3.753GlnLeu: 3.753 ± 0.643
1.061GlnMet: 1.061 ± 0.26
2.366GlnAsn: 2.366 ± 0.376
1.305GlnPro: 1.305 ± 0.359
1.305GlnGln: 1.305 ± 0.26
2.366GlnArg: 2.366 ± 0.478
2.447GlnSer: 2.447 ± 0.343
2.203GlnThr: 2.203 ± 0.402
2.774GlnVal: 2.774 ± 0.481
0.326GlnTrp: 0.326 ± 0.138
0.897GlnTyr: 0.897 ± 0.234
0.0GlnXaa: 0.0 ± 0.0
Arg
3.1ArgAla: 3.1 ± 0.482
0.326ArgCys: 0.326 ± 0.135
2.692ArgAsp: 2.692 ± 0.501
4.242ArgGlu: 4.242 ± 0.536
2.529ArgPhe: 2.529 ± 0.57
1.958ArgGly: 1.958 ± 0.378
1.142ArgHis: 1.142 ± 0.23
3.182ArgIle: 3.182 ± 0.606
4.813ArgLys: 4.813 ± 0.717
5.303ArgLeu: 5.303 ± 0.704
1.468ArgMet: 1.468 ± 0.331
2.692ArgAsn: 2.692 ± 0.424
1.305ArgPro: 1.305 ± 0.357
2.121ArgGln: 2.121 ± 0.463
3.018ArgArg: 3.018 ± 0.627
2.284ArgSer: 2.284 ± 0.422
2.529ArgThr: 2.529 ± 0.475
2.203ArgVal: 2.203 ± 0.408
0.734ArgTrp: 0.734 ± 0.253
2.447ArgTyr: 2.447 ± 0.454
0.0ArgXaa: 0.0 ± 0.0
Ser
3.916SerAla: 3.916 ± 0.591
0.245SerCys: 0.245 ± 0.121
3.753SerAsp: 3.753 ± 0.728
4.405SerGlu: 4.405 ± 0.588
2.774SerPhe: 2.774 ± 0.421
4.324SerGly: 4.324 ± 0.554
0.979SerHis: 0.979 ± 0.24
2.855SerIle: 2.855 ± 0.404
4.242SerLys: 4.242 ± 0.519
5.547SerLeu: 5.547 ± 0.507
2.121SerMet: 2.121 ± 0.369
2.855SerAsn: 2.855 ± 0.401
1.876SerPro: 1.876 ± 0.339
2.692SerGln: 2.692 ± 0.54
3.018SerArg: 3.018 ± 0.566
4.161SerSer: 4.161 ± 1.054
3.589SerThr: 3.589 ± 0.664
3.182SerVal: 3.182 ± 0.516
0.816SerTrp: 0.816 ± 0.231
3.018SerTyr: 3.018 ± 0.551
0.0SerXaa: 0.0 ± 0.0
Thr
5.221ThrAla: 5.221 ± 1.046
0.245ThrCys: 0.245 ± 0.138
3.263ThrAsp: 3.263 ± 0.463
3.1ThrGlu: 3.1 ± 0.549
2.774ThrPhe: 2.774 ± 0.606
3.263ThrGly: 3.263 ± 0.747
0.816ThrHis: 0.816 ± 0.231
3.263ThrIle: 3.263 ± 0.515
4.242ThrLys: 4.242 ± 0.462
4.65ThrLeu: 4.65 ± 0.466
1.061ThrMet: 1.061 ± 0.237
2.692ThrAsn: 2.692 ± 0.521
1.632ThrPro: 1.632 ± 0.405
1.713ThrGln: 1.713 ± 0.395
2.284ThrArg: 2.284 ± 0.576
3.997ThrSer: 3.997 ± 0.65
4.079ThrThr: 4.079 ± 0.56
3.426ThrVal: 3.426 ± 0.546
0.816ThrTrp: 0.816 ± 0.223
2.611ThrTyr: 2.611 ± 0.46
0.0ThrXaa: 0.0 ± 0.0
Val
4.161ValAla: 4.161 ± 0.586
0.163ValCys: 0.163 ± 0.114
3.671ValAsp: 3.671 ± 0.519
5.792ValGlu: 5.792 ± 0.747
2.039ValPhe: 2.039 ± 0.344
4.813ValGly: 4.813 ± 0.557
1.224ValHis: 1.224 ± 0.351
3.018ValIle: 3.018 ± 0.391
4.732ValLys: 4.732 ± 0.605
4.405ValLeu: 4.405 ± 0.71
1.632ValMet: 1.632 ± 0.352
2.774ValAsn: 2.774 ± 0.397
1.061ValPro: 1.061 ± 0.259
1.632ValGln: 1.632 ± 0.35
2.937ValArg: 2.937 ± 0.468
4.405ValSer: 4.405 ± 0.491
4.324ValThr: 4.324 ± 0.49
3.589ValVal: 3.589 ± 0.501
0.489ValTrp: 0.489 ± 0.207
1.876ValTyr: 1.876 ± 0.357
0.0ValXaa: 0.0 ± 0.0
Trp
0.653TrpAla: 0.653 ± 0.161
0.082TrpCys: 0.082 ± 0.094
0.489TrpAsp: 0.489 ± 0.177
0.979TrpGlu: 0.979 ± 0.235
0.653TrpPhe: 0.653 ± 0.289
0.653TrpGly: 0.653 ± 0.253
0.082TrpHis: 0.082 ± 0.092
0.897TrpIle: 0.897 ± 0.227
0.653TrpLys: 0.653 ± 0.253
0.734TrpLeu: 0.734 ± 0.232
0.326TrpMet: 0.326 ± 0.126
0.897TrpAsn: 0.897 ± 0.288
0.163TrpPro: 0.163 ± 0.116
0.734TrpGln: 0.734 ± 0.234
0.408TrpArg: 0.408 ± 0.2
0.979TrpSer: 0.979 ± 0.251
0.408TrpThr: 0.408 ± 0.159
0.408TrpVal: 0.408 ± 0.193
0.082TrpTrp: 0.082 ± 0.067
0.816TrpTyr: 0.816 ± 0.494
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.937TyrAla: 2.937 ± 0.501
0.408TyrCys: 0.408 ± 0.15
2.447TyrAsp: 2.447 ± 0.493
2.366TyrGlu: 2.366 ± 0.431
1.55TyrPhe: 1.55 ± 0.349
1.795TyrGly: 1.795 ± 0.51
0.897TyrHis: 0.897 ± 0.209
2.774TyrIle: 2.774 ± 0.596
2.937TyrLys: 2.937 ± 0.439
3.263TyrLeu: 3.263 ± 0.478
0.653TyrMet: 0.653 ± 0.204
1.713TyrAsn: 1.713 ± 0.284
0.897TyrPro: 0.897 ± 0.299
2.447TyrGln: 2.447 ± 0.458
2.203TyrArg: 2.203 ± 0.56
2.121TyrSer: 2.121 ± 0.551
2.366TyrThr: 2.366 ± 0.375
1.713TyrVal: 1.713 ± 0.348
0.571TyrTrp: 0.571 ± 0.2
2.121TyrTyr: 2.121 ± 0.546
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (12259 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski