Amino acid dipeptide frequency for Streptococcus phage IPP45

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
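The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the one-letter alphabet, the toy sequences and the choice to normalize by the total number of overlapping pairs are all hypothetical, and the database may normalize or define "expected" differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Map every non-standard letter to X
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())  # assumption: normalize by the number of pairs
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(ALPHABET, repeat=2)}


# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille
UNIFORM_PER_MILLE = 1000.0 / 400


def representation(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Toy sequences, not taken from the phage proteome
toy = ["MKALAAK", "MAAGLX"]
freqs = dipeptide_per_mille(toy)
print(len(freqs), representation(freqs["AA"]), representation(freqs["WW"]))
```

With the values from the table, `representation(2.671)` labels AlaAla as overrepresented and `representation(0.501)` labels AlaCys as underrepresented relative to the uniform 2.5‰.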

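All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. For example, the AlaAla value of 2.671‰ corresponds to a fraction of 0.002671, i.e. about 0.267%. On this scale the uniform expectation of 0.25% equals 2.5‰.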

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.671 ± 0.946
AlaCys: 0.501 ± 0.205
AlaAsp: 4.424 ± 0.603
AlaGlu: 6.761 ± 0.837
AlaPhe: 2.42 ± 0.475
AlaGly: 4.841 ± 0.9
AlaHis: 0.334 ± 0.204
AlaIle: 5.258 ± 1.074
AlaLys: 5.592 ± 0.554
AlaLeu: 6.928 ± 0.71
AlaMet: 1.669 ± 0.386
AlaAsn: 4.173 ± 1.136
AlaPro: 1.502 ± 0.383
AlaGln: 2.17 ± 0.469
AlaArg: 2.921 ± 0.52
AlaSer: 5.676 ± 0.914
AlaThr: 3.672 ± 0.798
AlaVal: 4.173 ± 0.519
AlaTrp: 1.502 ± 0.601
AlaTyr: 1.586 ± 0.416
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.167 ± 0.1
CysCys: 0.167 ± 0.109
CysAsp: 0.25 ± 0.139
CysGlu: 0.835 ± 0.301
CysPhe: 0.167 ± 0.096
CysGly: 0.417 ± 0.212
CysHis: 0.167 ± 0.127
CysIle: 0.501 ± 0.25
CysLys: 0.25 ± 0.144
CysLeu: 0.835 ± 0.299
CysMet: 0.25 ± 0.151
CysAsn: 0.167 ± 0.144
CysPro: 0.083 ± 0.096
CysGln: 0.167 ± 0.105
CysArg: 0.584 ± 0.263
CysSer: 0.334 ± 0.178
CysThr: 0.334 ± 0.186
CysVal: 0.25 ± 0.141
CysTrp: 0.167 ± 0.125
CysTyr: 0.417 ± 0.168
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.839 ± 0.505
AspCys: 0.501 ± 0.223
AspAsp: 3.339 ± 0.813
AspGlu: 3.756 ± 0.876
AspPhe: 2.587 ± 0.395
AspGly: 5.008 ± 0.718
AspHis: 1.002 ± 0.299
AspIle: 4.758 ± 0.547
AspLys: 4.841 ± 0.78
AspLeu: 5.509 ± 0.692
AspMet: 0.835 ± 0.23
AspAsn: 3.088 ± 0.406
AspPro: 1.002 ± 0.264
AspGln: 1.085 ± 0.436
AspArg: 2.17 ± 0.387
AspSer: 2.587 ± 0.364
AspThr: 3.005 ± 0.446
AspVal: 3.839 ± 0.794
AspTrp: 0.918 ± 0.283
AspTyr: 3.422 ± 0.704
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.343 ± 0.847
GluCys: 0.417 ± 0.171
GluAsp: 4.09 ± 0.662
GluGlu: 5.759 ± 0.865
GluPhe: 3.088 ± 0.481
GluGly: 3.756 ± 0.626
GluHis: 0.501 ± 0.234
GluIle: 6.343 ± 0.645
GluLys: 6.928 ± 1.255
GluLeu: 7.011 ± 0.843
GluMet: 2.003 ± 0.505
GluAsn: 4.34 ± 0.708
GluPro: 1.836 ± 0.582
GluGln: 3.589 ± 0.852
GluArg: 3.589 ± 0.809
GluSer: 4.34 ± 0.635
GluThr: 3.422 ± 0.572
GluVal: 4.507 ± 0.7
GluTrp: 1.335 ± 0.316
GluTyr: 2.17 ± 0.357
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.587 ± 0.684
PheCys: 0.167 ± 0.128
PheAsp: 3.422 ± 0.482
PheGlu: 3.589 ± 0.587
PhePhe: 1.586 ± 0.398
PheGly: 2.921 ± 0.503
PheHis: 0.25 ± 0.133
PheIle: 2.42 ± 0.658
PheLys: 2.921 ± 0.532
PheLeu: 2.754 ± 0.593
PheMet: 1.252 ± 0.361
PheAsn: 2.254 ± 0.482
PhePro: 0.668 ± 0.317
PheGln: 1.502 ± 0.31
PheArg: 1.836 ± 0.37
PheSer: 3.088 ± 0.613
PheThr: 2.42 ± 0.37
PheVal: 2.337 ± 0.546
PheTrp: 0.417 ± 0.179
PheTyr: 2.17 ± 0.488
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.173 ± 0.741
GlyCys: 0.25 ± 0.117
GlyAsp: 2.838 ± 0.497
GlyGlu: 3.589 ± 0.528
GlyPhe: 3.088 ± 0.437
GlyGly: 4.674 ± 0.592
GlyHis: 0.918 ± 0.244
GlyIle: 5.258 ± 0.964
GlyLys: 4.841 ± 0.535
GlyLeu: 6.677 ± 1.065
GlyMet: 2.254 ± 0.576
GlyAsn: 4.257 ± 0.535
GlyPro: 0.918 ± 0.22
GlyGln: 3.506 ± 0.674
GlyArg: 3.923 ± 0.638
GlySer: 4.173 ± 0.667
GlyThr: 3.506 ± 0.501
GlyVal: 4.173 ± 0.697
GlyTrp: 1.252 ± 0.448
GlyTyr: 4.09 ± 0.583
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.835 ± 0.315
HisCys: 0.334 ± 0.143
HisAsp: 0.584 ± 0.2
HisGlu: 0.835 ± 0.219
HisPhe: 0.584 ± 0.259
HisGly: 0.668 ± 0.243
HisHis: 0.167 ± 0.139
HisIle: 1.085 ± 0.354
HisLys: 1.085 ± 0.294
HisLeu: 0.835 ± 0.283
HisMet: 0.167 ± 0.124
HisAsn: 0.918 ± 0.278
HisPro: 0.584 ± 0.227
HisGln: 0.584 ± 0.181
HisArg: 0.584 ± 0.254
HisSer: 1.169 ± 0.432
HisThr: 0.584 ± 0.192
HisVal: 0.584 ± 0.221
HisTrp: 0.167 ± 0.115
HisTyr: 0.668 ± 0.227
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.425 ± 0.722
IleCys: 0.751 ± 0.29
IleAsp: 3.839 ± 0.44
IleGlu: 6.594 ± 0.645
IlePhe: 3.005 ± 0.617
IleGly: 4.924 ± 0.705
IleHis: 0.668 ± 0.295
IleIle: 3.422 ± 0.664
IleLys: 5.091 ± 0.768
IleLeu: 5.926 ± 1.018
IleMet: 1.252 ± 0.332
IleAsn: 3.005 ± 0.385
IlePro: 2.504 ± 0.562
IleGln: 1.836 ± 0.478
IleArg: 3.088 ± 0.827
IleSer: 6.093 ± 1.235
IleThr: 4.841 ± 0.558
IleVal: 3.839 ± 0.658
IleTrp: 1.586 ± 0.637
IleTyr: 2.003 ± 0.461
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.843 ± 0.704
LysCys: 0.334 ± 0.269
LysAsp: 5.342 ± 0.655
LysGlu: 6.01 ± 0.832
LysPhe: 2.254 ± 0.427
LysGly: 4.924 ± 0.548
LysHis: 1.252 ± 0.342
LysIle: 5.676 ± 0.794
LysLys: 6.761 ± 1.207
LysLeu: 5.676 ± 0.748
LysMet: 2.17 ± 0.399
LysAsn: 5.091 ± 0.697
LysPro: 2.087 ± 0.549
LysGln: 5.091 ± 0.539
LysArg: 4.507 ± 0.66
LysSer: 5.676 ± 0.83
LysThr: 4.591 ± 0.617
LysVal: 4.424 ± 0.747
LysTrp: 1.085 ± 0.408
LysTyr: 3.339 ± 0.482
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.594 ± 0.647
LeuCys: 0.167 ± 0.151
LeuAsp: 5.342 ± 0.459
LeuGlu: 6.26 ± 0.761
LeuPhe: 3.839 ± 0.501
LeuGly: 6.594 ± 0.971
LeuHis: 1.085 ± 0.305
LeuIle: 4.674 ± 0.563
LeuLys: 7.595 ± 0.659
LeuLeu: 6.594 ± 0.889
LeuMet: 1.669 ± 0.504
LeuAsn: 5.008 ± 0.941
LeuPro: 2.42 ± 0.648
LeuGln: 2.504 ± 0.724
LeuArg: 4.841 ± 0.881
LeuSer: 6.761 ± 0.743
LeuThr: 5.509 ± 0.755
LeuVal: 4.424 ± 0.672
LeuTrp: 1.002 ± 0.379
LeuTyr: 2.003 ± 0.415
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.669 ± 0.446
MetCys: 0.25 ± 0.118
MetAsp: 1.169 ± 0.386
MetGlu: 1.836 ± 0.537
MetPhe: 0.751 ± 0.298
MetGly: 1.753 ± 0.482
MetHis: 0.25 ± 0.187
MetIle: 1.753 ± 0.434
MetLys: 2.337 ± 0.562
MetLeu: 1.669 ± 0.329
MetMet: 0.751 ± 0.28
MetAsn: 1.419 ± 0.417
MetPro: 0.668 ± 0.174
MetGln: 1.002 ± 0.329
MetArg: 1.002 ± 0.336
MetSer: 1.586 ± 0.386
MetThr: 1.586 ± 0.464
MetVal: 1.502 ± 0.373
MetTrp: 0.167 ± 0.112
MetTyr: 0.584 ± 0.2
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.756 ± 0.883
AsnCys: 0.167 ± 0.086
AsnAsp: 2.42 ± 0.376
AsnGlu: 3.589 ± 0.415
AsnPhe: 2.087 ± 0.379
AsnGly: 4.758 ± 0.741
AsnHis: 0.751 ± 0.325
AsnIle: 4.09 ± 0.505
AsnLys: 4.09 ± 0.558
AsnLeu: 5.091 ± 0.76
AsnMet: 1.085 ± 0.294
AsnAsn: 2.254 ± 0.47
AsnPro: 2.671 ± 0.459
AsnGln: 3.339 ± 0.725
AsnArg: 2.337 ± 0.489
AsnSer: 3.339 ± 0.724
AsnThr: 2.504 ± 0.436
AsnVal: 3.589 ± 0.624
AsnTrp: 1.002 ± 0.289
AsnTyr: 2.337 ± 0.45
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.502 ± 0.378
ProCys: 0.167 ± 0.134
ProAsp: 1.836 ± 0.45
ProGlu: 2.504 ± 0.448
ProPhe: 1.419 ± 0.356
ProGly: 0.835 ± 0.224
ProHis: 0.584 ± 0.281
ProIle: 1.586 ± 0.4
ProLys: 3.506 ± 0.549
ProLeu: 2.087 ± 0.514
ProMet: 0.584 ± 0.198
ProAsn: 1.169 ± 0.353
ProPro: 0.751 ± 0.254
ProGln: 1.169 ± 0.338
ProArg: 1.085 ± 0.282
ProSer: 1.92 ± 0.407
ProThr: 1.252 ± 0.342
ProVal: 1.586 ± 0.422
ProTrp: 0.083 ± 0.087
ProTyr: 0.668 ± 0.306
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.339 ± 0.569
GlnCys: 0.083 ± 0.086
GlnAsp: 1.669 ± 0.461
GlnGlu: 3.589 ± 0.566
GlnPhe: 1.252 ± 0.33
GlnGly: 2.838 ± 0.61
GlnHis: 0.501 ± 0.193
GlnIle: 3.172 ± 0.461
GlnLys: 3.756 ± 0.485
GlnLeu: 2.17 ± 0.3
GlnMet: 1.169 ± 0.285
GlnAsn: 2.337 ± 0.296
GlnPro: 1.335 ± 0.372
GlnGln: 1.002 ± 0.292
GlnArg: 2.17 ± 0.413
GlnSer: 2.754 ± 0.396
GlnThr: 2.587 ± 0.643
GlnVal: 2.754 ± 0.556
GlnTrp: 0.417 ± 0.17
GlnTyr: 1.252 ± 0.374
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.422 ± 0.662
ArgCys: 0.334 ± 0.163
ArgAsp: 2.42 ± 0.431
ArgGlu: 2.504 ± 0.549
ArgPhe: 1.586 ± 0.369
ArgGly: 1.836 ± 0.421
ArgHis: 0.751 ± 0.27
ArgIle: 3.506 ± 0.594
ArgLys: 5.091 ± 0.851
ArgLeu: 4.424 ± 0.713
ArgMet: 1.252 ± 0.307
ArgAsn: 3.172 ± 0.521
ArgPro: 1.419 ± 0.335
ArgGln: 2.254 ± 0.552
ArgArg: 2.337 ± 0.518
ArgSer: 1.92 ± 0.408
ArgThr: 2.921 ± 0.582
ArgVal: 2.42 ± 0.483
ArgTrp: 0.584 ± 0.229
ArgTyr: 2.671 ± 0.705
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.091 ± 1.141
SerCys: 0.751 ± 0.216
SerAsp: 4.173 ± 0.532
SerGlu: 5.509 ± 0.66
SerPhe: 2.838 ± 0.385
SerGly: 5.425 ± 0.821
SerHis: 1.169 ± 0.278
SerIle: 4.006 ± 0.668
SerLys: 4.924 ± 0.939
SerLeu: 6.093 ± 0.624
SerMet: 2.17 ± 0.65
SerAsn: 3.255 ± 0.409
SerPro: 1.836 ± 0.371
SerGln: 2.087 ± 0.517
SerArg: 2.504 ± 0.633
SerSer: 4.674 ± 0.73
SerThr: 4.674 ± 0.847
SerVal: 3.923 ± 0.884
SerTrp: 0.835 ± 0.277
SerTyr: 2.754 ± 0.528
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.591 ± 1.265
ThrCys: 0.417 ± 0.203
ThrAsp: 3.839 ± 0.689
ThrGlu: 3.422 ± 0.555
ThrPhe: 3.172 ± 0.774
ThrGly: 5.342 ± 0.844
ThrHis: 0.668 ± 0.277
ThrIle: 4.924 ± 0.518
ThrLys: 3.589 ± 0.781
ThrLeu: 4.674 ± 0.585
ThrMet: 0.501 ± 0.174
ThrAsn: 2.838 ± 0.589
ThrPro: 1.002 ± 0.254
ThrGln: 2.17 ± 0.652
ThrArg: 1.753 ± 0.367
ThrSer: 4.006 ± 0.762
ThrThr: 4.424 ± 0.707
ThrVal: 4.173 ± 0.572
ThrTrp: 0.668 ± 0.278
ThrTyr: 2.587 ± 0.555
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.589 ± 0.508
ValCys: 0.167 ± 0.126
ValAsp: 3.339 ± 0.515
ValGlu: 4.674 ± 0.71
ValPhe: 2.42 ± 0.548
ValGly: 3.756 ± 0.537
ValHis: 1.002 ± 0.305
ValIle: 3.506 ± 0.484
ValLys: 5.342 ± 0.514
ValLeu: 4.758 ± 0.726
ValMet: 0.501 ± 0.28
ValAsn: 3.255 ± 0.477
ValPro: 1.586 ± 0.44
ValGln: 2.587 ± 0.416
ValArg: 2.838 ± 0.558
ValSer: 4.758 ± 0.607
ValThr: 4.424 ± 0.759
ValVal: 4.257 ± 0.546
ValTrp: 1.002 ± 0.303
ValTyr: 2.087 ± 0.433
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.668 ± 0.258
TrpCys: 0.167 ± 0.12
TrpAsp: 0.584 ± 0.231
TrpGlu: 1.335 ± 0.528
TrpPhe: 0.584 ± 0.239
TrpGly: 1.252 ± 0.271
TrpHis: 0.167 ± 0.124
TrpIle: 1.085 ± 0.247
TrpLys: 0.918 ± 0.284
TrpLeu: 1.169 ± 0.39
TrpMet: 0.668 ± 0.267
TrpAsn: 1.169 ± 0.316
TrpPro: 0.167 ± 0.112
TrpGln: 0.835 ± 0.2
TrpArg: 0.501 ± 0.259
TrpSer: 1.002 ± 0.363
TrpThr: 0.751 ± 0.307
TrpVal: 1.002 ± 0.262
TrpTrp: 0.25 ± 0.121
TrpTyr: 0.751 ± 0.513
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.671 ± 0.467
TyrCys: 0.417 ± 0.165
TyrAsp: 2.254 ± 0.532
TyrGlu: 2.42 ± 0.43
TyrPhe: 1.836 ± 0.57
TyrGly: 1.836 ± 0.362
TyrHis: 0.835 ± 0.235
TyrIle: 2.671 ± 0.46
TyrLys: 2.671 ± 0.427
TyrLeu: 4.006 ± 0.571
TyrMet: 1.419 ± 0.363
TyrAsn: 2.087 ± 0.374
TyrPro: 1.252 ± 0.342
TyrGln: 1.669 ± 0.275
TyrArg: 2.254 ± 0.552
TyrSer: 2.838 ± 0.521
TyrThr: 1.753 ± 0.332
TyrVal: 2.087 ± 0.435
TyrTrp: 0.584 ± 0.192
TyrTyr: 1.502 ± 0.657
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (11982 amino acids)
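
To work with the table entries programmatically, each cell can be read as a pair of three-letter residue codes followed by a value and its error. The following is a minimal sketch under that assumption; `parse_cell` and `top_dipeptides` are hypothetical helper names, and the regular expression is written to tolerate a numeric prefix in front of the dipeptide label.

```python
import re

# Two three-letter residue codes, a colon, then "value ± error" (both in per mille)
CELL = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)")


def parse_cell(line):
    """Return (first, second, value, error) or None if the line is not a cell."""
    match = CELL.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


def top_dipeptides(lines, n=3):
    """Return the n cells with the highest frequency, highest first."""
    cells = [c for c in map(parse_cell, lines) if c is not None]
    return sorted(cells, key=lambda c: c[2], reverse=True)[:n]


# A few entries copied from the table above; "Ala" is a row label and is skipped
sample = [
    "Ala",
    "AlaAla: 2.671 ± 0.946",
    "AlaCys: 0.501 ± 0.205",
    "LeuLys: 7.595 ± 0.659",
]
print(top_dipeptides(sample, n=2))
```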

Note: The error was estimated by bootstrapping (x100) at the protein level.
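
One way to picture a protein-level bootstrap is sketched below. It is not the database's own code: the function names, the fixed seed, the toy sequences and the use of the standard deviation across replicates as the reported error are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping residue pairs within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so all pairs from one
    protein enter or leave a replicate together.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences, not taken from the phage proteome
toy = ["MKALAAK", "MAAGLAA", "MKKLLS"]
mean, err = bootstrap_error(toy, "AA")
print(f"AA: {mean:.3f} ± {err:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps pairs from the same protein together, which is presumably why the note specifies the protein level.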

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski