Amino acid dipeptide frequency for Streptococcus phage Javan87

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
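As a rough illustration of what such a calculation involves, here is a minimal Python sketch. It assumes overlapping adjacent pairs are counted within each protein and normalized by the total number of pairs; the function name `dipeptide_frequencies` and the toy sequences are hypothetical, and the exact normalization used by the database may differ.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping adjacent pairs are counted, pairs never span
    two proteins, and counts are normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter code (hypothetical, not taken from the proteome).
freqs = dipeptide_frequencies(["MKKL", "AKK"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The sketch works on one-letter codes; mapping them to the three-letter names used in the table is a separate lookup step.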

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
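The uniform baseline and the unit conversion can be sketched in a few lines of Python. The helper names below are hypothetical, and comparing each value directly against the uniform expectation is an assumption about how the red/blue marking is decided, not a documented rule.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def classify(value_per_mille, alphabet_size=20):
    """Label a table value relative to the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction."""
    return value_per_mille * 1e-3


# GluLeu (8.817 per mille in the table) lies above the 2.5 per mille baseline,
# AlaCys (0.182 per mille) lies below it.
print(classify(8.817), classify(0.182))
```

Passing `alphabet_size=21` gives the baseline for the 441-dipeptide case that includes Xaa.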

For more information, see the sequence space article on Wikipedia.

Rows: first residue; columns: second residue. Each cell: frequency ± error, in ‰.

     Ala  Cys  Asp  Glu  Phe  Gly  His  Ile  Lys  Leu  Met  Asn  Pro  Gln  Arg  Ser  Thr  Val  Trp  Tyr  Xaa
Ala  3.727±1.0  0.182±0.174  4.181±0.694  5.636±0.922  2.182±0.393  4.909±0.699  0.545±0.258  5.545±0.708  5.909±1.011  5.818±0.56  2.091±0.404  4.0±0.659  1.727±0.396  2.0±0.413  3.272±0.64  4.181±0.866  3.182±0.611  4.363±0.802  1.182±0.364  2.636±0.511  0.0±0.0
Cys  0.273±0.136  0.0±0.0  0.273±0.159  0.909±0.305  0.091±0.087  0.545±0.286  0.091±0.095  0.273±0.156  0.091±0.086  0.455±0.175  0.091±0.091  0.364±0.154  0.273±0.193  0.0±0.0  0.273±0.224  0.455±0.177  0.182±0.127  0.273±0.173  0.0±0.0  0.364±0.238  0.0±0.0
Asp  3.091±0.647  0.636±0.241  3.363±0.487  5.272±0.743  3.182±0.711  5.727±0.745  0.636±0.196  4.091±0.468  5.181±0.715  5.727±0.818  1.182±0.409  5.181±0.561  1.273±0.392  1.364±0.341  3.091±0.538  2.636±0.434  4.091±0.724  3.545±0.747  1.454±0.395  3.545±0.527  0.0±0.0
Glu  5.0±1.032  0.364±0.171  3.818±0.824  5.545±0.885  3.272±0.503  2.727±0.418  1.182±0.316  5.636±0.772  7.181±0.881  8.817±0.84  1.909±0.481  3.363±0.811  1.818±0.397  3.636±0.683  3.545±0.774  3.636±0.537  3.818±0.503  5.363±0.794  0.455±0.202  3.454±0.477  0.0±0.0
Phe  3.818±0.571  0.091±0.099  3.636±0.578  3.182±0.763  1.182±0.325  3.091±0.498  0.273±0.158  3.272±0.623  2.818±0.445  2.454±0.59  1.545±0.333  2.636±0.523  0.909±0.249  0.727±0.308  1.273±0.367  2.273±0.46  2.636±0.367  3.0±0.457  0.636±0.167  1.545±0.371  0.0±0.0
Gly  3.818±0.642  0.091±0.082  3.636±0.651  3.363±0.419  2.363±0.484  4.0±1.239  1.182±0.327  5.272±0.648  5.909±0.71  4.363±0.706  2.0±0.517  3.272±0.458  1.091±0.326  2.363±0.462  1.727±0.443  4.363±0.924  5.363±0.928  5.0±0.839  1.273±0.343  3.636±0.651  0.0±0.0
His  1.0±0.361  0.091±0.075  0.818±0.254  1.0±0.308  0.727±0.366  0.727±0.306  0.0±0.0  1.273±0.368  1.818±0.466  0.545±0.189  0.364±0.194  1.0±0.341  0.182±0.119  0.545±0.287  0.455±0.192  1.0±0.305  0.636±0.22  0.818±0.323  0.273±0.156  0.545±0.245  0.0±0.0
Ile  5.636±0.635  0.455±0.227  6.09±0.596  5.999±0.961  2.909±0.59  4.636±0.874  0.818±0.263  3.545±0.487  5.727±0.753  5.454±0.731  1.091±0.28  4.909±0.673  1.727±0.342  2.091±0.517  2.091±0.383  4.091±0.517  5.363±0.723  4.091±0.692  0.909±0.239  3.091±0.429  0.0±0.0
Lys  4.909±0.813  0.273±0.147  5.363±0.69  6.636±0.999  3.091±0.472  4.909±0.666  2.0±0.465  6.908±0.921  8.454±1.005  7.09±0.787  1.818±0.415  4.727±0.645  2.545±0.527  3.909±0.548  4.272±0.741  5.0±0.719  5.636±0.704  5.636±0.786  0.818±0.319  3.909±0.694  0.0±0.0
Leu  5.999±0.872  0.364±0.16  5.636±0.661  6.636±1.066  3.0±0.557  4.454±0.73  1.182±0.244  5.0±0.72  8.09±0.685  5.454±0.563  1.454±0.325  4.181±0.536  1.909±0.448  3.0±0.559  3.454±0.469  6.181±0.812  5.454±0.58  4.909±0.713  0.909±0.272  3.454±0.505  0.0±0.0
Met  2.363±0.588  0.0±0.0  1.364±0.325  1.545±0.42  0.727±0.235  0.455±0.202  0.182±0.135  1.0±0.314  1.273±0.24  2.363±0.504  0.636±0.21  1.364±0.315  1.182±0.274  1.454±0.38  1.182±0.382  1.454±0.312  1.818±0.439  1.454±0.418  0.182±0.14  0.636±0.285  0.0±0.0
Asn  3.636±0.587  0.273±0.16  3.909±0.629  4.727±0.732  2.363±0.323  4.454±0.663  1.182±0.318  4.0±0.633  5.454±0.653  3.818±0.71  1.273±0.347  4.363±0.569  1.636±0.362  2.363±0.522  2.909±0.99  3.182±0.508  2.545±0.404  3.454±0.501  1.0±0.342  2.091±0.399  0.0±0.0
Pro  2.091±0.437  0.091±0.085  2.091±0.439  2.273±0.645  1.364±0.325  1.0±0.292  0.545±0.196  1.545±0.427  2.454±0.47  2.0±0.401  0.455±0.205  1.182±0.264  0.909±0.233  0.909±0.342  1.182±0.335  2.636±0.551  1.091±0.319  2.273±0.428  0.182±0.125  0.818±0.223  0.0±0.0
Gln  2.545±0.531  0.545±0.263  1.182±0.339  2.909±0.495  0.909±0.293  2.636±0.68  0.364±0.14  3.091±0.569  3.818±0.678  2.636±0.465  1.0±0.362  2.273±0.525  0.909±0.358  1.273±0.454  1.364±0.311  2.545±0.551  1.545±0.403  2.0±0.392  0.455±0.184  1.182±0.362  0.0±0.0
Arg  2.636±0.445  0.091±0.099  2.636±0.502  2.454±0.605  1.818±0.489  2.454±0.575  0.636±0.261  2.273±0.472  3.454±0.614  4.636±0.657  1.182±0.274  2.273±0.558  1.545±0.324  2.363±0.471  2.818±0.525  1.818±0.512  2.636±0.458  2.273±0.55  0.636±0.269  2.636±0.602  0.0±0.0
Ser  3.818±0.597  0.455±0.323  4.0±0.554  5.181±0.79  2.909±0.622  5.272±1.066  0.727±0.256  5.363±0.639  4.363±0.551  4.091±0.674  1.182±0.357  3.545±0.911  1.364±0.412  2.182±0.523  2.636±0.526  3.818±0.87  3.545±0.689  3.272±0.712  0.727±0.271  2.182±0.603  0.0±0.0
Thr  4.545±0.671  0.0±0.0  2.909±0.627  3.909±0.621  2.727±0.691  5.0±0.675  0.909±0.289  4.818±0.679  5.454±0.567  4.727±0.797  1.0±0.323  3.272±0.635  2.091±0.359  1.636±0.388  2.273±0.396  4.091±0.601  4.545±0.79  4.636±0.668  0.545±0.22  3.363±0.591  0.0±0.0
Val  4.545±0.667  0.455±0.169  4.636±0.863  5.272±0.901  2.727±0.425  3.363±0.911  0.727±0.234  2.909±0.559  5.727±0.739  4.818±0.518  1.273±0.321  3.636±0.596  2.273±0.577  1.818±0.395  3.182±0.569  4.0±0.62  5.181±0.764  5.181±1.007  0.273±0.166  2.182±0.488  0.0±0.0
Trp  1.0±0.257  0.273±0.154  0.818±0.28  0.364±0.199  0.727±0.306  0.818±0.274  0.091±0.097  1.182±0.299  1.364±0.332  1.091±0.378  0.091±0.094  0.727±0.234  0.182±0.123  0.455±0.261  0.364±0.213  0.818±0.22  0.818±0.287  0.636±0.245  0.273±0.168  0.545±0.203  0.0±0.0
Tyr  2.818±0.65  0.545±0.202  3.909±0.593  1.727±0.435  2.727±0.552  2.909±0.508  0.545±0.21  3.727±0.806  3.363±0.642  4.363±0.577  0.909±0.316  2.454±0.465  1.545±0.342  1.091±0.242  1.909±0.426  2.454±0.531  2.363±0.472  2.0±0.52  0.455±0.198  1.909±0.374  0.0±0.0
Xaa  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0  0.0±0.0

Statistics based on 54 proteins (11002 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
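For readers unfamiliar with protein-level bootstrapping, the idea can be sketched as follows in Python. This is an illustrative assumption about the procedure, not the database's actual code: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported. The name `bootstrap_error`, the fixed seed, and the use of the population standard deviation are all hypothetical choices.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    # Count dipeptides once per protein so resampling only re-adds counts.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy sequences in one-letter code (hypothetical).
print(bootstrap_error(["MKKLAKK", "AKEL", "GKKT", "MELV"], "KK"))
```

Resampling whole proteins keeps the within-protein correlation of dipeptides intact, which is why the error can be large for dipeptides concentrated in a few proteins.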

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski