Amino acid dipepetide frequency for Streptococcus phage Javan178

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.807AlaAla: 6.807 ± 1.879
0.84AlaCys: 0.84 ± 0.331
4.706AlaAsp: 4.706 ± 0.588
5.798AlaGlu: 5.798 ± 0.844
2.857AlaPhe: 2.857 ± 0.478
4.37AlaGly: 4.37 ± 0.996
0.924AlaHis: 0.924 ± 0.262
6.05AlaIle: 6.05 ± 0.526
7.395AlaLys: 7.395 ± 0.881
6.471AlaLeu: 6.471 ± 0.723
2.353AlaMet: 2.353 ± 0.52
3.361AlaAsn: 3.361 ± 0.652
1.429AlaPro: 1.429 ± 0.292
3.697AlaGln: 3.697 ± 0.823
3.025AlaArg: 3.025 ± 0.581
4.286AlaSer: 4.286 ± 0.893
3.361AlaThr: 3.361 ± 0.72
4.202AlaVal: 4.202 ± 0.598
1.092AlaTrp: 1.092 ± 0.33
3.025AlaTyr: 3.025 ± 0.45
0.0AlaXaa: 0.0 ± 0.0
Cys
0.42CysAla: 0.42 ± 0.174
0.0CysCys: 0.0 ± 0.0
0.42CysAsp: 0.42 ± 0.177
0.84CysGlu: 0.84 ± 0.318
0.252CysPhe: 0.252 ± 0.147
0.504CysGly: 0.504 ± 0.223
0.084CysHis: 0.084 ± 0.075
0.0CysIle: 0.0 ± 0.0
0.504CysLys: 0.504 ± 0.189
1.429CysLeu: 1.429 ± 0.458
0.252CysMet: 0.252 ± 0.151
0.42CysAsn: 0.42 ± 0.209
0.168CysPro: 0.168 ± 0.126
0.168CysGln: 0.168 ± 0.113
0.504CysArg: 0.504 ± 0.186
0.42CysSer: 0.42 ± 0.174
0.168CysThr: 0.168 ± 0.121
0.168CysVal: 0.168 ± 0.125
0.168CysTrp: 0.168 ± 0.112
0.252CysTyr: 0.252 ± 0.148
0.0CysXaa: 0.0 ± 0.0
Asp
3.529AspAla: 3.529 ± 0.675
0.672AspCys: 0.672 ± 0.257
4.622AspAsp: 4.622 ± 0.798
4.958AspGlu: 4.958 ± 0.56
3.95AspPhe: 3.95 ± 0.588
5.798AspGly: 5.798 ± 0.674
0.84AspHis: 0.84 ± 0.267
4.958AspIle: 4.958 ± 0.722
5.714AspLys: 5.714 ± 0.616
6.555AspLeu: 6.555 ± 0.903
0.84AspMet: 0.84 ± 0.261
4.202AspAsn: 4.202 ± 0.551
1.597AspPro: 1.597 ± 0.429
1.765AspGln: 1.765 ± 0.382
2.605AspArg: 2.605 ± 0.483
3.277AspSer: 3.277 ± 0.53
3.782AspThr: 3.782 ± 0.515
4.874AspVal: 4.874 ± 0.769
1.176AspTrp: 1.176 ± 0.295
3.697AspTyr: 3.697 ± 0.629
0.0AspXaa: 0.0 ± 0.0
Glu
6.218GluAla: 6.218 ± 0.97
0.336GluCys: 0.336 ± 0.167
3.782GluAsp: 3.782 ± 0.653
5.798GluGlu: 5.798 ± 0.842
2.605GluPhe: 2.605 ± 0.407
3.529GluGly: 3.529 ± 0.428
1.176GluHis: 1.176 ± 0.335
6.387GluIle: 6.387 ± 0.746
5.63GluLys: 5.63 ± 0.716
8.319GluLeu: 8.319 ± 0.7
1.513GluMet: 1.513 ± 0.334
2.941GluAsn: 2.941 ± 0.54
1.513GluPro: 1.513 ± 0.432
3.782GluGln: 3.782 ± 0.691
3.025GluArg: 3.025 ± 0.477
3.529GluSer: 3.529 ± 0.561
4.454GluThr: 4.454 ± 0.614
4.706GluVal: 4.706 ± 0.77
0.588GluTrp: 0.588 ± 0.215
3.277GluTyr: 3.277 ± 0.443
0.0GluXaa: 0.0 ± 0.0
Phe
2.857PheAla: 2.857 ± 0.565
0.672PheCys: 0.672 ± 0.266
4.79PheAsp: 4.79 ± 0.622
3.277PheGlu: 3.277 ± 0.714
0.84PhePhe: 0.84 ± 0.251
2.521PheGly: 2.521 ± 0.52
0.168PheHis: 0.168 ± 0.132
3.025PheIle: 3.025 ± 0.515
3.277PheLys: 3.277 ± 0.553
2.353PheLeu: 2.353 ± 0.598
0.672PheMet: 0.672 ± 0.26
2.689PheAsn: 2.689 ± 0.499
0.42PhePro: 0.42 ± 0.186
0.504PheGln: 0.504 ± 0.229
2.017PheArg: 2.017 ± 0.457
2.605PheSer: 2.605 ± 0.508
2.437PheThr: 2.437 ± 0.4
1.849PheVal: 1.849 ± 0.426
0.168PheTrp: 0.168 ± 0.109
1.261PheTyr: 1.261 ± 0.316
0.0PheXaa: 0.0 ± 0.0
Gly
4.79GlyAla: 4.79 ± 0.59
0.504GlyCys: 0.504 ± 0.189
3.866GlyAsp: 3.866 ± 0.602
3.529GlyGlu: 3.529 ± 0.577
3.109GlyPhe: 3.109 ± 0.451
3.025GlyGly: 3.025 ± 0.549
1.513GlyHis: 1.513 ± 0.315
4.874GlyIle: 4.874 ± 0.967
5.042GlyLys: 5.042 ± 0.69
4.37GlyLeu: 4.37 ± 0.734
1.513GlyMet: 1.513 ± 0.422
2.941GlyAsn: 2.941 ± 0.477
2.437GlyPro: 2.437 ± 1.388
2.269GlyGln: 2.269 ± 0.459
3.025GlyArg: 3.025 ± 0.56
3.193GlySer: 3.193 ± 0.461
3.445GlyThr: 3.445 ± 0.435
5.546GlyVal: 5.546 ± 1.014
0.84GlyTrp: 0.84 ± 0.318
2.353GlyTyr: 2.353 ± 0.635
0.0GlyXaa: 0.0 ± 0.0
His
1.008HisAla: 1.008 ± 0.32
0.084HisCys: 0.084 ± 0.075
0.924HisAsp: 0.924 ± 0.293
0.924HisGlu: 0.924 ± 0.248
1.008HisPhe: 1.008 ± 0.276
0.672HisGly: 0.672 ± 0.209
0.168HisHis: 0.168 ± 0.118
1.176HisIle: 1.176 ± 0.344
1.429HisLys: 1.429 ± 0.359
1.176HisLeu: 1.176 ± 0.509
0.336HisMet: 0.336 ± 0.189
0.588HisAsn: 0.588 ± 0.17
0.504HisPro: 0.504 ± 0.255
0.84HisGln: 0.84 ± 0.239
0.756HisArg: 0.756 ± 0.25
1.092HisSer: 1.092 ± 0.361
0.924HisThr: 0.924 ± 0.354
0.504HisVal: 0.504 ± 0.208
0.336HisTrp: 0.336 ± 0.167
0.588HisTyr: 0.588 ± 0.251
0.0HisXaa: 0.0 ± 0.0
Ile
5.546IleAla: 5.546 ± 0.8
0.168IleCys: 0.168 ± 0.126
5.63IleAsp: 5.63 ± 0.655
5.714IleGlu: 5.714 ± 0.742
2.269IlePhe: 2.269 ± 0.541
3.445IleGly: 3.445 ± 0.667
0.756IleHis: 0.756 ± 0.261
3.613IleIle: 3.613 ± 0.559
8.319IleLys: 8.319 ± 1.021
4.286IleLeu: 4.286 ± 0.573
1.261IleMet: 1.261 ± 0.366
5.042IleAsn: 5.042 ± 0.632
1.513IlePro: 1.513 ± 0.305
2.521IleGln: 2.521 ± 0.475
3.193IleArg: 3.193 ± 0.642
5.546IleSer: 5.546 ± 0.644
3.445IleThr: 3.445 ± 0.431
4.286IleVal: 4.286 ± 0.469
0.756IleTrp: 0.756 ± 0.313
2.941IleTyr: 2.941 ± 0.542
0.0IleXaa: 0.0 ± 0.0
Lys
6.891LysAla: 6.891 ± 0.777
0.336LysCys: 0.336 ± 0.178
5.546LysAsp: 5.546 ± 0.712
6.387LysGlu: 6.387 ± 0.888
1.597LysPhe: 1.597 ± 0.349
4.706LysGly: 4.706 ± 0.839
1.513LysHis: 1.513 ± 0.298
6.639LysIle: 6.639 ± 0.863
7.143LysLys: 7.143 ± 0.781
8.067LysLeu: 8.067 ± 0.735
2.437LysMet: 2.437 ± 0.513
4.79LysAsn: 4.79 ± 0.634
2.857LysPro: 2.857 ± 0.636
3.613LysGln: 3.613 ± 0.521
4.454LysArg: 4.454 ± 0.78
4.706LysSer: 4.706 ± 0.535
5.378LysThr: 5.378 ± 0.716
6.555LysVal: 6.555 ± 0.892
1.092LysTrp: 1.092 ± 0.254
3.782LysTyr: 3.782 ± 0.45
0.0LysXaa: 0.0 ± 0.0
Leu
6.555LeuAla: 6.555 ± 0.872
0.756LeuCys: 0.756 ± 0.218
6.555LeuAsp: 6.555 ± 0.881
6.218LeuGlu: 6.218 ± 0.665
3.025LeuPhe: 3.025 ± 0.448
5.21LeuGly: 5.21 ± 0.66
1.092LeuHis: 1.092 ± 0.334
4.286LeuIle: 4.286 ± 0.574
7.479LeuLys: 7.479 ± 0.63
4.958LeuLeu: 4.958 ± 0.649
1.849LeuMet: 1.849 ± 0.522
5.294LeuAsn: 5.294 ± 0.639
2.605LeuPro: 2.605 ± 0.607
2.605LeuGln: 2.605 ± 0.49
3.866LeuArg: 3.866 ± 0.603
4.286LeuSer: 4.286 ± 0.63
5.378LeuThr: 5.378 ± 0.617
5.042LeuVal: 5.042 ± 0.535
0.84LeuTrp: 0.84 ± 0.275
3.445LeuTyr: 3.445 ± 0.623
0.0LeuXaa: 0.0 ± 0.0
Met
2.101MetAla: 2.101 ± 0.42
0.168MetCys: 0.168 ± 0.131
1.429MetAsp: 1.429 ± 0.377
1.345MetGlu: 1.345 ± 0.359
0.504MetPhe: 0.504 ± 0.249
1.429MetGly: 1.429 ± 0.355
0.168MetHis: 0.168 ± 0.118
1.765MetIle: 1.765 ± 0.464
0.924MetLys: 0.924 ± 0.298
1.597MetLeu: 1.597 ± 0.408
0.336MetMet: 0.336 ± 0.158
0.924MetAsn: 0.924 ± 0.287
1.008MetPro: 1.008 ± 0.238
1.681MetGln: 1.681 ± 0.495
1.597MetArg: 1.597 ± 0.334
1.681MetSer: 1.681 ± 0.457
1.765MetThr: 1.765 ± 0.44
0.756MetVal: 0.756 ± 0.195
0.252MetTrp: 0.252 ± 0.144
0.588MetTyr: 0.588 ± 0.182
0.0MetXaa: 0.0 ± 0.0
Asn
4.286AsnAla: 4.286 ± 0.705
0.336AsnCys: 0.336 ± 0.165
2.773AsnAsp: 2.773 ± 0.547
2.857AsnGlu: 2.857 ± 0.438
2.353AsnPhe: 2.353 ± 0.513
4.286AsnGly: 4.286 ± 0.635
0.756AsnHis: 0.756 ± 0.326
3.025AsnIle: 3.025 ± 0.445
5.294AsnLys: 5.294 ± 0.671
4.874AsnLeu: 4.874 ± 0.679
1.092AsnMet: 1.092 ± 0.283
2.857AsnAsn: 2.857 ± 0.42
2.185AsnPro: 2.185 ± 0.445
2.437AsnGln: 2.437 ± 0.468
1.681AsnArg: 1.681 ± 0.335
3.782AsnSer: 3.782 ± 0.752
2.689AsnThr: 2.689 ± 0.549
2.941AsnVal: 2.941 ± 0.554
0.84AsnTrp: 0.84 ± 0.264
1.429AsnTyr: 1.429 ± 0.29
0.0AsnXaa: 0.0 ± 0.0
Pro
1.429ProAla: 1.429 ± 0.453
0.252ProCys: 0.252 ± 0.13
1.933ProAsp: 1.933 ± 0.367
2.689ProGlu: 2.689 ± 0.481
0.924ProPhe: 0.924 ± 0.337
1.597ProGly: 1.597 ± 0.593
0.84ProHis: 0.84 ± 0.332
1.765ProIle: 1.765 ± 0.427
2.689ProLys: 2.689 ± 0.521
1.681ProLeu: 1.681 ± 0.471
0.336ProMet: 0.336 ± 0.146
0.756ProAsn: 0.756 ± 0.289
1.092ProPro: 1.092 ± 0.431
1.513ProGln: 1.513 ± 0.503
1.513ProArg: 1.513 ± 0.415
2.017ProSer: 2.017 ± 0.398
1.345ProThr: 1.345 ± 0.31
1.513ProVal: 1.513 ± 0.288
0.168ProTrp: 0.168 ± 0.137
1.429ProTyr: 1.429 ± 0.33
0.0ProXaa: 0.0 ± 0.0
Gln
3.529GlnAla: 3.529 ± 0.563
0.252GlnCys: 0.252 ± 0.165
1.681GlnAsp: 1.681 ± 0.416
3.782GlnGlu: 3.782 ± 0.682
1.092GlnPhe: 1.092 ± 0.286
3.277GlnGly: 3.277 ± 0.711
0.42GlnHis: 0.42 ± 0.19
2.689GlnIle: 2.689 ± 0.64
4.202GlnLys: 4.202 ± 0.541
3.866GlnLeu: 3.866 ± 0.628
1.176GlnMet: 1.176 ± 0.341
2.437GlnAsn: 2.437 ± 0.471
0.756GlnPro: 0.756 ± 0.342
1.933GlnGln: 1.933 ± 0.562
3.109GlnArg: 3.109 ± 0.568
2.689GlnSer: 2.689 ± 0.436
2.269GlnThr: 2.269 ± 0.471
1.261GlnVal: 1.261 ± 0.343
0.588GlnTrp: 0.588 ± 0.193
1.092GlnTyr: 1.092 ± 0.302
0.0GlnXaa: 0.0 ± 0.0
Arg
3.445ArgAla: 3.445 ± 0.679
0.252ArgCys: 0.252 ± 0.152
3.025ArgAsp: 3.025 ± 0.598
3.445ArgGlu: 3.445 ± 0.547
2.269ArgPhe: 2.269 ± 0.492
2.857ArgGly: 2.857 ± 0.69
0.672ArgHis: 0.672 ± 0.287
2.605ArgIle: 2.605 ± 0.415
3.697ArgLys: 3.697 ± 0.547
3.866ArgLeu: 3.866 ± 0.573
1.008ArgMet: 1.008 ± 0.256
1.765ArgAsn: 1.765 ± 0.447
0.924ArgPro: 0.924 ± 0.349
2.101ArgGln: 2.101 ± 0.588
2.017ArgArg: 2.017 ± 0.385
1.849ArgSer: 1.849 ± 0.452
2.605ArgThr: 2.605 ± 0.576
3.361ArgVal: 3.361 ± 0.413
0.672ArgTrp: 0.672 ± 0.223
2.437ArgTyr: 2.437 ± 0.596
0.0ArgXaa: 0.0 ± 0.0
Ser
4.034SerAla: 4.034 ± 1.045
0.42SerCys: 0.42 ± 0.228
4.538SerAsp: 4.538 ± 0.532
3.697SerGlu: 3.697 ± 0.534
2.605SerPhe: 2.605 ± 0.493
3.193SerGly: 3.193 ± 0.681
1.008SerHis: 1.008 ± 0.296
5.714SerIle: 5.714 ± 0.589
5.126SerLys: 5.126 ± 0.712
4.706SerLeu: 4.706 ± 0.561
1.513SerMet: 1.513 ± 0.395
3.445SerAsn: 3.445 ± 0.557
1.261SerPro: 1.261 ± 0.383
2.521SerGln: 2.521 ± 0.506
1.849SerArg: 1.849 ± 0.427
3.529SerSer: 3.529 ± 0.66
2.689SerThr: 2.689 ± 0.575
3.025SerVal: 3.025 ± 0.43
0.672SerTrp: 0.672 ± 0.28
2.521SerTyr: 2.521 ± 0.469
0.0SerXaa: 0.0 ± 0.0
Thr
4.286ThrAla: 4.286 ± 0.97
0.168ThrCys: 0.168 ± 0.118
4.118ThrAsp: 4.118 ± 0.658
3.613ThrGlu: 3.613 ± 0.495
2.437ThrPhe: 2.437 ± 0.387
5.714ThrGly: 5.714 ± 0.793
1.176ThrHis: 1.176 ± 0.274
3.529ThrIle: 3.529 ± 0.497
5.21ThrLys: 5.21 ± 0.645
4.286ThrLeu: 4.286 ± 0.621
0.924ThrMet: 0.924 ± 0.261
2.269ThrAsn: 2.269 ± 0.458
1.849ThrPro: 1.849 ± 0.436
3.025ThrGln: 3.025 ± 0.425
2.185ThrArg: 2.185 ± 0.41
2.353ThrSer: 2.353 ± 0.522
4.118ThrThr: 4.118 ± 0.547
3.529ThrVal: 3.529 ± 0.569
0.672ThrTrp: 0.672 ± 0.269
1.849ThrTyr: 1.849 ± 0.448
0.0ThrXaa: 0.0 ± 0.0
Val
4.286ValAla: 4.286 ± 0.668
0.168ValCys: 0.168 ± 0.127
4.958ValAsp: 4.958 ± 0.598
4.706ValGlu: 4.706 ± 0.728
2.773ValPhe: 2.773 ± 0.585
3.109ValGly: 3.109 ± 0.486
0.756ValHis: 0.756 ± 0.212
4.79ValIle: 4.79 ± 0.593
5.378ValLys: 5.378 ± 0.613
4.37ValLeu: 4.37 ± 0.641
1.849ValMet: 1.849 ± 0.392
3.361ValAsn: 3.361 ± 0.592
1.261ValPro: 1.261 ± 0.442
1.765ValGln: 1.765 ± 0.325
1.933ValArg: 1.933 ± 0.416
4.118ValSer: 4.118 ± 0.705
4.118ValThr: 4.118 ± 0.583
4.118ValVal: 4.118 ± 0.574
0.756ValTrp: 0.756 ± 0.284
2.437ValTyr: 2.437 ± 0.428
0.0ValXaa: 0.0 ± 0.0
Trp
1.008TrpAla: 1.008 ± 0.38
0.168TrpCys: 0.168 ± 0.125
1.008TrpAsp: 1.008 ± 0.292
1.008TrpGlu: 1.008 ± 0.247
0.672TrpPhe: 0.672 ± 0.248
0.672TrpGly: 0.672 ± 0.275
0.336TrpHis: 0.336 ± 0.193
0.756TrpIle: 0.756 ± 0.268
0.924TrpLys: 0.924 ± 0.279
0.924TrpLeu: 0.924 ± 0.288
0.0TrpMet: 0.0 ± 0.0
0.42TrpAsn: 0.42 ± 0.17
0.084TrpPro: 0.084 ± 0.075
0.336TrpGln: 0.336 ± 0.148
0.84TrpArg: 0.84 ± 0.271
1.092TrpSer: 1.092 ± 0.272
0.84TrpThr: 0.84 ± 0.267
0.588TrpVal: 0.588 ± 0.234
0.0TrpTrp: 0.0 ± 0.0
0.336TrpTyr: 0.336 ± 0.194
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.025TyrAla: 3.025 ± 0.565
0.588TyrCys: 0.588 ± 0.21
3.109TyrAsp: 3.109 ± 0.481
2.521TyrGlu: 2.521 ± 0.446
1.261TyrPhe: 1.261 ± 0.424
2.269TyrGly: 2.269 ± 0.355
0.672TyrHis: 0.672 ± 0.328
2.605TyrIle: 2.605 ± 0.482
3.193TyrLys: 3.193 ± 0.612
3.193TyrLeu: 3.193 ± 0.538
0.672TyrMet: 0.672 ± 0.247
2.269TyrAsn: 2.269 ± 0.418
2.017TyrPro: 2.017 ± 0.452
3.025TyrGln: 3.025 ± 0.558
1.681TyrArg: 1.681 ± 0.395
2.017TyrSer: 2.017 ± 0.38
2.101TyrThr: 2.101 ± 0.445
2.185TyrVal: 2.185 ± 0.404
0.336TyrTrp: 0.336 ± 0.154
2.185TyrTyr: 2.185 ± 0.451
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (11901 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski