Amino acid dipepetide frequency for Streptococcus phage Javan258

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.542AlaAla: 5.542 ± 1.452
0.407AlaCys: 0.407 ± 0.198
4.727AlaAsp: 4.727 ± 0.966
5.705AlaGlu: 5.705 ± 0.762
2.771AlaPhe: 2.771 ± 0.715
5.134AlaGly: 5.134 ± 0.82
1.059AlaHis: 1.059 ± 0.306
6.112AlaIle: 6.112 ± 1.056
6.519AlaLys: 6.519 ± 0.935
6.845AlaLeu: 6.845 ± 1.342
2.363AlaMet: 2.363 ± 0.774
3.993AlaAsn: 3.993 ± 0.513
2.445AlaPro: 2.445 ± 0.357
3.015AlaGln: 3.015 ± 0.912
2.282AlaArg: 2.282 ± 0.451
4.319AlaSer: 4.319 ± 0.952
4.808AlaThr: 4.808 ± 0.573
5.379AlaVal: 5.379 ± 1.015
0.815AlaTrp: 0.815 ± 0.292
2.526AlaTyr: 2.526 ± 0.68
0.0AlaXaa: 0.0 ± 0.0
Cys
0.326CysAla: 0.326 ± 0.179
0.081CysCys: 0.081 ± 0.082
0.163CysAsp: 0.163 ± 0.12
0.733CysGlu: 0.733 ± 0.237
0.081CysPhe: 0.081 ± 0.084
0.163CysGly: 0.163 ± 0.12
0.081CysHis: 0.081 ± 0.1
0.0CysIle: 0.0 ± 0.0
0.407CysLys: 0.407 ± 0.186
0.407CysLeu: 0.407 ± 0.199
0.081CysMet: 0.081 ± 0.093
0.163CysAsn: 0.163 ± 0.097
0.081CysPro: 0.081 ± 0.074
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.244CysSer: 0.244 ± 0.135
0.163CysThr: 0.163 ± 0.104
0.244CysVal: 0.244 ± 0.142
0.081CysTrp: 0.081 ± 0.085
0.244CysTyr: 0.244 ± 0.12
0.0CysXaa: 0.0 ± 0.0
Asp
4.238AspAla: 4.238 ± 0.731
0.163AspCys: 0.163 ± 0.113
5.379AspAsp: 5.379 ± 0.899
4.319AspGlu: 4.319 ± 0.861
3.26AspPhe: 3.26 ± 0.461
6.112AspGly: 6.112 ± 1.157
0.407AspHis: 0.407 ± 0.168
4.808AspIle: 4.808 ± 0.66
5.46AspLys: 5.46 ± 0.685
4.727AspLeu: 4.727 ± 0.736
1.467AspMet: 1.467 ± 0.33
3.993AspAsn: 3.993 ± 0.655
1.304AspPro: 1.304 ± 0.279
1.467AspGln: 1.467 ± 0.434
2.363AspArg: 2.363 ± 0.457
3.504AspSer: 3.504 ± 0.669
4.808AspThr: 4.808 ± 0.601
3.178AspVal: 3.178 ± 0.592
0.489AspTrp: 0.489 ± 0.23
3.667AspTyr: 3.667 ± 0.715
0.0AspXaa: 0.0 ± 0.0
Glu
3.912GluAla: 3.912 ± 0.628
0.163GluCys: 0.163 ± 0.102
3.26GluAsp: 3.26 ± 0.607
4.319GluGlu: 4.319 ± 0.931
2.771GluPhe: 2.771 ± 0.459
2.2GluGly: 2.2 ± 0.397
1.222GluHis: 1.222 ± 0.398
5.216GluIle: 5.216 ± 0.746
4.645GluLys: 4.645 ± 0.744
7.334GluLeu: 7.334 ± 1.266
2.037GluMet: 2.037 ± 0.511
4.401GluAsn: 4.401 ± 0.717
0.896GluPro: 0.896 ± 0.328
3.26GluGln: 3.26 ± 0.514
3.586GluArg: 3.586 ± 0.675
1.63GluSer: 1.63 ± 0.351
3.178GluThr: 3.178 ± 0.543
4.482GluVal: 4.482 ± 0.656
0.978GluTrp: 0.978 ± 0.273
2.608GluTyr: 2.608 ± 0.57
0.0GluXaa: 0.0 ± 0.0
Phe
2.608PheAla: 2.608 ± 0.674
0.081PheCys: 0.081 ± 0.084
4.238PheAsp: 4.238 ± 0.605
3.586PheGlu: 3.586 ± 0.581
0.978PhePhe: 0.978 ± 0.314
2.445PheGly: 2.445 ± 0.636
0.815PheHis: 0.815 ± 0.265
3.178PheIle: 3.178 ± 0.518
2.689PheLys: 2.689 ± 0.377
2.526PheLeu: 2.526 ± 0.54
0.978PheMet: 0.978 ± 0.241
2.771PheAsn: 2.771 ± 0.314
0.57PhePro: 0.57 ± 0.243
1.141PheGln: 1.141 ± 0.364
0.896PheArg: 0.896 ± 0.242
3.749PheSer: 3.749 ± 0.753
2.608PheThr: 2.608 ± 0.563
2.363PheVal: 2.363 ± 0.36
0.244PheTrp: 0.244 ± 0.127
1.222PheTyr: 1.222 ± 0.316
0.0PheXaa: 0.0 ± 0.0
Gly
4.727GlyAla: 4.727 ± 1.206
0.163GlyCys: 0.163 ± 0.113
2.852GlyAsp: 2.852 ± 0.522
3.667GlyGlu: 3.667 ± 0.59
3.586GlyPhe: 3.586 ± 0.565
3.993GlyGly: 3.993 ± 0.598
0.896GlyHis: 0.896 ± 0.317
5.297GlyIle: 5.297 ± 0.961
6.438GlyLys: 6.438 ± 0.718
4.808GlyLeu: 4.808 ± 1.021
2.608GlyMet: 2.608 ± 0.568
4.075GlyAsn: 4.075 ± 0.826
0.407GlyPro: 0.407 ± 0.141
2.771GlyGln: 2.771 ± 0.547
3.178GlyArg: 3.178 ± 0.623
4.319GlySer: 4.319 ± 0.913
5.297GlyThr: 5.297 ± 0.788
5.867GlyVal: 5.867 ± 0.971
0.815GlyTrp: 0.815 ± 0.229
2.526GlyTyr: 2.526 ± 0.557
0.0GlyXaa: 0.0 ± 0.0
His
0.57HisAla: 0.57 ± 0.252
0.163HisCys: 0.163 ± 0.111
0.652HisAsp: 0.652 ± 0.238
1.059HisGlu: 1.059 ± 0.283
0.733HisPhe: 0.733 ± 0.198
0.652HisGly: 0.652 ± 0.246
0.489HisHis: 0.489 ± 0.185
1.304HisIle: 1.304 ± 0.273
1.304HisLys: 1.304 ± 0.398
0.733HisLeu: 0.733 ± 0.22
0.489HisMet: 0.489 ± 0.253
0.815HisAsn: 0.815 ± 0.255
0.733HisPro: 0.733 ± 0.295
0.489HisGln: 0.489 ± 0.193
0.733HisArg: 0.733 ± 0.272
0.978HisSer: 0.978 ± 0.298
1.304HisThr: 1.304 ± 0.305
0.652HisVal: 0.652 ± 0.201
0.163HisTrp: 0.163 ± 0.129
0.733HisTyr: 0.733 ± 0.326
0.0HisXaa: 0.0 ± 0.0
Ile
5.46IleAla: 5.46 ± 0.834
0.081IleCys: 0.081 ± 0.079
5.053IleAsp: 5.053 ± 0.535
5.542IleGlu: 5.542 ± 0.795
1.548IlePhe: 1.548 ± 0.362
5.542IleGly: 5.542 ± 0.916
1.141IleHis: 1.141 ± 0.312
3.26IleIle: 3.26 ± 0.645
6.764IleLys: 6.764 ± 0.728
4.482IleLeu: 4.482 ± 0.589
1.63IleMet: 1.63 ± 0.452
4.971IleAsn: 4.971 ± 0.659
1.956IlePro: 1.956 ± 0.519
3.097IleGln: 3.097 ± 0.536
3.423IleArg: 3.423 ± 0.476
4.971IleSer: 4.971 ± 0.653
5.134IleThr: 5.134 ± 0.803
2.282IleVal: 2.282 ± 0.708
0.407IleTrp: 0.407 ± 0.195
2.852IleTyr: 2.852 ± 0.432
0.0IleXaa: 0.0 ± 0.0
Lys
6.682LysAla: 6.682 ± 0.928
0.326LysCys: 0.326 ± 0.147
4.319LysAsp: 4.319 ± 0.642
5.786LysGlu: 5.786 ± 0.923
2.689LysPhe: 2.689 ± 0.396
5.216LysGly: 5.216 ± 1.033
2.037LysHis: 2.037 ± 0.442
4.727LysIle: 4.727 ± 0.652
6.682LysLys: 6.682 ± 1.021
5.867LysLeu: 5.867 ± 0.638
1.874LysMet: 1.874 ± 0.504
4.808LysAsn: 4.808 ± 0.592
2.119LysPro: 2.119 ± 0.486
2.526LysGln: 2.526 ± 0.587
3.586LysArg: 3.586 ± 0.626
6.193LysSer: 6.193 ± 0.967
5.134LysThr: 5.134 ± 0.613
5.705LysVal: 5.705 ± 0.654
0.896LysTrp: 0.896 ± 0.307
2.526LysTyr: 2.526 ± 0.389
0.0LysXaa: 0.0 ± 0.0
Leu
6.927LeuAla: 6.927 ± 1.07
0.652LeuCys: 0.652 ± 0.239
6.112LeuAsp: 6.112 ± 0.813
4.971LeuGlu: 4.971 ± 0.935
2.689LeuPhe: 2.689 ± 0.519
4.808LeuGly: 4.808 ± 0.94
1.304LeuHis: 1.304 ± 0.409
4.89LeuIle: 4.89 ± 0.724
6.845LeuLys: 6.845 ± 0.924
4.89LeuLeu: 4.89 ± 0.632
1.467LeuMet: 1.467 ± 0.489
5.297LeuAsn: 5.297 ± 0.691
2.608LeuPro: 2.608 ± 0.488
2.689LeuGln: 2.689 ± 0.506
1.874LeuArg: 1.874 ± 0.414
6.519LeuSer: 6.519 ± 0.836
5.379LeuThr: 5.379 ± 0.55
4.401LeuVal: 4.401 ± 0.507
0.652LeuTrp: 0.652 ± 0.254
2.2LeuTyr: 2.2 ± 0.518
0.0LeuXaa: 0.0 ± 0.0
Met
2.445MetAla: 2.445 ± 0.502
0.081MetCys: 0.081 ± 0.086
1.304MetAsp: 1.304 ± 0.336
1.385MetGlu: 1.385 ± 0.282
0.896MetPhe: 0.896 ± 0.199
1.141MetGly: 1.141 ± 0.303
0.326MetHis: 0.326 ± 0.187
1.141MetIle: 1.141 ± 0.295
2.037MetLys: 2.037 ± 0.353
2.608MetLeu: 2.608 ± 0.558
0.57MetMet: 0.57 ± 0.236
1.059MetAsn: 1.059 ± 0.275
0.652MetPro: 0.652 ± 0.257
1.548MetGln: 1.548 ± 0.392
1.141MetArg: 1.141 ± 0.346
2.119MetSer: 2.119 ± 0.476
2.282MetThr: 2.282 ± 0.429
1.304MetVal: 1.304 ± 0.357
0.326MetTrp: 0.326 ± 0.167
0.896MetTyr: 0.896 ± 0.315
0.0MetXaa: 0.0 ± 0.0
Asn
5.705AsnAla: 5.705 ± 0.863
0.081AsnCys: 0.081 ± 0.075
3.912AsnAsp: 3.912 ± 0.636
3.015AsnGlu: 3.015 ± 0.591
2.037AsnPhe: 2.037 ± 0.381
6.845AsnGly: 6.845 ± 1.185
0.57AsnHis: 0.57 ± 0.197
4.319AsnIle: 4.319 ± 0.657
4.238AsnLys: 4.238 ± 0.624
4.156AsnLeu: 4.156 ± 0.579
1.385AsnMet: 1.385 ± 0.294
4.238AsnAsn: 4.238 ± 0.873
3.015AsnPro: 3.015 ± 0.599
2.689AsnGln: 2.689 ± 0.665
1.874AsnArg: 1.874 ± 0.429
3.912AsnSer: 3.912 ± 0.563
3.26AsnThr: 3.26 ± 0.508
2.689AsnVal: 2.689 ± 0.706
0.815AsnTrp: 0.815 ± 0.276
2.852AsnTyr: 2.852 ± 0.64
0.0AsnXaa: 0.0 ± 0.0
Pro
1.63ProAla: 1.63 ± 0.324
0.244ProCys: 0.244 ± 0.155
1.874ProAsp: 1.874 ± 0.392
1.467ProGlu: 1.467 ± 0.383
1.548ProPhe: 1.548 ± 0.379
1.385ProGly: 1.385 ± 0.432
0.407ProHis: 0.407 ± 0.192
1.956ProIle: 1.956 ± 0.39
2.037ProLys: 2.037 ± 0.5
2.037ProLeu: 2.037 ± 0.375
0.57ProMet: 0.57 ± 0.243
1.874ProAsn: 1.874 ± 0.501
0.733ProPro: 0.733 ± 0.235
1.467ProGln: 1.467 ± 0.411
0.815ProArg: 0.815 ± 0.295
1.874ProSer: 1.874 ± 0.358
2.363ProThr: 2.363 ± 0.6
1.711ProVal: 1.711 ± 0.421
0.244ProTrp: 0.244 ± 0.129
1.222ProTyr: 1.222 ± 0.394
0.0ProXaa: 0.0 ± 0.0
Gln
4.075GlnAla: 4.075 ± 0.789
0.163GlnCys: 0.163 ± 0.114
1.63GlnAsp: 1.63 ± 0.428
2.2GlnGlu: 2.2 ± 0.559
0.896GlnPhe: 0.896 ± 0.285
3.667GlnGly: 3.667 ± 0.762
0.733GlnHis: 0.733 ± 0.279
2.608GlnIle: 2.608 ± 0.42
3.097GlnLys: 3.097 ± 0.451
2.526GlnLeu: 2.526 ± 0.459
1.304GlnMet: 1.304 ± 0.329
2.526GlnAsn: 2.526 ± 0.471
0.978GlnPro: 0.978 ± 0.29
2.282GlnGln: 2.282 ± 0.713
1.141GlnArg: 1.141 ± 0.276
3.178GlnSer: 3.178 ± 0.545
1.793GlnThr: 1.793 ± 0.357
3.015GlnVal: 3.015 ± 0.586
0.326GlnTrp: 0.326 ± 0.137
2.445GlnTyr: 2.445 ± 0.426
0.0GlnXaa: 0.0 ± 0.0
Arg
2.445ArgAla: 2.445 ± 0.656
0.081ArgCys: 0.081 ± 0.075
2.037ArgAsp: 2.037 ± 0.367
1.956ArgGlu: 1.956 ± 0.462
1.956ArgPhe: 1.956 ± 0.412
2.608ArgGly: 2.608 ± 0.42
0.326ArgHis: 0.326 ± 0.177
2.282ArgIle: 2.282 ± 0.464
3.178ArgLys: 3.178 ± 0.567
3.667ArgLeu: 3.667 ± 0.699
0.652ArgMet: 0.652 ± 0.371
2.608ArgAsn: 2.608 ± 0.394
1.304ArgPro: 1.304 ± 0.273
1.304ArgGln: 1.304 ± 0.3
1.467ArgArg: 1.467 ± 0.416
1.548ArgSer: 1.548 ± 0.372
2.526ArgThr: 2.526 ± 0.528
2.526ArgVal: 2.526 ± 0.424
0.733ArgTrp: 0.733 ± 0.321
2.363ArgTyr: 2.363 ± 0.597
0.0ArgXaa: 0.0 ± 0.0
Ser
5.867SerAla: 5.867 ± 1.542
0.163SerCys: 0.163 ± 0.124
4.564SerAsp: 4.564 ± 0.592
3.015SerGlu: 3.015 ± 0.545
3.178SerPhe: 3.178 ± 0.681
4.156SerGly: 4.156 ± 0.998
0.407SerHis: 0.407 ± 0.171
3.423SerIle: 3.423 ± 0.468
4.482SerLys: 4.482 ± 0.587
4.89SerLeu: 4.89 ± 0.543
1.385SerMet: 1.385 ± 0.255
4.075SerAsn: 4.075 ± 0.771
1.874SerPro: 1.874 ± 0.39
3.178SerGln: 3.178 ± 0.494
1.956SerArg: 1.956 ± 0.413
4.89SerSer: 4.89 ± 0.688
4.971SerThr: 4.971 ± 0.787
4.645SerVal: 4.645 ± 0.631
0.652SerTrp: 0.652 ± 0.215
3.178SerTyr: 3.178 ± 0.542
0.0SerXaa: 0.0 ± 0.0
Thr
4.401ThrAla: 4.401 ± 0.708
0.081ThrCys: 0.081 ± 0.082
4.238ThrAsp: 4.238 ± 0.736
3.423ThrGlu: 3.423 ± 0.615
3.015ThrPhe: 3.015 ± 0.531
4.89ThrGly: 4.89 ± 0.669
0.815ThrHis: 0.815 ± 0.22
6.112ThrIle: 6.112 ± 0.752
4.808ThrLys: 4.808 ± 0.624
6.03ThrLeu: 6.03 ± 0.833
1.548ThrMet: 1.548 ± 0.516
3.178ThrAsn: 3.178 ± 0.588
2.771ThrPro: 2.771 ± 0.564
2.852ThrGln: 2.852 ± 0.548
2.119ThrArg: 2.119 ± 0.375
3.667ThrSer: 3.667 ± 0.765
4.401ThrThr: 4.401 ± 0.928
6.519ThrVal: 6.519 ± 0.603
1.222ThrTrp: 1.222 ± 0.331
3.015ThrTyr: 3.015 ± 0.903
0.0ThrXaa: 0.0 ± 0.0
Val
5.623ValAla: 5.623 ± 1.144
0.163ValCys: 0.163 ± 0.127
5.053ValAsp: 5.053 ± 0.762
3.912ValGlu: 3.912 ± 0.66
2.852ValPhe: 2.852 ± 0.516
4.808ValGly: 4.808 ± 0.953
0.815ValHis: 0.815 ± 0.264
5.216ValIle: 5.216 ± 0.6
4.156ValLys: 4.156 ± 0.682
3.749ValLeu: 3.749 ± 0.666
1.548ValMet: 1.548 ± 0.312
3.178ValAsn: 3.178 ± 0.423
1.385ValPro: 1.385 ± 0.305
2.037ValGln: 2.037 ± 0.41
2.2ValArg: 2.2 ± 0.515
4.156ValSer: 4.156 ± 0.578
5.542ValThr: 5.542 ± 0.791
3.83ValVal: 3.83 ± 0.562
0.815ValTrp: 0.815 ± 0.286
2.445ValTyr: 2.445 ± 0.449
0.0ValXaa: 0.0 ± 0.0
Trp
0.733TrpAla: 0.733 ± 0.268
0.0TrpCys: 0.0 ± 0.0
0.733TrpAsp: 0.733 ± 0.294
0.407TrpGlu: 0.407 ± 0.163
0.57TrpPhe: 0.57 ± 0.21
0.815TrpGly: 0.815 ± 0.288
0.326TrpHis: 0.326 ± 0.155
0.489TrpIle: 0.489 ± 0.197
0.733TrpLys: 0.733 ± 0.308
0.815TrpLeu: 0.815 ± 0.264
0.326TrpMet: 0.326 ± 0.178
0.733TrpAsn: 0.733 ± 0.235
0.0TrpPro: 0.0 ± 0.0
0.407TrpGln: 0.407 ± 0.23
0.652TrpArg: 0.652 ± 0.182
0.815TrpSer: 0.815 ± 0.276
1.222TrpThr: 1.222 ± 0.532
0.733TrpVal: 0.733 ± 0.225
0.244TrpTrp: 0.244 ± 0.177
0.57TrpTyr: 0.57 ± 0.235
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.852TyrAla: 2.852 ± 0.658
0.407TyrCys: 0.407 ± 0.17
3.015TyrAsp: 3.015 ± 0.585
1.63TyrGlu: 1.63 ± 0.4
1.63TyrPhe: 1.63 ± 0.427
1.711TyrGly: 1.711 ± 0.369
0.57TyrHis: 0.57 ± 0.306
3.26TyrIle: 3.26 ± 0.529
3.178TyrLys: 3.178 ± 0.503
4.075TyrLeu: 4.075 ± 0.715
0.896TyrMet: 0.896 ± 0.325
2.852TyrAsn: 2.852 ± 0.545
1.63TyrPro: 1.63 ± 0.358
2.363TyrGln: 2.363 ± 0.516
2.363TyrArg: 2.363 ± 0.426
2.363TyrSer: 2.363 ± 0.398
2.934TyrThr: 2.934 ± 0.707
1.956TyrVal: 1.956 ± 0.415
0.407TyrTrp: 0.407 ± 0.168
1.63TyrTyr: 1.63 ± 0.373
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (12272 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski