Amino acid dipepetide frequency for Streptococcus phage Javan141

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.786AlaAla: 2.786 ± 0.792
0.159AlaCys: 0.159 ± 0.117
4.219AlaAsp: 4.219 ± 0.627
6.607AlaGlu: 6.607 ± 0.901
2.707AlaPhe: 2.707 ± 0.513
4.697AlaGly: 4.697 ± 0.643
0.637AlaHis: 0.637 ± 0.248
5.891AlaIle: 5.891 ± 0.782
6.766AlaLys: 6.766 ± 0.696
4.537AlaLeu: 4.537 ± 0.52
1.99AlaMet: 1.99 ± 0.366
3.98AlaAsn: 3.98 ± 0.53
1.592AlaPro: 1.592 ± 0.369
2.309AlaGln: 2.309 ± 0.405
2.945AlaArg: 2.945 ± 0.625
4.537AlaSer: 4.537 ± 0.699
3.503AlaThr: 3.503 ± 0.543
4.617AlaVal: 4.617 ± 0.735
0.796AlaTrp: 0.796 ± 0.237
2.866AlaTyr: 2.866 ± 0.419
0.0AlaXaa: 0.0 ± 0.0
Cys
0.398CysAla: 0.398 ± 0.191
0.0CysCys: 0.0 ± 0.0
0.239CysAsp: 0.239 ± 0.133
0.796CysGlu: 0.796 ± 0.269
0.239CysPhe: 0.239 ± 0.139
0.557CysGly: 0.557 ± 0.246
0.08CysHis: 0.08 ± 0.083
0.478CysIle: 0.478 ± 0.228
0.318CysLys: 0.318 ± 0.164
0.557CysLeu: 0.557 ± 0.205
0.0CysMet: 0.0 ± 0.0
0.318CysAsn: 0.318 ± 0.17
0.239CysPro: 0.239 ± 0.137
0.318CysGln: 0.318 ± 0.212
0.557CysArg: 0.557 ± 0.219
0.637CysSer: 0.637 ± 0.192
0.0CysThr: 0.0 ± 0.0
0.159CysVal: 0.159 ± 0.105
0.08CysTrp: 0.08 ± 0.067
0.557CysTyr: 0.557 ± 0.313
0.0CysXaa: 0.0 ± 0.0
Asp
3.423AspAla: 3.423 ± 0.443
0.955AspCys: 0.955 ± 0.271
3.821AspAsp: 3.821 ± 0.59
4.936AspGlu: 4.936 ± 0.63
2.945AspPhe: 2.945 ± 0.592
4.617AspGly: 4.617 ± 0.73
0.716AspHis: 0.716 ± 0.21
3.98AspIle: 3.98 ± 0.522
5.97AspLys: 5.97 ± 0.808
4.776AspLeu: 4.776 ± 0.635
1.274AspMet: 1.274 ± 0.349
4.378AspAsn: 4.378 ± 0.677
0.876AspPro: 0.876 ± 0.216
1.114AspGln: 1.114 ± 0.256
3.184AspArg: 3.184 ± 0.537
2.945AspSer: 2.945 ± 0.515
5.174AspThr: 5.174 ± 0.546
3.98AspVal: 3.98 ± 0.679
0.955AspTrp: 0.955 ± 0.348
3.662AspTyr: 3.662 ± 0.547
0.0AspXaa: 0.0 ± 0.0
Glu
5.493GluAla: 5.493 ± 0.779
0.398GluCys: 0.398 ± 0.175
3.503GluAsp: 3.503 ± 0.632
5.174GluGlu: 5.174 ± 0.63
3.264GluPhe: 3.264 ± 0.523
2.468GluGly: 2.468 ± 0.511
1.114GluHis: 1.114 ± 0.36
5.413GluIle: 5.413 ± 0.705
7.164GluLys: 7.164 ± 0.901
7.164GluLeu: 7.164 ± 0.779
1.831GluMet: 1.831 ± 0.411
3.582GluAsn: 3.582 ± 0.709
1.512GluPro: 1.512 ± 0.321
3.901GluGln: 3.901 ± 0.555
3.821GluArg: 3.821 ± 0.769
3.343GluSer: 3.343 ± 0.429
3.741GluThr: 3.741 ± 0.445
4.617GluVal: 4.617 ± 0.632
0.478GluTrp: 0.478 ± 0.218
2.945GluTyr: 2.945 ± 0.445
0.0GluXaa: 0.0 ± 0.0
Phe
3.662PheAla: 3.662 ± 0.566
0.318PheCys: 0.318 ± 0.178
3.821PheAsp: 3.821 ± 0.594
3.264PheGlu: 3.264 ± 0.534
1.114PhePhe: 1.114 ± 0.378
3.343PheGly: 3.343 ± 0.464
0.239PheHis: 0.239 ± 0.13
2.388PheIle: 2.388 ± 0.46
3.264PheLys: 3.264 ± 0.505
2.309PheLeu: 2.309 ± 0.462
0.955PheMet: 0.955 ± 0.307
2.627PheAsn: 2.627 ± 0.594
1.035PhePro: 1.035 ± 0.285
0.955PheGln: 0.955 ± 0.292
1.194PheArg: 1.194 ± 0.353
2.388PheSer: 2.388 ± 0.496
2.707PheThr: 2.707 ± 0.416
2.707PheVal: 2.707 ± 0.563
0.716PheTrp: 0.716 ± 0.195
1.274PheTyr: 1.274 ± 0.249
0.0PheXaa: 0.0 ± 0.0
Gly
3.662GlyAla: 3.662 ± 0.685
0.478GlyCys: 0.478 ± 0.194
4.537GlyAsp: 4.537 ± 0.672
3.184GlyGlu: 3.184 ± 0.419
1.911GlyPhe: 1.911 ± 0.364
4.06GlyGly: 4.06 ± 0.787
1.035GlyHis: 1.035 ± 0.284
5.97GlyIle: 5.97 ± 0.7
6.528GlyLys: 6.528 ± 0.748
4.697GlyLeu: 4.697 ± 0.615
2.468GlyMet: 2.468 ± 0.499
3.184GlyAsn: 3.184 ± 0.483
1.433GlyPro: 1.433 ± 0.475
2.309GlyGln: 2.309 ± 0.386
1.831GlyArg: 1.831 ± 0.418
4.458GlySer: 4.458 ± 0.93
4.856GlyThr: 4.856 ± 0.73
4.378GlyVal: 4.378 ± 0.559
1.114GlyTrp: 1.114 ± 0.323
3.423GlyTyr: 3.423 ± 0.521
0.0GlyXaa: 0.0 ± 0.0
His
0.876HisAla: 0.876 ± 0.386
0.159HisCys: 0.159 ± 0.105
0.876HisAsp: 0.876 ± 0.249
1.114HisGlu: 1.114 ± 0.278
0.637HisPhe: 0.637 ± 0.191
0.478HisGly: 0.478 ± 0.175
0.08HisHis: 0.08 ± 0.083
1.035HisIle: 1.035 ± 0.305
1.751HisLys: 1.751 ± 0.428
0.876HisLeu: 0.876 ± 0.219
0.557HisMet: 0.557 ± 0.219
1.114HisAsn: 1.114 ± 0.341
0.478HisPro: 0.478 ± 0.204
0.478HisGln: 0.478 ± 0.174
0.398HisArg: 0.398 ± 0.143
0.478HisSer: 0.478 ± 0.2
0.478HisThr: 0.478 ± 0.2
0.955HisVal: 0.955 ± 0.279
0.239HisTrp: 0.239 ± 0.134
0.478HisTyr: 0.478 ± 0.186
0.0HisXaa: 0.0 ± 0.0
Ile
5.732IleAla: 5.732 ± 0.647
0.716IleCys: 0.716 ± 0.256
5.572IleAsp: 5.572 ± 0.605
5.652IleGlu: 5.652 ± 0.858
2.547IlePhe: 2.547 ± 0.593
4.776IleGly: 4.776 ± 0.516
1.035IleHis: 1.035 ± 0.237
3.503IleIle: 3.503 ± 0.494
5.811IleLys: 5.811 ± 0.734
5.572IleLeu: 5.572 ± 0.526
1.194IleMet: 1.194 ± 0.285
4.219IleAsn: 4.219 ± 0.606
1.99IlePro: 1.99 ± 0.387
2.07IleGln: 2.07 ± 0.467
1.911IleArg: 1.911 ± 0.355
4.299IleSer: 4.299 ± 0.534
5.015IleThr: 5.015 ± 0.609
5.413IleVal: 5.413 ± 0.792
0.876IleTrp: 0.876 ± 0.23
3.343IleTyr: 3.343 ± 0.499
0.0IleXaa: 0.0 ± 0.0
Lys
6.13LysAla: 6.13 ± 0.617
0.478LysCys: 0.478 ± 0.186
5.095LysAsp: 5.095 ± 0.663
6.368LysGlu: 6.368 ± 0.906
2.627LysPhe: 2.627 ± 0.391
4.856LysGly: 4.856 ± 0.819
1.512LysHis: 1.512 ± 0.289
6.209LysIle: 6.209 ± 0.919
7.403LysLys: 7.403 ± 1.256
6.687LysLeu: 6.687 ± 0.619
2.786LysMet: 2.786 ± 0.539
5.015LysAsn: 5.015 ± 0.612
2.945LysPro: 2.945 ± 0.544
3.98LysGln: 3.98 ± 0.584
4.219LysArg: 4.219 ± 0.614
4.617LysSer: 4.617 ± 0.72
5.732LysThr: 5.732 ± 0.753
6.766LysVal: 6.766 ± 0.718
0.955LysTrp: 0.955 ± 0.318
3.423LysTyr: 3.423 ± 0.462
0.0LysXaa: 0.0 ± 0.0
Leu
5.572LeuAla: 5.572 ± 0.877
0.239LeuCys: 0.239 ± 0.138
5.413LeuAsp: 5.413 ± 0.733
6.368LeuGlu: 6.368 ± 0.83
3.105LeuPhe: 3.105 ± 0.473
4.458LeuGly: 4.458 ± 0.625
0.796LeuHis: 0.796 ± 0.249
5.891LeuIle: 5.891 ± 0.691
7.562LeuLys: 7.562 ± 0.686
5.572LeuLeu: 5.572 ± 0.556
1.592LeuMet: 1.592 ± 0.311
4.458LeuAsn: 4.458 ± 0.467
1.592LeuPro: 1.592 ± 0.321
3.503LeuGln: 3.503 ± 0.502
3.582LeuArg: 3.582 ± 0.659
5.891LeuSer: 5.891 ± 0.782
5.413LeuThr: 5.413 ± 0.599
4.139LeuVal: 4.139 ± 0.64
0.955LeuTrp: 0.955 ± 0.245
2.707LeuTyr: 2.707 ± 0.462
0.0LeuXaa: 0.0 ± 0.0
Met
1.99MetAla: 1.99 ± 0.451
0.0MetCys: 0.0 ± 0.0
0.876MetAsp: 0.876 ± 0.302
1.911MetGlu: 1.911 ± 0.482
1.114MetPhe: 1.114 ± 0.279
0.955MetGly: 0.955 ± 0.31
0.239MetHis: 0.239 ± 0.139
1.274MetIle: 1.274 ± 0.294
1.433MetLys: 1.433 ± 0.333
2.388MetLeu: 2.388 ± 0.563
0.557MetMet: 0.557 ± 0.207
1.114MetAsn: 1.114 ± 0.245
1.035MetPro: 1.035 ± 0.253
1.194MetGln: 1.194 ± 0.311
1.194MetArg: 1.194 ± 0.357
1.672MetSer: 1.672 ± 0.301
2.149MetThr: 2.149 ± 0.374
1.035MetVal: 1.035 ± 0.273
0.239MetTrp: 0.239 ± 0.129
0.716MetTyr: 0.716 ± 0.258
0.0MetXaa: 0.0 ± 0.0
Asn
3.582AsnAla: 3.582 ± 0.554
0.239AsnCys: 0.239 ± 0.122
3.423AsnAsp: 3.423 ± 0.499
3.503AsnGlu: 3.503 ± 0.454
1.911AsnPhe: 1.911 ± 0.266
5.413AsnGly: 5.413 ± 0.645
0.876AsnHis: 0.876 ± 0.296
4.06AsnIle: 4.06 ± 0.64
5.413AsnLys: 5.413 ± 0.658
5.652AsnLeu: 5.652 ± 0.728
1.194AsnMet: 1.194 ± 0.372
3.821AsnAsn: 3.821 ± 0.557
1.99AsnPro: 1.99 ± 0.316
2.149AsnGln: 2.149 ± 0.504
1.592AsnArg: 1.592 ± 0.318
3.105AsnSer: 3.105 ± 0.471
2.627AsnThr: 2.627 ± 0.436
3.423AsnVal: 3.423 ± 0.564
0.876AsnTrp: 0.876 ± 0.268
2.309AsnTyr: 2.309 ± 0.413
0.0AsnXaa: 0.0 ± 0.0
Pro
1.592ProAla: 1.592 ± 0.29
0.239ProCys: 0.239 ± 0.136
2.468ProAsp: 2.468 ± 0.548
1.751ProGlu: 1.751 ± 0.398
1.274ProPhe: 1.274 ± 0.283
1.035ProGly: 1.035 ± 0.26
0.716ProHis: 0.716 ± 0.212
1.592ProIle: 1.592 ± 0.346
2.229ProLys: 2.229 ± 0.384
1.433ProLeu: 1.433 ± 0.263
0.557ProMet: 0.557 ± 0.19
1.831ProAsn: 1.831 ± 0.345
0.876ProPro: 0.876 ± 0.264
1.751ProGln: 1.751 ± 0.402
0.716ProArg: 0.716 ± 0.201
2.468ProSer: 2.468 ± 0.635
1.035ProThr: 1.035 ± 0.313
1.831ProVal: 1.831 ± 0.315
0.239ProTrp: 0.239 ± 0.138
1.512ProTyr: 1.512 ± 0.328
0.0ProXaa: 0.0 ± 0.0
Gln
3.105GlnAla: 3.105 ± 0.433
0.159GlnCys: 0.159 ± 0.115
1.512GlnAsp: 1.512 ± 0.331
2.866GlnGlu: 2.866 ± 0.472
1.353GlnPhe: 1.353 ± 0.351
2.786GlnGly: 2.786 ± 0.549
0.637GlnHis: 0.637 ± 0.213
3.503GlnIle: 3.503 ± 0.563
3.741GlnLys: 3.741 ± 0.592
3.343GlnLeu: 3.343 ± 0.459
0.876GlnMet: 0.876 ± 0.248
2.627GlnAsn: 2.627 ± 0.513
0.716GlnPro: 0.716 ± 0.304
1.592GlnGln: 1.592 ± 0.361
1.353GlnArg: 1.353 ± 0.272
2.945GlnSer: 2.945 ± 0.548
1.831GlnThr: 1.831 ± 0.332
1.672GlnVal: 1.672 ± 0.325
0.398GlnTrp: 0.398 ± 0.151
1.831GlnTyr: 1.831 ± 0.363
0.0GlnXaa: 0.0 ± 0.0
Arg
3.264ArgAla: 3.264 ± 0.533
0.318ArgCys: 0.318 ± 0.159
2.07ArgAsp: 2.07 ± 0.349
2.945ArgGlu: 2.945 ± 0.548
1.672ArgPhe: 1.672 ± 0.358
2.388ArgGly: 2.388 ± 0.492
0.557ArgHis: 0.557 ± 0.214
2.309ArgIle: 2.309 ± 0.464
3.105ArgLys: 3.105 ± 0.579
4.617ArgLeu: 4.617 ± 0.692
0.716ArgMet: 0.716 ± 0.228
1.433ArgAsn: 1.433 ± 0.289
0.955ArgPro: 0.955 ± 0.259
2.07ArgGln: 2.07 ± 0.466
2.229ArgArg: 2.229 ± 0.515
2.786ArgSer: 2.786 ± 0.513
2.229ArgThr: 2.229 ± 0.51
2.547ArgVal: 2.547 ± 0.479
0.637ArgTrp: 0.637 ± 0.222
1.831ArgTyr: 1.831 ± 0.441
0.0ArgXaa: 0.0 ± 0.0
Ser
3.901SerAla: 3.901 ± 0.634
0.318SerCys: 0.318 ± 0.22
4.06SerAsp: 4.06 ± 0.51
4.06SerGlu: 4.06 ± 0.541
3.264SerPhe: 3.264 ± 0.533
5.732SerGly: 5.732 ± 0.714
0.716SerHis: 0.716 ± 0.233
4.697SerIle: 4.697 ± 0.593
4.776SerLys: 4.776 ± 0.667
5.015SerLeu: 5.015 ± 0.527
1.114SerMet: 1.114 ± 0.355
3.105SerAsn: 3.105 ± 0.643
1.512SerPro: 1.512 ± 0.464
3.025SerGln: 3.025 ± 0.495
2.627SerArg: 2.627 ± 0.556
3.582SerSer: 3.582 ± 0.765
4.06SerThr: 4.06 ± 0.666
4.856SerVal: 4.856 ± 0.563
0.796SerTrp: 0.796 ± 0.293
2.468SerTyr: 2.468 ± 0.414
0.0SerXaa: 0.0 ± 0.0
Thr
4.139ThrAla: 4.139 ± 0.545
0.08ThrCys: 0.08 ± 0.083
2.627ThrAsp: 2.627 ± 0.503
3.184ThrGlu: 3.184 ± 0.559
3.582ThrPhe: 3.582 ± 0.735
5.174ThrGly: 5.174 ± 0.711
1.194ThrHis: 1.194 ± 0.306
5.493ThrIle: 5.493 ± 0.659
4.537ThrLys: 4.537 ± 0.61
4.617ThrLeu: 4.617 ± 0.719
0.796ThrMet: 0.796 ± 0.283
3.503ThrAsn: 3.503 ± 0.47
1.751ThrPro: 1.751 ± 0.306
2.07ThrGln: 2.07 ± 0.435
2.149ThrArg: 2.149 ± 0.384
4.537ThrSer: 4.537 ± 0.432
4.139ThrThr: 4.139 ± 0.766
4.697ThrVal: 4.697 ± 0.818
0.637ThrTrp: 0.637 ± 0.199
2.627ThrTyr: 2.627 ± 0.448
0.0ThrXaa: 0.0 ± 0.0
Val
5.015ValAla: 5.015 ± 0.614
0.398ValCys: 0.398 ± 0.196
5.652ValAsp: 5.652 ± 0.688
5.015ValGlu: 5.015 ± 0.713
2.627ValPhe: 2.627 ± 0.444
4.139ValGly: 4.139 ± 0.74
0.637ValHis: 0.637 ± 0.201
3.901ValIle: 3.901 ± 0.683
5.095ValLys: 5.095 ± 0.62
3.821ValLeu: 3.821 ± 0.439
1.592ValMet: 1.592 ± 0.359
3.901ValAsn: 3.901 ± 0.557
2.707ValPro: 2.707 ± 0.582
1.592ValGln: 1.592 ± 0.362
2.786ValArg: 2.786 ± 0.414
5.493ValSer: 5.493 ± 0.631
4.219ValThr: 4.219 ± 0.844
4.378ValVal: 4.378 ± 0.82
0.557ValTrp: 0.557 ± 0.191
2.309ValTyr: 2.309 ± 0.473
0.0ValXaa: 0.0 ± 0.0
Trp
0.796TrpAla: 0.796 ± 0.26
0.159TrpCys: 0.159 ± 0.121
1.035TrpAsp: 1.035 ± 0.307
0.716TrpGlu: 0.716 ± 0.236
0.398TrpPhe: 0.398 ± 0.18
0.876TrpGly: 0.876 ± 0.291
0.159TrpHis: 0.159 ± 0.11
0.876TrpIle: 0.876 ± 0.239
1.194TrpLys: 1.194 ± 0.324
1.114TrpLeu: 1.114 ± 0.351
0.08TrpMet: 0.08 ± 0.073
0.557TrpAsn: 0.557 ± 0.188
0.159TrpPro: 0.159 ± 0.099
0.637TrpGln: 0.637 ± 0.253
0.398TrpArg: 0.398 ± 0.172
0.955TrpSer: 0.955 ± 0.231
0.876TrpThr: 0.876 ± 0.269
0.716TrpVal: 0.716 ± 0.247
0.08TrpTrp: 0.08 ± 0.089
0.478TrpTyr: 0.478 ± 0.193
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.025TyrAla: 3.025 ± 0.657
0.716TyrCys: 0.716 ± 0.272
2.866TyrAsp: 2.866 ± 0.476
1.512TyrGlu: 1.512 ± 0.365
1.99TyrPhe: 1.99 ± 0.481
2.707TyrGly: 2.707 ± 0.561
0.557TyrHis: 0.557 ± 0.208
2.786TyrIle: 2.786 ± 0.549
3.741TyrLys: 3.741 ± 0.63
4.06TyrLeu: 4.06 ± 0.551
0.876TyrMet: 0.876 ± 0.326
2.388TyrAsn: 2.388 ± 0.44
1.751TyrPro: 1.751 ± 0.402
1.831TyrGln: 1.831 ± 0.377
1.99TyrArg: 1.99 ± 0.39
2.547TyrSer: 2.547 ± 0.471
1.751TyrThr: 1.751 ± 0.323
2.945TyrVal: 2.945 ± 0.504
0.637TyrTrp: 0.637 ± 0.193
2.627TyrTyr: 2.627 ± 0.373
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (12563 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski