Amino acid dipepetide frequency for Streptococcus phage CHPC1046

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.09AlaAla: 3.09 ± 0.978
0.094AlaCys: 0.094 ± 0.093
3.746AlaAsp: 3.746 ± 0.47
4.495AlaGlu: 4.495 ± 0.715
2.716AlaPhe: 2.716 ± 0.643
3.465AlaGly: 3.465 ± 0.549
0.749AlaHis: 0.749 ± 0.26
4.495AlaIle: 4.495 ± 0.621
6.181AlaLys: 6.181 ± 0.795
6.181AlaLeu: 6.181 ± 0.754
1.217AlaMet: 1.217 ± 0.345
5.057AlaAsn: 5.057 ± 0.707
1.311AlaPro: 1.311 ± 0.311
3.09AlaGln: 3.09 ± 0.608
1.779AlaArg: 1.779 ± 0.37
4.495AlaSer: 4.495 ± 0.913
4.027AlaThr: 4.027 ± 0.727
4.402AlaVal: 4.402 ± 0.802
1.03AlaTrp: 1.03 ± 0.287
1.967AlaTyr: 1.967 ± 0.411
0.0AlaXaa: 0.0 ± 0.0
Cys
0.281CysAla: 0.281 ± 0.229
0.0CysCys: 0.0 ± 0.0
0.656CysAsp: 0.656 ± 0.25
0.375CysGlu: 0.375 ± 0.252
0.281CysPhe: 0.281 ± 0.136
0.187CysGly: 0.187 ± 0.125
0.187CysHis: 0.187 ± 0.17
0.0CysIle: 0.0 ± 0.0
0.656CysLys: 0.656 ± 0.25
0.468CysLeu: 0.468 ± 0.238
0.0CysMet: 0.0 ± 0.0
0.094CysAsn: 0.094 ± 0.085
0.0CysPro: 0.0 ± 0.0
0.094CysGln: 0.094 ± 0.088
0.187CysArg: 0.187 ± 0.177
0.749CysSer: 0.749 ± 0.318
0.375CysThr: 0.375 ± 0.209
0.281CysVal: 0.281 ± 0.148
0.187CysTrp: 0.187 ± 0.139
0.187CysTyr: 0.187 ± 0.127
0.0CysXaa: 0.0 ± 0.0
Asp
3.84AspAla: 3.84 ± 0.514
0.375AspCys: 0.375 ± 0.189
4.402AspAsp: 4.402 ± 0.604
4.87AspGlu: 4.87 ± 0.737
3.371AspPhe: 3.371 ± 0.447
5.713AspGly: 5.713 ± 0.941
1.124AspHis: 1.124 ± 0.334
5.338AspIle: 5.338 ± 0.649
4.963AspLys: 4.963 ± 0.485
4.402AspLeu: 4.402 ± 0.763
2.248AspMet: 2.248 ± 0.435
4.027AspAsn: 4.027 ± 0.667
2.435AspPro: 2.435 ± 0.497
1.311AspGln: 1.311 ± 0.306
3.184AspArg: 3.184 ± 0.494
2.81AspSer: 2.81 ± 0.51
3.652AspThr: 3.652 ± 0.533
2.716AspVal: 2.716 ± 0.506
0.749AspTrp: 0.749 ± 0.291
2.341AspTyr: 2.341 ± 0.416
0.0AspXaa: 0.0 ± 0.0
Glu
4.121GluAla: 4.121 ± 0.517
0.187GluCys: 0.187 ± 0.116
3.09GluAsp: 3.09 ± 0.582
5.619GluGlu: 5.619 ± 1.139
2.716GluPhe: 2.716 ± 0.636
3.09GluGly: 3.09 ± 0.463
1.405GluHis: 1.405 ± 0.331
6.368GluIle: 6.368 ± 0.995
4.402GluLys: 4.402 ± 1.03
6.556GluLeu: 6.556 ± 0.884
2.248GluMet: 2.248 ± 0.615
3.84GluAsn: 3.84 ± 0.736
1.498GluPro: 1.498 ± 0.398
3.933GluGln: 3.933 ± 0.752
3.278GluArg: 3.278 ± 0.509
3.933GluSer: 3.933 ± 0.595
4.308GluThr: 4.308 ± 0.721
5.057GluVal: 5.057 ± 0.673
0.937GluTrp: 0.937 ± 0.286
2.903GluTyr: 2.903 ± 0.545
0.0GluXaa: 0.0 ± 0.0
Phe
3.371PheAla: 3.371 ± 0.507
0.187PheCys: 0.187 ± 0.154
3.465PheAsp: 3.465 ± 0.566
2.248PheGlu: 2.248 ± 0.528
1.498PhePhe: 1.498 ± 0.324
2.903PheGly: 2.903 ± 0.688
0.749PheHis: 0.749 ± 0.22
3.09PheIle: 3.09 ± 0.488
4.027PheLys: 4.027 ± 0.59
3.278PheLeu: 3.278 ± 0.571
0.468PheMet: 0.468 ± 0.245
3.184PheAsn: 3.184 ± 0.581
0.749PhePro: 0.749 ± 0.224
1.124PheGln: 1.124 ± 0.249
1.217PheArg: 1.217 ± 0.365
2.997PheSer: 2.997 ± 0.464
2.716PheThr: 2.716 ± 0.572
2.997PheVal: 2.997 ± 0.536
0.656PheTrp: 0.656 ± 0.237
1.967PheTyr: 1.967 ± 0.487
0.0PheXaa: 0.0 ± 0.0
Gly
2.81GlyAla: 2.81 ± 0.677
0.375GlyCys: 0.375 ± 0.179
4.214GlyAsp: 4.214 ± 0.645
4.027GlyGlu: 4.027 ± 0.816
3.184GlyPhe: 3.184 ± 0.556
4.121GlyGly: 4.121 ± 0.824
0.843GlyHis: 0.843 ± 0.251
4.495GlyIle: 4.495 ± 0.746
5.9GlyLys: 5.9 ± 0.701
5.9GlyLeu: 5.9 ± 0.858
1.405GlyMet: 1.405 ± 0.378
3.84GlyAsn: 3.84 ± 0.725
1.03GlyPro: 1.03 ± 0.331
2.997GlyGln: 2.997 ± 0.469
2.435GlyArg: 2.435 ± 0.444
4.402GlySer: 4.402 ± 0.732
4.402GlyThr: 4.402 ± 0.791
3.09GlyVal: 3.09 ± 0.547
1.124GlyTrp: 1.124 ± 0.365
2.716GlyTyr: 2.716 ± 0.456
0.0GlyXaa: 0.0 ± 0.0
His
0.468HisAla: 0.468 ± 0.206
0.0HisCys: 0.0 ± 0.0
0.843HisAsp: 0.843 ± 0.269
0.562HisGlu: 0.562 ± 0.233
0.375HisPhe: 0.375 ± 0.185
0.937HisGly: 0.937 ± 0.309
0.375HisHis: 0.375 ± 0.171
0.843HisIle: 0.843 ± 0.326
1.124HisLys: 1.124 ± 0.333
1.498HisLeu: 1.498 ± 0.352
0.468HisMet: 0.468 ± 0.266
1.03HisAsn: 1.03 ± 0.327
0.656HisPro: 0.656 ± 0.233
0.749HisGln: 0.749 ± 0.28
0.749HisArg: 0.749 ± 0.239
0.843HisSer: 0.843 ± 0.297
0.937HisThr: 0.937 ± 0.261
1.405HisVal: 1.405 ± 0.239
0.094HisTrp: 0.094 ± 0.092
1.217HisTyr: 1.217 ± 0.41
0.0HisXaa: 0.0 ± 0.0
Ile
4.683IleAla: 4.683 ± 0.838
0.656IleCys: 0.656 ± 0.226
4.495IleAsp: 4.495 ± 0.595
5.151IleGlu: 5.151 ± 0.965
1.967IlePhe: 1.967 ± 0.396
4.776IleGly: 4.776 ± 0.481
0.843IleHis: 0.843 ± 0.274
3.465IleIle: 3.465 ± 0.627
6.93IleLys: 6.93 ± 0.76
3.559IleLeu: 3.559 ± 0.697
1.779IleMet: 1.779 ± 0.452
4.776IleAsn: 4.776 ± 0.644
3.09IlePro: 3.09 ± 0.535
3.09IleGln: 3.09 ± 0.425
2.248IleArg: 2.248 ± 0.543
4.121IleSer: 4.121 ± 0.583
3.652IleThr: 3.652 ± 0.528
3.371IleVal: 3.371 ± 0.594
1.03IleTrp: 1.03 ± 0.283
2.06IleTyr: 2.06 ± 0.444
0.0IleXaa: 0.0 ± 0.0
Lys
5.244LysAla: 5.244 ± 0.624
0.187LysCys: 0.187 ± 0.124
5.151LysAsp: 5.151 ± 0.748
7.586LysGlu: 7.586 ± 1.16
3.559LysPhe: 3.559 ± 0.68
4.963LysGly: 4.963 ± 0.607
1.217LysHis: 1.217 ± 0.341
5.619LysIle: 5.619 ± 0.647
7.024LysLys: 7.024 ± 1.159
7.117LysLeu: 7.117 ± 0.887
2.622LysMet: 2.622 ± 0.514
5.057LysAsn: 5.057 ± 0.863
3.559LysPro: 3.559 ± 0.632
4.87LysGln: 4.87 ± 0.604
3.559LysArg: 3.559 ± 0.57
4.308LysSer: 4.308 ± 0.611
4.776LysThr: 4.776 ± 0.731
4.308LysVal: 4.308 ± 0.629
1.03LysTrp: 1.03 ± 0.298
3.652LysTyr: 3.652 ± 0.649
0.0LysXaa: 0.0 ± 0.0
Leu
6.368LeuAla: 6.368 ± 0.751
0.281LeuCys: 0.281 ± 0.162
5.713LeuAsp: 5.713 ± 0.617
6.93LeuGlu: 6.93 ± 1.013
2.997LeuPhe: 2.997 ± 0.427
5.151LeuGly: 5.151 ± 0.957
0.749LeuHis: 0.749 ± 0.278
4.402LeuIle: 4.402 ± 0.628
7.679LeuLys: 7.679 ± 0.758
5.151LeuLeu: 5.151 ± 0.666
2.716LeuMet: 2.716 ± 0.474
5.057LeuAsn: 5.057 ± 0.834
2.435LeuPro: 2.435 ± 0.433
2.435LeuGln: 2.435 ± 0.445
3.371LeuArg: 3.371 ± 0.759
5.806LeuSer: 5.806 ± 0.891
5.9LeuThr: 5.9 ± 0.986
4.87LeuVal: 4.87 ± 0.733
0.562LeuTrp: 0.562 ± 0.228
2.435LeuTyr: 2.435 ± 0.566
0.0LeuXaa: 0.0 ± 0.0
Met
1.779MetAla: 1.779 ± 0.307
0.0MetCys: 0.0 ± 0.0
1.124MetAsp: 1.124 ± 0.273
1.498MetGlu: 1.498 ± 0.432
1.124MetPhe: 1.124 ± 0.29
0.843MetGly: 0.843 ± 0.344
0.281MetHis: 0.281 ± 0.167
1.592MetIle: 1.592 ± 0.362
2.622MetLys: 2.622 ± 0.633
2.248MetLeu: 2.248 ± 0.367
0.656MetMet: 0.656 ± 0.287
1.311MetAsn: 1.311 ± 0.313
0.749MetPro: 0.749 ± 0.202
0.937MetGln: 0.937 ± 0.282
0.937MetArg: 0.937 ± 0.227
1.873MetSer: 1.873 ± 0.375
1.873MetThr: 1.873 ± 0.344
1.592MetVal: 1.592 ± 0.405
0.281MetTrp: 0.281 ± 0.141
1.405MetTyr: 1.405 ± 0.405
0.0MetXaa: 0.0 ± 0.0
Asn
4.402AsnAla: 4.402 ± 1.01
0.562AsnCys: 0.562 ± 0.261
3.278AsnAsp: 3.278 ± 0.583
4.027AsnGlu: 4.027 ± 0.819
3.278AsnPhe: 3.278 ± 0.641
6.649AsnGly: 6.649 ± 1.239
1.311AsnHis: 1.311 ± 0.32
3.652AsnIle: 3.652 ± 0.541
4.402AsnLys: 4.402 ± 0.669
5.806AsnLeu: 5.806 ± 0.759
1.124AsnMet: 1.124 ± 0.266
3.933AsnAsn: 3.933 ± 0.901
2.529AsnPro: 2.529 ± 0.542
3.09AsnGln: 3.09 ± 0.423
1.873AsnArg: 1.873 ± 0.396
4.121AsnSer: 4.121 ± 0.658
3.652AsnThr: 3.652 ± 0.583
2.622AsnVal: 2.622 ± 0.536
1.498AsnTrp: 1.498 ± 0.36
2.06AsnTyr: 2.06 ± 0.364
0.0AsnXaa: 0.0 ± 0.0
Pro
1.967ProAla: 1.967 ± 0.364
0.094ProCys: 0.094 ± 0.105
1.686ProAsp: 1.686 ± 0.403
1.873ProGlu: 1.873 ± 0.428
1.124ProPhe: 1.124 ± 0.365
0.843ProGly: 0.843 ± 0.297
0.562ProHis: 0.562 ± 0.228
1.498ProIle: 1.498 ± 0.322
3.278ProLys: 3.278 ± 0.518
2.622ProLeu: 2.622 ± 0.386
0.281ProMet: 0.281 ± 0.164
2.903ProAsn: 2.903 ± 0.416
0.468ProPro: 0.468 ± 0.268
1.498ProGln: 1.498 ± 0.371
0.937ProArg: 0.937 ± 0.322
2.529ProSer: 2.529 ± 0.586
1.873ProThr: 1.873 ± 0.404
1.686ProVal: 1.686 ± 0.365
0.562ProTrp: 0.562 ± 0.205
1.311ProTyr: 1.311 ± 0.389
0.0ProXaa: 0.0 ± 0.0
Gln
4.027GlnAla: 4.027 ± 0.548
0.187GlnCys: 0.187 ± 0.131
2.248GlnAsp: 2.248 ± 0.438
2.903GlnGlu: 2.903 ± 0.686
1.311GlnPhe: 1.311 ± 0.32
3.09GlnGly: 3.09 ± 0.62
0.562GlnHis: 0.562 ± 0.224
2.716GlnIle: 2.716 ± 0.485
3.465GlnLys: 3.465 ± 0.675
3.746GlnLeu: 3.746 ± 0.506
1.967GlnMet: 1.967 ± 0.401
2.341GlnAsn: 2.341 ± 0.505
0.468GlnPro: 0.468 ± 0.216
3.84GlnGln: 3.84 ± 0.768
1.873GlnArg: 1.873 ± 0.37
2.622GlnSer: 2.622 ± 0.454
3.09GlnThr: 3.09 ± 0.562
1.967GlnVal: 1.967 ± 0.476
0.843GlnTrp: 0.843 ± 0.306
1.967GlnTyr: 1.967 ± 0.374
0.0GlnXaa: 0.0 ± 0.0
Arg
2.341ArgAla: 2.341 ± 0.53
0.094ArgCys: 0.094 ± 0.088
2.435ArgAsp: 2.435 ± 0.37
2.06ArgGlu: 2.06 ± 0.502
1.873ArgPhe: 1.873 ± 0.31
2.248ArgGly: 2.248 ± 0.604
0.656ArgHis: 0.656 ± 0.267
2.622ArgIle: 2.622 ± 0.543
2.997ArgLys: 2.997 ± 0.694
4.121ArgLeu: 4.121 ± 0.578
0.937ArgMet: 0.937 ± 0.313
2.435ArgAsn: 2.435 ± 0.388
0.937ArgPro: 0.937 ± 0.272
1.967ArgGln: 1.967 ± 0.399
1.311ArgArg: 1.311 ± 0.362
2.154ArgSer: 2.154 ± 0.483
2.622ArgThr: 2.622 ± 0.646
2.435ArgVal: 2.435 ± 0.475
1.03ArgTrp: 1.03 ± 0.26
2.154ArgTyr: 2.154 ± 0.515
0.0ArgXaa: 0.0 ± 0.0
Ser
3.746SerAla: 3.746 ± 0.546
0.562SerCys: 0.562 ± 0.24
4.402SerAsp: 4.402 ± 0.492
3.465SerGlu: 3.465 ± 0.564
2.81SerPhe: 2.81 ± 0.504
4.214SerGly: 4.214 ± 0.661
0.468SerHis: 0.468 ± 0.249
4.683SerIle: 4.683 ± 0.58
5.713SerLys: 5.713 ± 0.811
5.151SerLeu: 5.151 ± 0.635
1.498SerMet: 1.498 ± 0.38
4.495SerAsn: 4.495 ± 0.702
2.248SerPro: 2.248 ± 0.359
2.81SerGln: 2.81 ± 0.553
3.184SerArg: 3.184 ± 0.795
3.465SerSer: 3.465 ± 0.523
3.84SerThr: 3.84 ± 0.794
4.402SerVal: 4.402 ± 0.656
0.749SerTrp: 0.749 ± 0.285
2.529SerTyr: 2.529 ± 0.502
0.0SerXaa: 0.0 ± 0.0
Thr
4.121ThrAla: 4.121 ± 0.749
0.375ThrCys: 0.375 ± 0.196
3.746ThrAsp: 3.746 ± 0.677
3.746ThrGlu: 3.746 ± 0.599
3.278ThrPhe: 3.278 ± 0.564
3.559ThrGly: 3.559 ± 0.438
1.405ThrHis: 1.405 ± 0.394
4.87ThrIle: 4.87 ± 0.961
5.244ThrLys: 5.244 ± 0.697
6.087ThrLeu: 6.087 ± 0.714
0.843ThrMet: 0.843 ± 0.281
3.84ThrAsn: 3.84 ± 0.608
1.779ThrPro: 1.779 ± 0.447
2.622ThrGln: 2.622 ± 0.522
2.154ThrArg: 2.154 ± 0.392
3.933ThrSer: 3.933 ± 0.504
3.465ThrThr: 3.465 ± 0.78
4.121ThrVal: 4.121 ± 0.583
1.03ThrTrp: 1.03 ± 0.271
2.716ThrTyr: 2.716 ± 0.468
0.0ThrXaa: 0.0 ± 0.0
Val
3.84ValAla: 3.84 ± 0.738
0.468ValCys: 0.468 ± 0.205
5.057ValAsp: 5.057 ± 0.622
4.214ValGlu: 4.214 ± 0.677
2.716ValPhe: 2.716 ± 0.572
3.933ValGly: 3.933 ± 0.59
0.562ValHis: 0.562 ± 0.23
3.559ValIle: 3.559 ± 0.682
4.495ValLys: 4.495 ± 0.644
2.903ValLeu: 2.903 ± 0.532
0.843ValMet: 0.843 ± 0.272
3.933ValAsn: 3.933 ± 0.631
1.779ValPro: 1.779 ± 0.387
2.06ValGln: 2.06 ± 0.542
2.435ValArg: 2.435 ± 0.546
4.308ValSer: 4.308 ± 0.744
5.151ValThr: 5.151 ± 0.817
3.559ValVal: 3.559 ± 0.51
0.937ValTrp: 0.937 ± 0.316
2.06ValTyr: 2.06 ± 0.453
0.0ValXaa: 0.0 ± 0.0
Trp
0.562TrpAla: 0.562 ± 0.217
0.0TrpCys: 0.0 ± 0.0
1.217TrpAsp: 1.217 ± 0.423
0.749TrpGlu: 0.749 ± 0.236
0.562TrpPhe: 0.562 ± 0.257
0.562TrpGly: 0.562 ± 0.213
0.375TrpHis: 0.375 ± 0.183
0.843TrpIle: 0.843 ± 0.296
1.03TrpLys: 1.03 ± 0.319
1.311TrpLeu: 1.311 ± 0.392
0.187TrpMet: 0.187 ± 0.14
0.749TrpAsn: 0.749 ± 0.249
0.281TrpPro: 0.281 ± 0.159
0.656TrpGln: 0.656 ± 0.203
0.937TrpArg: 0.937 ± 0.347
1.686TrpSer: 1.686 ± 0.551
0.656TrpThr: 0.656 ± 0.23
1.498TrpVal: 1.498 ± 0.321
0.281TrpTrp: 0.281 ± 0.208
0.656TrpTyr: 0.656 ± 0.262
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.529TyrAla: 2.529 ± 0.569
0.562TyrCys: 0.562 ± 0.281
2.903TyrAsp: 2.903 ± 0.449
3.184TyrGlu: 3.184 ± 0.661
2.154TyrPhe: 2.154 ± 0.421
2.06TyrGly: 2.06 ± 0.416
0.749TyrHis: 0.749 ± 0.252
1.592TyrIle: 1.592 ± 0.295
3.278TyrLys: 3.278 ± 0.484
2.903TyrLeu: 2.903 ± 0.52
1.124TyrMet: 1.124 ± 0.395
1.967TyrAsn: 1.967 ± 0.368
1.592TyrPro: 1.592 ± 0.425
2.06TyrGln: 2.06 ± 0.346
1.779TyrArg: 1.779 ± 0.311
3.278TyrSer: 3.278 ± 0.616
1.967TyrThr: 1.967 ± 0.501
2.435TyrVal: 2.435 ± 0.478
0.187TyrTrp: 0.187 ± 0.136
2.154TyrTyr: 2.154 ± 0.602
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (10679 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski