Amino acid dipeptide frequency for Pectobacterium phage PP81

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain a fraction (for example, 11.051‰ corresponds to 0.011051).
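The calculation described above can be illustrated with a minimal Python sketch. It is not the database's own code: the function names `dipeptide_permille` and `classify` are hypothetical, sequences use one-letter codes rather than the three-letter codes of the table, and the sketch assumes that overlapping adjacent pairs are counted within each protein and normalized by the total number of pairs counted. The exact normalization used for the table may differ.

```python
from collections import Counter

# Expected value if all 400 standard dipeptides were equally likely: 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs, counted within one protein only
        # (no pair spans the boundary between two proteins).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    # Assumption: normalize by the number of pairs counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input (made up for illustration, not taken from the phage proteome).
toy_proteins = ["MAAK", "AAG"]
freqs = dipeptide_permille(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

Applied to values from the table, `classify(11.051)` labels AlaAla as overrepresented and `classify(0.868)` labels AlaCys as underrepresented relative to the 2.5‰ uniform expectation.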

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
11.051AlaAla: 11.051 ± 1.334
0.868AlaCys: 0.868 ± 0.313
6.157AlaAsp: 6.157 ± 0.761
6.473AlaGlu: 6.473 ± 0.924
3.158AlaPhe: 3.158 ± 0.602
6.947AlaGly: 6.947 ± 0.778
1.184AlaHis: 1.184 ± 0.412
4.5AlaIle: 4.5 ± 0.774
5.841AlaLys: 5.841 ± 0.627
8.525AlaLeu: 8.525 ± 0.876
2.684AlaMet: 2.684 ± 0.515
4.5AlaAsn: 4.5 ± 0.506
3.315AlaPro: 3.315 ± 0.527
3.394AlaGln: 3.394 ± 0.548
4.973AlaArg: 4.973 ± 0.915
6.315AlaSer: 6.315 ± 0.792
3.631AlaThr: 3.631 ± 0.456
6.078AlaVal: 6.078 ± 0.776
0.868AlaTrp: 0.868 ± 0.359
2.921AlaTyr: 2.921 ± 0.472
0.0AlaXaa: 0.0 ± 0.0
Cys
0.868CysAla: 0.868 ± 0.329
0.158CysCys: 0.158 ± 0.102
0.474CysAsp: 0.474 ± 0.208
0.632CysGlu: 0.632 ± 0.237
0.553CysPhe: 0.553 ± 0.232
0.632CysGly: 0.632 ± 0.259
0.237CysHis: 0.237 ± 0.138
0.632CysIle: 0.632 ± 0.252
0.395CysLys: 0.395 ± 0.253
0.632CysLeu: 0.632 ± 0.296
0.158CysMet: 0.158 ± 0.114
0.237CysAsn: 0.237 ± 0.135
0.632CysPro: 0.632 ± 0.232
0.316CysGln: 0.316 ± 0.154
0.868CysArg: 0.868 ± 0.384
0.395CysSer: 0.395 ± 0.164
0.158CysThr: 0.158 ± 0.127
0.474CysVal: 0.474 ± 0.237
0.158CysTrp: 0.158 ± 0.102
0.632CysTyr: 0.632 ± 0.289
0.0CysXaa: 0.0 ± 0.0
Asp
5.052AspAla: 5.052 ± 0.593
0.395AspCys: 0.395 ± 0.229
3.473AspAsp: 3.473 ± 0.452
3.315AspGlu: 3.315 ± 0.452
3.237AspPhe: 3.237 ± 0.543
6.631AspGly: 6.631 ± 0.726
0.789AspHis: 0.789 ± 0.225
2.605AspIle: 2.605 ± 0.419
4.026AspLys: 4.026 ± 0.409
3.0AspLeu: 3.0 ± 0.466
2.052AspMet: 2.052 ± 0.343
2.763AspAsn: 2.763 ± 0.427
2.921AspPro: 2.921 ± 0.616
1.816AspGln: 1.816 ± 0.341
3.394AspArg: 3.394 ± 0.519
4.342AspSer: 4.342 ± 0.625
3.552AspThr: 3.552 ± 0.637
5.052AspVal: 5.052 ± 0.658
0.71AspTrp: 0.71 ± 0.218
2.921AspTyr: 2.921 ± 0.499
0.0AspXaa: 0.0 ± 0.0
Glu
7.42GluAla: 7.42 ± 0.578
0.71GluCys: 0.71 ± 0.207
3.868GluAsp: 3.868 ± 0.534
4.894GluGlu: 4.894 ± 0.971
3.0GluPhe: 3.0 ± 0.358
4.105GluGly: 4.105 ± 0.638
1.263GluHis: 1.263 ± 0.398
2.605GluIle: 2.605 ± 0.427
3.158GluLys: 3.158 ± 0.498
5.052GluLeu: 5.052 ± 0.619
1.658GluMet: 1.658 ± 0.455
2.368GluAsn: 2.368 ± 0.381
1.579GluPro: 1.579 ± 0.439
2.131GluGln: 2.131 ± 0.545
3.789GluArg: 3.789 ± 0.53
4.815GluSer: 4.815 ± 0.68
4.105GluThr: 4.105 ± 0.502
4.026GluVal: 4.026 ± 0.583
0.71GluTrp: 0.71 ± 0.286
2.763GluTyr: 2.763 ± 0.464
0.0GluXaa: 0.0 ± 0.0
Phe
2.526PheAla: 2.526 ± 0.398
0.474PheCys: 0.474 ± 0.227
3.0PheAsp: 3.0 ± 0.465
1.579PheGlu: 1.579 ± 0.364
0.947PhePhe: 0.947 ± 0.238
2.842PheGly: 2.842 ± 0.519
1.026PheHis: 1.026 ± 0.304
2.052PheIle: 2.052 ± 0.538
2.684PheLys: 2.684 ± 0.534
3.315PheLeu: 3.315 ± 0.592
1.026PheMet: 1.026 ± 0.303
1.737PheAsn: 1.737 ± 0.371
1.973PhePro: 1.973 ± 0.345
1.421PheGln: 1.421 ± 0.337
2.052PheArg: 2.052 ± 0.385
2.289PheSer: 2.289 ± 0.364
3.868PheThr: 3.868 ± 0.639
2.447PheVal: 2.447 ± 0.329
0.632PheTrp: 0.632 ± 0.194
1.342PheTyr: 1.342 ± 0.254
0.0PheXaa: 0.0 ± 0.0
Gly
7.026GlyAla: 7.026 ± 0.851
1.026GlyCys: 1.026 ± 0.379
5.052GlyAsp: 5.052 ± 0.722
4.026GlyGlu: 4.026 ± 0.491
3.631GlyPhe: 3.631 ± 0.478
5.131GlyGly: 5.131 ± 0.765
1.184GlyHis: 1.184 ± 0.276
3.947GlyIle: 3.947 ± 0.441
5.052GlyLys: 5.052 ± 0.75
7.105GlyLeu: 7.105 ± 0.645
1.658GlyMet: 1.658 ± 0.312
4.263GlyAsn: 4.263 ± 0.607
0.71GlyPro: 0.71 ± 0.234
2.842GlyGln: 2.842 ± 0.486
3.868GlyArg: 3.868 ± 0.501
4.894GlySer: 4.894 ± 0.57
4.342GlyThr: 4.342 ± 0.674
4.815GlyVal: 4.815 ± 0.51
1.737GlyTrp: 1.737 ± 0.404
3.552GlyTyr: 3.552 ± 0.69
0.0GlyXaa: 0.0 ± 0.0
His
1.184HisAla: 1.184 ± 0.348
0.395HisCys: 0.395 ± 0.162
0.868HisAsp: 0.868 ± 0.279
1.026HisGlu: 1.026 ± 0.277
0.632HisPhe: 0.632 ± 0.169
1.184HisGly: 1.184 ± 0.3
0.237HisHis: 0.237 ± 0.143
1.263HisIle: 1.263 ± 0.325
1.184HisLys: 1.184 ± 0.362
2.131HisLeu: 2.131 ± 0.44
0.789HisMet: 0.789 ± 0.215
0.789HisAsn: 0.789 ± 0.26
0.474HisPro: 0.474 ± 0.152
0.237HisGln: 0.237 ± 0.125
0.868HisArg: 0.868 ± 0.301
1.5HisSer: 1.5 ± 0.427
0.868HisThr: 0.868 ± 0.326
1.579HisVal: 1.579 ± 0.378
0.553HisTrp: 0.553 ± 0.198
0.789HisTyr: 0.789 ± 0.255
0.0HisXaa: 0.0 ± 0.0
Ile
5.052IleAla: 5.052 ± 0.704
0.553IleCys: 0.553 ± 0.229
2.605IleAsp: 2.605 ± 0.412
3.552IleGlu: 3.552 ± 0.601
1.184IlePhe: 1.184 ± 0.271
3.868IleGly: 3.868 ± 0.596
1.263IleHis: 1.263 ± 0.27
2.842IleIle: 2.842 ± 0.558
3.394IleLys: 3.394 ± 0.544
4.578IleLeu: 4.578 ± 0.883
1.342IleMet: 1.342 ± 0.313
2.21IleAsn: 2.21 ± 0.459
2.526IlePro: 2.526 ± 0.442
1.816IleGln: 1.816 ± 0.368
3.158IleArg: 3.158 ± 0.454
2.131IleSer: 2.131 ± 0.45
3.237IleThr: 3.237 ± 0.528
2.368IleVal: 2.368 ± 0.449
0.474IleTrp: 0.474 ± 0.174
1.895IleTyr: 1.895 ± 0.476
0.0IleXaa: 0.0 ± 0.0
Lys
7.815LysAla: 7.815 ± 0.933
0.474LysCys: 0.474 ± 0.24
3.394LysAsp: 3.394 ± 0.489
4.578LysGlu: 4.578 ± 0.839
3.315LysPhe: 3.315 ± 0.584
5.526LysGly: 5.526 ± 0.609
1.658LysHis: 1.658 ± 0.45
2.21LysIle: 2.21 ± 0.346
3.631LysLys: 3.631 ± 0.501
5.605LysLeu: 5.605 ± 0.79
1.579LysMet: 1.579 ± 0.43
2.21LysAsn: 2.21 ± 0.5
2.131LysPro: 2.131 ± 0.485
2.921LysGln: 2.921 ± 0.501
2.921LysArg: 2.921 ± 0.52
3.71LysSer: 3.71 ± 0.513
2.605LysThr: 2.605 ± 0.47
5.368LysVal: 5.368 ± 0.78
0.71LysTrp: 0.71 ± 0.194
2.21LysTyr: 2.21 ± 0.423
0.0LysXaa: 0.0 ± 0.0
Leu
7.894LeuAla: 7.894 ± 1.187
0.632LeuCys: 0.632 ± 0.216
4.342LeuAsp: 4.342 ± 0.689
6.157LeuGlu: 6.157 ± 0.741
2.289LeuPhe: 2.289 ± 0.371
5.999LeuGly: 5.999 ± 0.63
1.342LeuHis: 1.342 ± 0.343
4.342LeuIle: 4.342 ± 0.735
6.394LeuLys: 6.394 ± 0.779
6.236LeuLeu: 6.236 ± 0.7
3.158LeuMet: 3.158 ± 0.463
4.5LeuAsn: 4.5 ± 0.399
2.605LeuPro: 2.605 ± 0.539
3.079LeuGln: 3.079 ± 0.555
5.841LeuArg: 5.841 ± 0.451
5.131LeuSer: 5.131 ± 0.556
4.184LeuThr: 4.184 ± 0.564
4.578LeuVal: 4.578 ± 0.522
1.421LeuTrp: 1.421 ± 0.368
2.447LeuTyr: 2.447 ± 0.467
0.0LeuXaa: 0.0 ± 0.0
Met
3.394MetAla: 3.394 ± 0.486
0.079MetCys: 0.079 ± 0.079
1.5MetAsp: 1.5 ± 0.457
1.579MetGlu: 1.579 ± 0.347
1.105MetPhe: 1.105 ± 0.311
1.895MetGly: 1.895 ± 0.396
0.553MetHis: 0.553 ± 0.195
1.026MetIle: 1.026 ± 0.342
1.5MetLys: 1.5 ± 0.294
2.684MetLeu: 2.684 ± 0.402
0.553MetMet: 0.553 ± 0.226
1.816MetAsn: 1.816 ± 0.387
1.026MetPro: 1.026 ± 0.317
1.342MetGln: 1.342 ± 0.378
1.5MetArg: 1.5 ± 0.378
1.342MetSer: 1.342 ± 0.404
1.895MetThr: 1.895 ± 0.344
2.131MetVal: 2.131 ± 0.353
0.237MetTrp: 0.237 ± 0.141
0.553MetTyr: 0.553 ± 0.179
0.0MetXaa: 0.0 ± 0.0
Asn
3.079AsnAla: 3.079 ± 0.574
0.395AsnCys: 0.395 ± 0.211
4.105AsnAsp: 4.105 ± 0.733
2.447AsnGlu: 2.447 ± 0.358
1.5AsnPhe: 1.5 ± 0.36
4.263AsnGly: 4.263 ± 0.692
0.789AsnHis: 0.789 ± 0.266
2.447AsnIle: 2.447 ± 0.461
3.079AsnLys: 3.079 ± 0.532
3.868AsnLeu: 3.868 ± 0.582
0.789AsnMet: 0.789 ± 0.258
1.816AsnAsn: 1.816 ± 0.378
3.552AsnPro: 3.552 ± 0.618
1.737AsnGln: 1.737 ± 0.398
2.447AsnArg: 2.447 ± 0.423
1.895AsnSer: 1.895 ± 0.343
3.158AsnThr: 3.158 ± 0.583
3.71AsnVal: 3.71 ± 0.419
0.553AsnTrp: 0.553 ± 0.171
1.5AsnTyr: 1.5 ± 0.381
0.0AsnXaa: 0.0 ± 0.0
Pro
2.763ProAla: 2.763 ± 0.351
0.237ProCys: 0.237 ± 0.127
2.21ProAsp: 2.21 ± 0.344
4.105ProGlu: 4.105 ± 0.545
1.421ProPhe: 1.421 ± 0.321
0.0ProGly: 0.0 ± 0.0
0.474ProHis: 0.474 ± 0.203
1.658ProIle: 1.658 ± 0.24
3.079ProLys: 3.079 ± 0.582
2.526ProLeu: 2.526 ± 0.369
0.947ProMet: 0.947 ± 0.328
2.921ProAsn: 2.921 ± 0.538
0.947ProPro: 0.947 ± 0.284
1.579ProGln: 1.579 ± 0.33
1.658ProArg: 1.658 ± 0.325
2.21ProSer: 2.21 ± 0.425
1.895ProThr: 1.895 ± 0.406
3.0ProVal: 3.0 ± 0.487
0.632ProTrp: 0.632 ± 0.187
1.579ProTyr: 1.579 ± 0.432
0.0ProXaa: 0.0 ± 0.0
Gln
4.184GlnAla: 4.184 ± 0.614
0.395GlnCys: 0.395 ± 0.17
2.684GlnAsp: 2.684 ± 0.686
2.052GlnGlu: 2.052 ± 0.472
1.973GlnPhe: 1.973 ± 0.285
2.842GlnGly: 2.842 ± 0.481
0.868GlnHis: 0.868 ± 0.263
2.21GlnIle: 2.21 ± 0.419
1.737GlnLys: 1.737 ± 0.334
3.158GlnLeu: 3.158 ± 0.519
0.71GlnMet: 0.71 ± 0.291
1.263GlnAsn: 1.263 ± 0.312
1.895GlnPro: 1.895 ± 0.333
2.289GlnGln: 2.289 ± 0.469
2.131GlnArg: 2.131 ± 0.435
2.605GlnSer: 2.605 ± 0.5
1.421GlnThr: 1.421 ± 0.309
2.21GlnVal: 2.21 ± 0.36
0.789GlnTrp: 0.789 ± 0.274
1.263GlnTyr: 1.263 ± 0.425
0.0GlnXaa: 0.0 ± 0.0
Arg
4.5ArgAla: 4.5 ± 0.59
0.632ArgCys: 0.632 ± 0.287
4.105ArgAsp: 4.105 ± 0.542
3.868ArgGlu: 3.868 ± 0.54
1.5ArgPhe: 1.5 ± 0.343
4.342ArgGly: 4.342 ± 0.463
0.71ArgHis: 0.71 ± 0.21
2.368ArgIle: 2.368 ± 0.41
3.789ArgLys: 3.789 ± 0.549
5.447ArgLeu: 5.447 ± 0.586
1.895ArgMet: 1.895 ± 0.341
1.973ArgAsn: 1.973 ± 0.38
1.263ArgPro: 1.263 ± 0.296
2.052ArgGln: 2.052 ± 0.519
2.131ArgArg: 2.131 ± 0.487
3.789ArgSer: 3.789 ± 0.632
3.158ArgThr: 3.158 ± 0.416
3.789ArgVal: 3.789 ± 0.59
0.868ArgTrp: 0.868 ± 0.254
1.5ArgTyr: 1.5 ± 0.329
0.0ArgXaa: 0.0 ± 0.0
Ser
5.763SerAla: 5.763 ± 0.491
0.868SerCys: 0.868 ± 0.304
3.237SerAsp: 3.237 ± 0.486
2.526SerGlu: 2.526 ± 0.512
3.237SerPhe: 3.237 ± 0.44
6.078SerGly: 6.078 ± 0.666
1.579SerHis: 1.579 ± 0.419
3.631SerIle: 3.631 ± 0.63
3.473SerLys: 3.473 ± 0.525
3.71SerLeu: 3.71 ± 0.454
1.737SerMet: 1.737 ± 0.307
2.605SerAsn: 2.605 ± 0.42
1.737SerPro: 1.737 ± 0.341
2.684SerGln: 2.684 ± 0.498
3.0SerArg: 3.0 ± 0.566
3.71SerSer: 3.71 ± 0.464
3.158SerThr: 3.158 ± 0.442
4.736SerVal: 4.736 ± 0.425
0.71SerTrp: 0.71 ± 0.215
2.684SerTyr: 2.684 ± 0.556
0.0SerXaa: 0.0 ± 0.0
Thr
4.657ThrAla: 4.657 ± 0.705
0.395ThrCys: 0.395 ± 0.219
3.079ThrAsp: 3.079 ± 0.561
3.237ThrGlu: 3.237 ± 0.39
2.289ThrPhe: 2.289 ± 0.502
4.342ThrGly: 4.342 ± 0.488
1.026ThrHis: 1.026 ± 0.227
3.552ThrIle: 3.552 ± 0.73
4.736ThrLys: 4.736 ± 0.604
4.973ThrLeu: 4.973 ± 0.678
0.868ThrMet: 0.868 ± 0.272
2.289ThrAsn: 2.289 ± 0.44
3.0ThrPro: 3.0 ± 0.484
1.973ThrGln: 1.973 ± 0.326
2.605ThrArg: 2.605 ± 0.511
3.631ThrSer: 3.631 ± 0.573
3.237ThrThr: 3.237 ± 0.647
4.184ThrVal: 4.184 ± 0.534
0.71ThrTrp: 0.71 ± 0.2
1.579ThrTyr: 1.579 ± 0.411
0.0ThrXaa: 0.0 ± 0.0
Val
5.605ValAla: 5.605 ± 0.564
0.316ValCys: 0.316 ± 0.194
4.105ValAsp: 4.105 ± 0.611
4.263ValGlu: 4.263 ± 0.732
2.763ValPhe: 2.763 ± 0.629
4.578ValGly: 4.578 ± 0.696
1.184ValHis: 1.184 ± 0.407
3.789ValIle: 3.789 ± 0.544
4.894ValLys: 4.894 ± 0.57
5.289ValLeu: 5.289 ± 0.754
2.21ValMet: 2.21 ± 0.409
4.657ValAsn: 4.657 ± 0.734
2.131ValPro: 2.131 ± 0.435
3.158ValGln: 3.158 ± 0.453
3.631ValArg: 3.631 ± 0.532
3.789ValSer: 3.789 ± 0.564
5.289ValThr: 5.289 ± 0.603
5.92ValVal: 5.92 ± 0.932
0.789ValTrp: 0.789 ± 0.236
2.368ValTyr: 2.368 ± 0.423
0.0ValXaa: 0.0 ± 0.0
Trp
1.105TrpAla: 1.105 ± 0.387
0.158TrpCys: 0.158 ± 0.111
0.789TrpAsp: 0.789 ± 0.212
0.947TrpGlu: 0.947 ± 0.321
0.474TrpPhe: 0.474 ± 0.236
1.105TrpGly: 1.105 ± 0.272
0.395TrpHis: 0.395 ± 0.208
0.553TrpIle: 0.553 ± 0.225
0.789TrpLys: 0.789 ± 0.311
1.5TrpLeu: 1.5 ± 0.333
0.553TrpMet: 0.553 ± 0.224
0.789TrpAsn: 0.789 ± 0.247
0.237TrpPro: 0.237 ± 0.126
0.395TrpGln: 0.395 ± 0.2
1.184TrpArg: 1.184 ± 0.278
0.789TrpSer: 0.789 ± 0.238
0.789TrpThr: 0.789 ± 0.265
1.105TrpVal: 1.105 ± 0.453
0.316TrpTrp: 0.316 ± 0.131
0.079TrpTyr: 0.079 ± 0.081
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.447TyrAla: 2.447 ± 0.575
0.079TyrCys: 0.079 ± 0.081
3.0TyrAsp: 3.0 ± 0.538
2.763TyrGlu: 2.763 ± 0.33
1.026TyrPhe: 1.026 ± 0.263
3.552TyrGly: 3.552 ± 0.387
0.868TyrHis: 0.868 ± 0.374
1.816TyrIle: 1.816 ± 0.561
1.737TyrLys: 1.737 ± 0.427
3.237TyrLeu: 3.237 ± 0.625
1.263TyrMet: 1.263 ± 0.313
1.421TyrAsn: 1.421 ± 0.334
1.184TyrPro: 1.184 ± 0.309
1.5TyrGln: 1.5 ± 0.372
1.658TyrArg: 1.658 ± 0.327
1.579TyrSer: 1.579 ± 0.322
1.816TyrThr: 1.816 ± 0.376
3.158TyrVal: 3.158 ± 0.613
0.474TyrTrp: 0.474 ± 0.225
0.868TyrTyr: 0.868 ± 0.283
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (12669 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
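A protein-level bootstrap can be sketched as follows. This is only an illustration under stated assumptions, not the procedure used to produce the table: the helper names `pair_permille` and `bootstrap_error` are hypothetical, whole proteins are resampled with replacement, and the reported error is assumed to be the sample standard deviation across replicates.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille over overlapping adjacent pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the error is the standard deviation across replicates.
    return statistics.stdev(estimates)


# Toy input (made up for illustration, not taken from the phage proteome).
toy_proteins = ["MAAK", "AAG", "MKKA", "GAAAL"]
print(pair_permille(toy_proteins, "AA"), bootstrap_error(toy_proteins, "AA"))
```

Resampling whole proteins rather than individual residues keeps the pairs within each protein intact, so the spread reflects how much the estimate depends on which proteins are in the set.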

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski