Amino acid dipeptide frequency for Propionibacterium phage phiPA50S

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
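As a rough illustration of how such frequencies can be obtained, the following minimal sketch counts overlapping residue pairs. It assumes one-letter amino acid codes, counts pooled over all proteins, and pairs that never span two proteins; the function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the exact procedure used by Proteome-pI may differ.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences (not phage data), only to show the call.
toy_proteome = ["MAAG", "MAG"]
freqs = dipeptide_per_mille(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

By construction the returned values sum to 1000‰ over all observed dipeptides.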

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
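The arithmetic behind the expected value can be sketched as follows. The two example values are taken from the table (AlaAla and CysCys); the helper names are hypothetical, and using the 400-dipeptide expectation as the over/under threshold is an assumption based on the 0.25% figure quoted above, not a documented rule of the page's colouring.

```python
def uniform_expectation_per_mille(alphabet_size):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation."""
    if value_per_mille > expected_per_mille:
        return "over"
    if value_per_mille < expected_per_mille:
        return "under"
    return "expected"


expected_400 = uniform_expectation_per_mille(20)  # 20 standard amino acids
expected_441 = uniform_expectation_per_mille(21)  # 20 standard + Xaa

# Two values copied from the table, in per mille.
observed = {"AlaAla": 10.757, "CysCys": 0.242}
labels = {name: classify(v, expected_400) for name, v in observed.items()}
print(expected_400, round(expected_441, 3), labels)
```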

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

| First↓ / Second→ | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 10.757 ± 1.565 | 0.846 ± 0.278 | 6.647 ± 1.185 | 5.68 ± 0.965 | 3.022 ± 0.601 | 10.636 ± 1.48 | 1.934 ± 0.509 | 3.868 ± 0.726 | 3.988 ± 0.554 | 7.493 ± 1.322 | 4.351 ± 1.288 | 2.78 ± 0.705 | 2.78 ± 0.512 | 3.868 ± 0.604 | 6.768 ± 1.326 | 6.406 ± 0.856 | 6.285 ± 1.101 | 10.394 ± 1.92 | 1.209 ± 0.362 | 2.78 ± 0.401 | 0.0 ± 0.0 |
| Cys | 0.967 ± 0.342 | 0.242 ± 0.186 | 1.088 ± 0.328 | 1.088 ± 0.362 | 0.0 ± 0.0 | 1.088 ± 0.391 | 0.242 ± 0.174 | 0.363 ± 0.201 | 0.483 ± 0.289 | 0.363 ± 0.235 | 0.0 ± 0.0 | 0.121 ± 0.127 | 0.604 ± 0.229 | 0.483 ± 0.225 | 1.329 ± 0.439 | 0.604 ± 0.31 | 0.846 ± 0.303 | 0.846 ± 0.372 | 0.242 ± 0.157 | 0.363 ± 0.221 | 0.0 ± 0.0 |
| Asp | 5.076 ± 0.875 | 0.967 ± 0.31 | 5.197 ± 0.955 | 3.626 ± 0.782 | 1.692 ± 0.409 | 7.493 ± 2.242 | 2.296 ± 0.587 | 2.659 ± 0.706 | 2.296 ± 0.47 | 4.593 ± 1.0 | 1.329 ± 0.411 | 3.263 ± 0.8 | 3.868 ± 0.968 | 2.296 ± 0.565 | 3.626 ± 0.656 | 2.901 ± 0.622 | 4.714 ± 0.736 | 5.197 ± 0.731 | 1.329 ± 0.349 | 2.417 ± 0.611 | 0.0 ± 0.0 |
| Glu | 6.406 ± 1.059 | 0.604 ± 0.294 | 2.055 ± 0.62 | 2.659 ± 0.648 | 1.329 ± 0.347 | 3.626 ± 0.547 | 0.967 ± 0.366 | 2.175 ± 0.424 | 1.692 ± 0.523 | 3.747 ± 0.72 | 1.088 ± 0.327 | 1.209 ± 0.378 | 1.813 ± 0.585 | 2.538 ± 0.721 | 2.901 ± 0.583 | 4.351 ± 0.912 | 2.78 ± 0.823 | 3.142 ± 0.662 | 2.055 ± 0.561 | 1.813 ± 0.517 | 0.0 ± 0.0 |
| Phe | 2.78 ± 1.079 | 0.242 ± 0.151 | 1.934 ± 0.53 | 1.45 ± 0.403 | 1.088 ± 0.357 | 2.901 ± 0.54 | 0.604 ± 0.263 | 0.967 ± 0.31 | 1.329 ± 0.404 | 1.813 ± 0.46 | 0.846 ± 0.3 | 0.846 ± 0.431 | 1.329 ± 0.307 | 0.846 ± 0.248 | 2.175 ± 0.543 | 2.296 ± 0.498 | 1.692 ± 0.428 | 1.813 ± 0.603 | 0.604 ± 0.257 | 0.121 ± 0.105 | 0.0 ± 0.0 |
| Gly | 7.252 ± 1.476 | 0.725 ± 0.3 | 6.164 ± 0.945 | 4.351 ± 0.732 | 3.022 ± 0.991 | 7.252 ± 1.237 | 1.209 ± 0.367 | 2.901 ± 0.696 | 4.714 ± 0.646 | 8.098 ± 1.277 | 1.934 ± 0.483 | 2.538 ± 0.496 | 4.351 ± 1.313 | 3.505 ± 0.535 | 5.439 ± 1.306 | 6.889 ± 1.047 | 4.351 ± 0.907 | 9.065 ± 1.397 | 1.934 ± 0.535 | 2.901 ± 0.649 | 0.0 ± 0.0 |
| His | 1.934 ± 0.714 | 0.725 ± 0.252 | 1.209 ± 0.44 | 0.483 ± 0.251 | 0.725 ± 0.386 | 1.813 ± 0.472 | 1.692 ± 0.541 | 1.571 ± 0.534 | 1.329 ± 0.478 | 2.659 ± 0.698 | 0.121 ± 0.137 | 1.088 ± 0.368 | 1.45 ± 0.456 | 1.209 ± 0.466 | 1.209 ± 0.42 | 0.846 ± 0.298 | 1.934 ± 0.508 | 1.692 ± 0.52 | 0.242 ± 0.185 | 0.846 ± 0.409 | 0.0 ± 0.0 |
| Ile | 4.593 ± 0.745 | 0.846 ± 0.284 | 5.439 ± 0.84 | 3.022 ± 0.565 | 1.45 ± 0.426 | 2.417 ± 0.679 | 1.571 ± 0.502 | 3.263 ± 0.861 | 1.45 ± 0.407 | 2.901 ± 0.426 | 1.088 ± 0.309 | 2.055 ± 0.55 | 2.538 ± 0.647 | 1.813 ± 0.409 | 2.659 ± 0.644 | 2.659 ± 0.586 | 3.505 ± 0.737 | 2.78 ± 0.414 | 0.363 ± 0.256 | 1.088 ± 0.345 | 0.0 ± 0.0 |
| Lys | 4.714 ± 0.922 | 0.242 ± 0.159 | 2.901 ± 0.691 | 1.088 ± 0.417 | 0.363 ± 0.2 | 3.747 ± 0.577 | 0.846 ± 0.399 | 1.329 ± 0.425 | 2.175 ± 0.57 | 3.505 ± 0.472 | 0.846 ± 0.29 | 1.813 ± 0.389 | 3.263 ± 0.74 | 2.296 ± 0.676 | 3.022 ± 0.678 | 1.934 ± 0.726 | 3.142 ± 0.66 | 1.813 ± 0.396 | 0.846 ± 0.285 | 0.967 ± 0.362 | 0.0 ± 0.0 |
| Leu | 9.79 ± 0.996 | 1.45 ± 0.508 | 5.318 ± 1.129 | 3.505 ± 0.72 | 1.692 ± 0.529 | 5.801 ± 1.314 | 2.417 ± 0.512 | 3.384 ± 0.73 | 4.109 ± 0.732 | 4.834 ± 0.745 | 1.813 ± 0.554 | 2.659 ± 0.457 | 3.626 ± 0.533 | 3.384 ± 0.645 | 3.142 ± 0.57 | 5.439 ± 0.715 | 4.23 ± 0.725 | 4.472 ± 0.829 | 1.209 ± 0.356 | 1.934 ± 0.584 | 0.0 ± 0.0 |
| Met | 3.142 ± 0.645 | 0.483 ± 0.219 | 1.329 ± 0.416 | 0.846 ± 0.383 | 0.604 ± 0.249 | 1.813 ± 0.637 | 0.725 ± 0.283 | 2.417 ± 0.585 | 0.967 ± 0.332 | 2.538 ± 0.514 | 0.967 ± 0.424 | 0.846 ± 0.311 | 2.175 ± 0.767 | 1.088 ± 0.367 | 1.329 ± 0.364 | 1.813 ± 0.496 | 1.088 ± 0.365 | 1.329 ± 0.388 | 0.483 ± 0.253 | 1.088 ± 0.347 | 0.0 ± 0.0 |
| Asn | 2.538 ± 0.971 | 0.121 ± 0.105 | 1.45 ± 0.432 | 1.329 ± 0.318 | 0.604 ± 0.242 | 4.714 ± 0.66 | 0.846 ± 0.331 | 2.175 ± 0.495 | 1.209 ± 0.476 | 2.296 ± 0.526 | 1.088 ± 0.255 | 2.296 ± 0.751 | 2.296 ± 0.491 | 1.209 ± 0.402 | 1.934 ± 0.432 | 1.934 ± 0.582 | 2.417 ± 0.707 | 3.022 ± 0.621 | 0.604 ± 0.277 | 0.604 ± 0.312 | 0.0 ± 0.0 |
| Pro | 5.439 ± 0.912 | 0.363 ± 0.228 | 4.351 ± 0.74 | 2.901 ± 0.737 | 1.088 ± 0.401 | 5.318 ± 1.015 | 0.967 ± 0.371 | 1.813 ± 0.451 | 1.813 ± 0.438 | 2.901 ± 0.516 | 0.604 ± 0.247 | 1.934 ± 0.526 | 2.901 ± 0.728 | 2.055 ± 0.696 | 1.45 ± 0.473 | 3.142 ± 0.551 | 2.417 ± 0.653 | 5.318 ± 1.455 | 1.692 ± 0.591 | 1.088 ± 0.379 | 0.0 ± 0.0 |
| Gln | 4.351 ± 0.834 | 0.363 ± 0.274 | 1.692 ± 0.321 | 1.209 ± 0.392 | 0.725 ± 0.235 | 2.417 ± 0.699 | 1.692 ± 0.594 | 2.538 ± 0.775 | 1.329 ± 0.333 | 4.472 ± 0.787 | 1.45 ± 0.469 | 0.967 ± 0.281 | 2.296 ± 0.615 | 2.901 ± 0.664 | 2.901 ± 0.577 | 2.417 ± 0.481 | 2.659 ± 0.719 | 2.659 ± 0.554 | 0.967 ± 0.369 | 1.329 ± 0.37 | 0.0 ± 0.0 |
| Arg | 6.647 ± 1.048 | 0.483 ± 0.24 | 3.868 ± 0.733 | 2.901 ± 0.628 | 2.417 ± 0.558 | 4.351 ± 0.81 | 1.209 ± 0.365 | 3.263 ± 0.576 | 2.175 ± 0.616 | 6.043 ± 0.939 | 2.175 ± 0.678 | 1.934 ± 0.563 | 1.329 ± 0.394 | 2.296 ± 0.593 | 4.955 ± 0.938 | 4.472 ± 0.808 | 2.175 ± 0.6 | 5.801 ± 0.87 | 0.725 ± 0.346 | 2.055 ± 0.445 | 0.0 ± 0.0 |
| Ser | 7.01 ± 1.252 | 0.483 ± 0.22 | 3.988 ± 0.654 | 3.022 ± 0.542 | 3.384 ± 0.541 | 8.219 ± 1.5 | 1.209 ± 0.315 | 2.901 ± 0.482 | 2.055 ± 0.617 | 4.109 ± 0.644 | 2.296 ± 0.39 | 2.417 ± 0.468 | 2.417 ± 0.585 | 2.055 ± 0.476 | 3.868 ± 0.848 | 4.109 ± 0.909 | 2.538 ± 0.58 | 6.526 ± 1.337 | 1.813 ± 0.445 | 0.604 ± 0.242 | 0.0 ± 0.0 |
| Thr | 6.406 ± 1.023 | 0.604 ± 0.253 | 3.626 ± 0.599 | 2.538 ± 0.484 | 1.934 ± 0.427 | 5.076 ± 0.715 | 1.45 ± 0.474 | 4.714 ± 0.806 | 1.813 ± 0.487 | 3.505 ± 0.833 | 0.846 ± 0.309 | 1.329 ± 0.338 | 3.505 ± 0.489 | 2.901 ± 0.805 | 3.384 ± 0.644 | 3.868 ± 0.779 | 3.747 ± 1.21 | 5.801 ± 0.891 | 0.725 ± 0.318 | 1.813 ± 0.616 | 0.0 ± 0.0 |
| Val | 8.944 ± 2.069 | 1.088 ± 0.339 | 5.439 ± 0.914 | 4.955 ± 0.953 | 1.934 ± 0.6 | 5.68 ± 1.192 | 1.692 ± 0.313 | 3.263 ± 1.253 | 4.593 ± 0.928 | 6.043 ± 0.714 | 2.901 ± 0.591 | 2.296 ± 0.595 | 4.23 ± 0.615 | 2.659 ± 0.613 | 4.472 ± 0.72 | 6.285 ± 1.159 | 5.076 ± 0.832 | 7.614 ± 1.871 | 1.088 ± 0.396 | 1.934 ± 0.654 | 0.0 ± 0.0 |
| Trp | 1.934 ± 0.512 | 0.121 ± 0.104 | 0.967 ± 0.273 | 1.088 ± 0.354 | 0.363 ± 0.173 | 1.209 ± 0.487 | 0.483 ± 0.207 | 0.725 ± 0.262 | 0.604 ± 0.268 | 1.45 ± 0.525 | 0.604 ± 0.256 | 1.209 ± 0.384 | 0.725 ± 0.221 | 1.088 ± 0.347 | 2.175 ± 0.545 | 1.088 ± 0.379 | 1.571 ± 0.407 | 0.846 ± 0.346 | 0.363 ± 0.175 | 0.363 ± 0.202 | 0.0 ± 0.0 |
| Tyr | 1.813 ± 0.364 | 0.363 ± 0.253 | 2.175 ± 0.538 | 1.209 ± 0.315 | 0.242 ± 0.149 | 3.022 ± 0.622 | 0.725 ± 0.281 | 1.209 ± 0.372 | 0.725 ± 0.322 | 0.846 ± 0.32 | 0.604 ± 0.196 | 1.088 ± 0.493 | 2.055 ± 0.512 | 0.846 ± 0.328 | 2.659 ± 0.542 | 1.571 ± 0.387 | 2.417 ± 0.518 | 2.055 ± 0.388 | 0.483 ± 0.312 | 0.967 ± 0.306 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 41 proteins (8275 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
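A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the error is taken as the sample standard deviation across replicates. The source does not specify the exact estimator, and the helper names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(sample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences (not phage data), only to show the call.
toy_proteome = ["MAAG", "MGGA", "AAAA"]
print(per_mille(toy_proteome, "AA"), bootstrap_error(toy_proteome, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins are in the set.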

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski