Amino acid dipepetide frequency for Cellulophaga phage phi17:1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.851AlaAla: 4.851 ± 1.498
0.728AlaCys: 0.728 ± 0.259
3.558AlaAsp: 3.558 ± 0.548
3.477AlaGlu: 3.477 ± 0.494
2.506AlaPhe: 2.506 ± 0.56
3.315AlaGly: 3.315 ± 0.582
0.97AlaHis: 0.97 ± 0.253
4.77AlaIle: 4.77 ± 0.768
5.094AlaLys: 5.094 ± 0.626
4.447AlaLeu: 4.447 ± 0.696
1.375AlaMet: 1.375 ± 0.447
2.668AlaAsn: 2.668 ± 0.524
1.779AlaPro: 1.779 ± 0.443
2.668AlaGln: 2.668 ± 0.658
2.587AlaArg: 2.587 ± 0.499
5.417AlaSer: 5.417 ± 0.817
3.396AlaThr: 3.396 ± 0.731
3.396AlaVal: 3.396 ± 0.795
1.132AlaTrp: 1.132 ± 0.294
2.587AlaTyr: 2.587 ± 0.447
0.0AlaXaa: 0.0 ± 0.0
Cys
0.404CysAla: 0.404 ± 0.153
0.081CysCys: 0.081 ± 0.092
0.889CysAsp: 0.889 ± 0.246
0.728CysGlu: 0.728 ± 0.242
0.889CysPhe: 0.889 ± 0.262
0.647CysGly: 0.647 ± 0.292
0.081CysHis: 0.081 ± 0.092
0.809CysIle: 0.809 ± 0.295
0.809CysLys: 0.809 ± 0.315
1.051CysLeu: 1.051 ± 0.364
0.081CysMet: 0.081 ± 0.12
0.485CysAsn: 0.485 ± 0.211
0.647CysPro: 0.647 ± 0.203
0.404CysGln: 0.404 ± 0.185
0.97CysArg: 0.97 ± 0.291
0.889CysSer: 0.889 ± 0.244
0.323CysThr: 0.323 ± 0.153
0.485CysVal: 0.485 ± 0.249
0.081CysTrp: 0.081 ± 0.094
0.485CysTyr: 0.485 ± 0.191
0.0CysXaa: 0.0 ± 0.0
Asp
4.609AspAla: 4.609 ± 0.802
1.051AspCys: 1.051 ± 0.346
3.072AspAsp: 3.072 ± 0.479
4.447AspGlu: 4.447 ± 0.682
3.881AspPhe: 3.881 ± 0.498
4.124AspGly: 4.124 ± 0.693
0.889AspHis: 0.889 ± 0.257
5.579AspIle: 5.579 ± 0.664
5.255AspLys: 5.255 ± 0.693
6.064AspLeu: 6.064 ± 0.56
1.698AspMet: 1.698 ± 0.361
3.8AspAsn: 3.8 ± 0.48
2.668AspPro: 2.668 ± 0.571
2.426AspGln: 2.426 ± 0.384
2.668AspArg: 2.668 ± 0.49
3.719AspSer: 3.719 ± 0.695
3.396AspThr: 3.396 ± 0.588
3.477AspVal: 3.477 ± 0.525
0.809AspTrp: 0.809 ± 0.23
2.264AspTyr: 2.264 ± 0.439
0.0AspXaa: 0.0 ± 0.0
Glu
4.77GluAla: 4.77 ± 0.674
0.323GluCys: 0.323 ± 0.155
3.638GluAsp: 3.638 ± 0.604
4.851GluGlu: 4.851 ± 0.996
2.668GluPhe: 2.668 ± 0.364
2.83GluGly: 2.83 ± 0.428
0.728GluHis: 0.728 ± 0.255
5.336GluIle: 5.336 ± 0.617
4.77GluLys: 4.77 ± 0.779
6.549GluLeu: 6.549 ± 0.74
2.183GluMet: 2.183 ± 0.406
4.124GluAsn: 4.124 ± 0.464
1.617GluPro: 1.617 ± 0.425
2.183GluGln: 2.183 ± 0.454
2.264GluArg: 2.264 ± 0.497
4.043GluSer: 4.043 ± 0.689
3.315GluThr: 3.315 ± 0.509
4.528GluVal: 4.528 ± 0.638
1.132GluTrp: 1.132 ± 0.331
2.345GluTyr: 2.345 ± 0.442
0.0GluXaa: 0.0 ± 0.0
Phe
2.587PheAla: 2.587 ± 0.467
0.243PheCys: 0.243 ± 0.161
4.932PheAsp: 4.932 ± 0.554
2.668PheGlu: 2.668 ± 0.464
2.506PhePhe: 2.506 ± 0.456
2.749PheGly: 2.749 ± 0.429
0.566PheHis: 0.566 ± 0.222
3.719PheIle: 3.719 ± 0.532
4.528PheLys: 4.528 ± 0.509
3.638PheLeu: 3.638 ± 0.6
1.455PheMet: 1.455 ± 0.308
3.638PheAsn: 3.638 ± 0.453
1.779PhePro: 1.779 ± 0.443
1.132PheGln: 1.132 ± 0.303
1.294PheArg: 1.294 ± 0.334
3.234PheSer: 3.234 ± 0.437
2.264PheThr: 2.264 ± 0.384
2.021PheVal: 2.021 ± 0.573
0.404PheTrp: 0.404 ± 0.174
2.021PheTyr: 2.021 ± 0.37
0.0PheXaa: 0.0 ± 0.0
Gly
4.447GlyAla: 4.447 ± 0.997
0.809GlyCys: 0.809 ± 0.278
5.013GlyAsp: 5.013 ± 0.792
3.315GlyGlu: 3.315 ± 0.731
4.043GlyPhe: 4.043 ± 0.567
4.932GlyGly: 4.932 ± 1.268
0.728GlyHis: 0.728 ± 0.25
5.094GlyIle: 5.094 ± 1.086
3.719GlyLys: 3.719 ± 0.418
5.094GlyLeu: 5.094 ± 0.536
1.051GlyMet: 1.051 ± 0.229
2.911GlyAsn: 2.911 ± 0.486
1.779GlyPro: 1.779 ± 0.517
2.183GlyGln: 2.183 ± 0.42
1.698GlyArg: 1.698 ± 0.385
5.417GlySer: 5.417 ± 0.9
4.285GlyThr: 4.285 ± 0.857
4.285GlyVal: 4.285 ± 0.703
0.97GlyTrp: 0.97 ± 0.234
2.587GlyTyr: 2.587 ± 0.532
0.0GlyXaa: 0.0 ± 0.0
His
0.97HisAla: 0.97 ± 0.287
0.323HisCys: 0.323 ± 0.147
0.485HisAsp: 0.485 ± 0.24
0.566HisGlu: 0.566 ± 0.206
0.97HisPhe: 0.97 ± 0.282
0.97HisGly: 0.97 ± 0.331
0.0HisHis: 0.0 ± 0.0
1.051HisIle: 1.051 ± 0.398
1.213HisLys: 1.213 ± 0.365
1.375HisLeu: 1.375 ± 0.358
0.809HisMet: 0.809 ± 0.262
0.728HisAsn: 0.728 ± 0.211
0.566HisPro: 0.566 ± 0.268
0.404HisGln: 0.404 ± 0.154
0.647HisArg: 0.647 ± 0.258
0.728HisSer: 0.728 ± 0.222
0.243HisThr: 0.243 ± 0.121
0.728HisVal: 0.728 ± 0.246
0.0HisTrp: 0.0 ± 0.0
0.889HisTyr: 0.889 ± 0.301
0.0HisXaa: 0.0 ± 0.0
Ile
5.579IleAla: 5.579 ± 0.766
0.809IleCys: 0.809 ± 0.267
6.468IleAsp: 6.468 ± 0.654
7.439IleGlu: 7.439 ± 0.714
3.396IlePhe: 3.396 ± 0.514
4.366IleGly: 4.366 ± 0.542
0.728IleHis: 0.728 ± 0.226
5.579IleIle: 5.579 ± 0.59
8.49IleLys: 8.49 ± 1.11
7.277IleLeu: 7.277 ± 0.734
1.94IleMet: 1.94 ± 0.389
5.498IleAsn: 5.498 ± 0.633
2.345IlePro: 2.345 ± 0.616
2.83IleGln: 2.83 ± 0.825
2.668IleArg: 2.668 ± 0.528
5.094IleSer: 5.094 ± 0.543
3.8IleThr: 3.8 ± 0.603
4.609IleVal: 4.609 ± 0.609
0.809IleTrp: 0.809 ± 0.236
2.506IleTyr: 2.506 ± 0.537
0.0IleXaa: 0.0 ± 0.0
Lys
4.77LysAla: 4.77 ± 0.709
0.647LysCys: 0.647 ± 0.223
5.66LysAsp: 5.66 ± 0.753
7.115LysGlu: 7.115 ± 0.88
3.153LysPhe: 3.153 ± 0.478
5.579LysGly: 5.579 ± 0.699
1.455LysHis: 1.455 ± 0.353
7.6LysIle: 7.6 ± 0.947
8.571LysLys: 8.571 ± 1.052
7.439LysLeu: 7.439 ± 1.041
2.83LysMet: 2.83 ± 0.547
4.69LysAsn: 4.69 ± 0.708
3.719LysPro: 3.719 ± 0.649
2.506LysGln: 2.506 ± 0.42
3.962LysArg: 3.962 ± 0.772
5.417LysSer: 5.417 ± 0.591
4.366LysThr: 4.366 ± 0.556
4.204LysVal: 4.204 ± 0.609
1.132LysTrp: 1.132 ± 0.354
3.881LysTyr: 3.881 ± 0.586
0.0LysXaa: 0.0 ± 0.0
Leu
2.749LeuAla: 2.749 ± 0.604
1.294LeuCys: 1.294 ± 0.388
6.468LeuAsp: 6.468 ± 0.864
5.094LeuGlu: 5.094 ± 0.629
3.881LeuPhe: 3.881 ± 0.759
3.8LeuGly: 3.8 ± 0.55
1.375LeuHis: 1.375 ± 0.373
6.873LeuIle: 6.873 ± 0.935
7.196LeuLys: 7.196 ± 0.896
5.66LeuLeu: 5.66 ± 0.683
2.345LeuMet: 2.345 ± 0.586
5.175LeuAsn: 5.175 ± 0.689
3.234LeuPro: 3.234 ± 0.569
2.264LeuGln: 2.264 ± 0.409
3.072LeuArg: 3.072 ± 0.581
6.792LeuSer: 6.792 ± 0.706
5.175LeuThr: 5.175 ± 0.643
3.638LeuVal: 3.638 ± 0.599
1.051LeuTrp: 1.051 ± 0.298
2.426LeuTyr: 2.426 ± 0.454
0.0LeuXaa: 0.0 ± 0.0
Met
1.455MetAla: 1.455 ± 0.303
0.404MetCys: 0.404 ± 0.16
1.213MetAsp: 1.213 ± 0.249
1.375MetGlu: 1.375 ± 0.339
0.889MetPhe: 0.889 ± 0.23
1.132MetGly: 1.132 ± 0.29
0.566MetHis: 0.566 ± 0.259
1.94MetIle: 1.94 ± 0.376
2.426MetLys: 2.426 ± 0.419
1.051MetLeu: 1.051 ± 0.279
0.809MetMet: 0.809 ± 0.317
2.264MetAsn: 2.264 ± 0.402
1.294MetPro: 1.294 ± 0.381
1.375MetGln: 1.375 ± 0.389
1.375MetArg: 1.375 ± 0.357
2.345MetSer: 2.345 ± 0.435
1.455MetThr: 1.455 ± 0.389
1.617MetVal: 1.617 ± 0.329
0.243MetTrp: 0.243 ± 0.134
1.375MetTyr: 1.375 ± 0.375
0.0MetXaa: 0.0 ± 0.0
Asn
2.992AsnAla: 2.992 ± 0.515
0.97AsnCys: 0.97 ± 0.243
3.558AsnAsp: 3.558 ± 0.562
2.749AsnGlu: 2.749 ± 0.527
2.83AsnPhe: 2.83 ± 0.48
4.609AsnGly: 4.609 ± 0.727
0.97AsnHis: 0.97 ± 0.292
4.77AsnIle: 4.77 ± 0.72
6.873AsnLys: 6.873 ± 0.86
3.962AsnLeu: 3.962 ± 0.593
1.536AsnMet: 1.536 ± 0.281
3.558AsnAsn: 3.558 ± 0.667
1.94AsnPro: 1.94 ± 0.436
1.94AsnGln: 1.94 ± 0.34
2.668AsnArg: 2.668 ± 0.374
4.043AsnSer: 4.043 ± 0.485
3.315AsnThr: 3.315 ± 0.514
2.668AsnVal: 2.668 ± 0.415
0.647AsnTrp: 0.647 ± 0.248
2.102AsnTyr: 2.102 ± 0.472
0.0AsnXaa: 0.0 ± 0.0
Pro
2.587ProAla: 2.587 ± 0.431
0.404ProCys: 0.404 ± 0.215
1.617ProAsp: 1.617 ± 0.468
2.345ProGlu: 2.345 ± 0.485
1.617ProPhe: 1.617 ± 0.391
2.506ProGly: 2.506 ± 0.391
0.566ProHis: 0.566 ± 0.207
2.587ProIle: 2.587 ± 0.415
3.8ProLys: 3.8 ± 0.548
2.345ProLeu: 2.345 ± 0.51
0.728ProMet: 0.728 ± 0.302
1.779ProAsn: 1.779 ± 0.528
1.294ProPro: 1.294 ± 0.379
1.779ProGln: 1.779 ± 0.495
0.728ProArg: 0.728 ± 0.215
2.102ProSer: 2.102 ± 0.467
1.94ProThr: 1.94 ± 0.395
1.779ProVal: 1.779 ± 0.302
0.323ProTrp: 0.323 ± 0.146
1.294ProTyr: 1.294 ± 0.236
0.0ProXaa: 0.0 ± 0.0
Gln
1.86GlnAla: 1.86 ± 0.537
0.162GlnCys: 0.162 ± 0.114
2.426GlnAsp: 2.426 ± 0.457
2.264GlnGlu: 2.264 ± 0.432
1.294GlnPhe: 1.294 ± 0.318
3.396GlnGly: 3.396 ± 1.496
0.404GlnHis: 0.404 ± 0.179
2.83GlnIle: 2.83 ± 0.366
2.021GlnLys: 2.021 ± 0.357
3.396GlnLeu: 3.396 ± 0.539
0.97GlnMet: 0.97 ± 0.247
1.617GlnAsn: 1.617 ± 0.423
0.323GlnPro: 0.323 ± 0.147
1.213GlnGln: 1.213 ± 0.392
1.779GlnArg: 1.779 ± 0.332
1.617GlnSer: 1.617 ± 0.309
2.264GlnThr: 2.264 ± 0.408
1.779GlnVal: 1.779 ± 0.395
0.647GlnTrp: 0.647 ± 0.218
1.051GlnTyr: 1.051 ± 0.303
0.0GlnXaa: 0.0 ± 0.0
Arg
2.021ArgAla: 2.021 ± 0.45
0.647ArgCys: 0.647 ± 0.233
2.345ArgAsp: 2.345 ± 0.385
2.264ArgGlu: 2.264 ± 0.471
0.97ArgPhe: 0.97 ± 0.273
1.94ArgGly: 1.94 ± 0.436
0.485ArgHis: 0.485 ± 0.194
3.638ArgIle: 3.638 ± 0.518
4.124ArgLys: 4.124 ± 0.566
3.153ArgLeu: 3.153 ± 0.548
1.536ArgMet: 1.536 ± 0.352
1.86ArgAsn: 1.86 ± 0.322
0.728ArgPro: 0.728 ± 0.207
1.375ArgGln: 1.375 ± 0.333
1.051ArgArg: 1.051 ± 0.365
2.668ArgSer: 2.668 ± 0.516
2.102ArgThr: 2.102 ± 0.406
1.94ArgVal: 1.94 ± 0.393
0.647ArgTrp: 0.647 ± 0.205
1.86ArgTyr: 1.86 ± 0.38
0.0ArgXaa: 0.0 ± 0.0
Ser
3.558SerAla: 3.558 ± 0.472
0.97SerCys: 0.97 ± 0.259
4.528SerAsp: 4.528 ± 0.486
3.638SerGlu: 3.638 ± 0.49
3.719SerPhe: 3.719 ± 0.422
5.902SerGly: 5.902 ± 0.999
0.97SerHis: 0.97 ± 0.311
6.63SerIle: 6.63 ± 0.738
5.821SerLys: 5.821 ± 0.634
5.579SerLeu: 5.579 ± 0.646
1.86SerMet: 1.86 ± 0.389
4.609SerAsn: 4.609 ± 0.776
1.86SerPro: 1.86 ± 0.326
2.426SerGln: 2.426 ± 0.328
2.183SerArg: 2.183 ± 0.459
4.285SerSer: 4.285 ± 0.528
4.204SerThr: 4.204 ± 0.828
2.83SerVal: 2.83 ± 0.483
0.647SerTrp: 0.647 ± 0.195
2.83SerTyr: 2.83 ± 0.499
0.0SerXaa: 0.0 ± 0.0
Thr
3.8ThrAla: 3.8 ± 0.408
0.485ThrCys: 0.485 ± 0.196
3.234ThrAsp: 3.234 ± 0.532
3.072ThrGlu: 3.072 ± 0.431
2.345ThrPhe: 2.345 ± 0.482
5.983ThrGly: 5.983 ± 0.97
0.566ThrHis: 0.566 ± 0.183
4.609ThrIle: 4.609 ± 0.642
4.204ThrLys: 4.204 ± 0.804
3.153ThrLeu: 3.153 ± 0.49
0.889ThrMet: 0.889 ± 0.246
2.749ThrAsn: 2.749 ± 0.369
2.587ThrPro: 2.587 ± 0.493
1.536ThrGln: 1.536 ± 0.287
1.86ThrArg: 1.86 ± 0.312
3.719ThrSer: 3.719 ± 0.701
3.072ThrThr: 3.072 ± 0.711
2.83ThrVal: 2.83 ± 0.44
0.97ThrTrp: 0.97 ± 0.268
2.749ThrTyr: 2.749 ± 0.453
0.0ThrXaa: 0.0 ± 0.0
Val
2.992ValAla: 2.992 ± 0.546
0.323ValCys: 0.323 ± 0.157
4.285ValAsp: 4.285 ± 0.581
3.8ValGlu: 3.8 ± 0.47
2.506ValPhe: 2.506 ± 0.492
2.749ValGly: 2.749 ± 0.484
0.485ValHis: 0.485 ± 0.169
3.881ValIle: 3.881 ± 0.502
5.175ValLys: 5.175 ± 0.478
3.881ValLeu: 3.881 ± 0.718
1.213ValMet: 1.213 ± 0.286
2.992ValAsn: 2.992 ± 0.503
2.426ValPro: 2.426 ± 0.45
0.97ValGln: 0.97 ± 0.267
2.264ValArg: 2.264 ± 0.417
4.124ValSer: 4.124 ± 0.571
3.153ValThr: 3.153 ± 0.533
3.396ValVal: 3.396 ± 0.55
0.97ValTrp: 0.97 ± 0.257
2.668ValTyr: 2.668 ± 0.465
0.0ValXaa: 0.0 ± 0.0
Trp
1.132TrpAla: 1.132 ± 0.311
0.081TrpCys: 0.081 ± 0.083
0.97TrpAsp: 0.97 ± 0.215
1.051TrpGlu: 1.051 ± 0.284
0.647TrpPhe: 0.647 ± 0.201
0.647TrpGly: 0.647 ± 0.215
0.323TrpHis: 0.323 ± 0.162
1.375TrpIle: 1.375 ± 0.354
0.647TrpLys: 0.647 ± 0.226
1.051TrpLeu: 1.051 ± 0.306
0.404TrpMet: 0.404 ± 0.168
0.566TrpAsn: 0.566 ± 0.241
0.081TrpPro: 0.081 ± 0.083
0.404TrpGln: 0.404 ± 0.152
0.485TrpArg: 0.485 ± 0.175
0.809TrpSer: 0.809 ± 0.224
0.728TrpThr: 0.728 ± 0.285
1.213TrpVal: 1.213 ± 0.35
0.081TrpTrp: 0.081 ± 0.087
0.647TrpTyr: 0.647 ± 0.273
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.587TyrAla: 2.587 ± 0.388
0.566TyrCys: 0.566 ± 0.205
1.536TyrAsp: 1.536 ± 0.357
1.94TyrGlu: 1.94 ± 0.45
2.668TyrPhe: 2.668 ± 0.537
2.183TyrGly: 2.183 ± 0.349
0.809TyrHis: 0.809 ± 0.25
3.558TyrIle: 3.558 ± 0.56
3.962TyrLys: 3.962 ± 0.767
3.396TyrLeu: 3.396 ± 0.618
0.809TyrMet: 0.809 ± 0.25
2.992TyrAsn: 2.992 ± 0.392
1.455TyrPro: 1.455 ± 0.261
1.294TyrGln: 1.294 ± 0.395
1.132TyrArg: 1.132 ± 0.274
2.587TyrSer: 2.587 ± 0.424
1.617TyrThr: 1.617 ± 0.283
2.83TyrVal: 2.83 ± 0.462
0.647TyrTrp: 0.647 ± 0.322
2.345TyrTyr: 2.345 ± 0.51
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (12369 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski