Amino acid dipeptide frequency for Ruegeria phage DSS3-P1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
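The counting behind a table like this can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used by Proteome-pI: it assumes that dipeptides are counted as overlapping residue pairs within each protein, that pairs never span two proteins, and that the frequency is the pair count divided by the total number of pairs. The function names `dipeptide_permille` and `classify`, the toy sequences, and the use of one-letter codes are hypothetical choices made for the example.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1000/400 = 2.5 per mille (0.25 %).
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs (i, i+1) are counted inside each protein
    and pairs never span the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# Toy input (hypothetical sequences, one-letter amino acid codes).
toy = ["MAAK", "AAL"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

Applied to values from the table, `classify(17.668)` labels AlaAla as overrepresented and `classify(0.613)` labels AlaCys as underrepresented relative to the 2.5‰ uniform expectation. Whether the page colours cells against exactly this threshold is not stated in the text.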

For more information, see the sequence space article on Wikipedia.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 17.668 ± 1.96 | 0.613 ± 0.203 | 6.744 ± 0.72 | 8.75 ± 0.757 | 3.456 ± 0.519 | 8.583 ± 1.123 | 2.285 ± 0.419 | 5.016 ± 0.52 | 5.406 ± 0.781 | 9.364 ± 0.788 | 2.508 ± 0.329 | 3.623 ± 0.511 | 4.57 ± 0.562 | 3.846 ± 0.696 | 8.862 ± 0.853 | 5.351 ± 0.665 | 6.075 ± 0.551 | 9.141 ± 0.604 | 1.449 ± 0.256 | 3.679 ± 0.404 | 0.0 ± 0.0
Cys | 0.892 ± 0.257 | 0.111 ± 0.085 | 0.334 ± 0.131 | 0.39 ± 0.164 | 0.223 ± 0.116 | 0.892 ± 0.274 | 0.557 ± 0.198 | 0.223 ± 0.116 | 0.279 ± 0.127 | 0.892 ± 0.252 | 0.111 ± 0.077 | 0.167 ± 0.109 | 0.892 ± 0.234 | 0.279 ± 0.119 | 0.613 ± 0.213 | 0.334 ± 0.134 | 0.446 ± 0.162 | 0.39 ± 0.124 | 0.111 ± 0.073 | 0.167 ± 0.129 | 0.0 ± 0.0
Asp | 7.803 ± 0.895 | 0.557 ± 0.175 | 5.462 ± 1.25 | 6.521 ± 0.993 | 2.174 ± 0.317 | 5.295 ± 0.525 | 1.226 ± 0.264 | 3.065 ± 0.491 | 1.784 ± 0.26 | 6.911 ± 0.536 | 1.505 ± 0.282 | 1.561 ± 0.263 | 4.626 ± 0.688 | 1.616 ± 0.297 | 4.18 ± 0.536 | 2.731 ± 0.412 | 3.177 ± 0.427 | 3.288 ± 0.37 | 1.393 ± 0.301 | 2.174 ± 0.37 | 0.0 ± 0.0
Glu | 8.527 ± 0.804 | 0.557 ± 0.178 | 4.292 ± 0.619 | 4.403 ± 0.52 | 3.177 ± 0.388 | 4.57 ± 0.496 | 0.836 ± 0.265 | 4.96 ± 0.566 | 2.954 ± 0.467 | 5.072 ± 0.617 | 2.174 ± 0.341 | 2.842 ± 0.444 | 2.731 ± 0.431 | 2.452 ± 0.47 | 4.96 ± 0.521 | 4.013 ± 0.488 | 6.019 ± 0.673 | 5.295 ± 0.525 | 1.338 ± 0.208 | 1.616 ± 0.275 | 0.0 ± 0.0
Phe | 3.288 ± 0.353 | 0.613 ± 0.225 | 3.177 ± 0.432 | 2.174 ± 0.423 | 1.003 ± 0.251 | 2.731 ± 0.441 | 0.836 ± 0.212 | 1.561 ± 0.251 | 1.282 ± 0.228 | 2.452 ± 0.369 | 0.836 ± 0.224 | 1.282 ± 0.286 | 1.616 ± 0.277 | 1.17 ± 0.265 | 2.675 ± 0.29 | 1.784 ± 0.353 | 2.285 ± 0.386 | 1.505 ± 0.278 | 1.115 ± 0.243 | 1.003 ± 0.243 | 0.0 ± 0.0
Gly | 7.747 ± 0.661 | 1.059 ± 0.233 | 4.905 ± 0.515 | 6.019 ± 0.466 | 3.511 ± 0.49 | 8.36 ± 1.04 | 2.229 ± 0.343 | 3.456 ± 0.598 | 4.013 ± 0.541 | 7.301 ± 0.713 | 2.006 ± 0.378 | 2.118 ± 0.316 | 2.285 ± 0.432 | 2.898 ± 0.589 | 5.629 ± 0.628 | 4.459 ± 0.538 | 4.626 ± 0.551 | 4.96 ± 0.567 | 1.393 ± 0.272 | 2.229 ± 0.348 | 0.0 ± 0.0
His | 1.226 ± 0.266 | 0.334 ± 0.133 | 1.561 ± 0.349 | 1.338 ± 0.282 | 0.334 ± 0.12 | 1.951 ± 0.286 | 0.78 ± 0.191 | 1.226 ± 0.268 | 0.836 ± 0.214 | 1.951 ± 0.298 | 0.446 ± 0.158 | 0.836 ± 0.196 | 1.226 ± 0.273 | 0.725 ± 0.209 | 1.505 ± 0.256 | 0.557 ± 0.178 | 0.836 ± 0.226 | 1.003 ± 0.258 | 0.502 ± 0.183 | 0.669 ± 0.231 | 0.0 ± 0.0
Ile | 5.406 ± 0.609 | 0.446 ± 0.167 | 3.734 ± 0.398 | 4.905 ± 0.477 | 1.282 ± 0.242 | 4.013 ± 0.471 | 0.725 ± 0.203 | 1.338 ± 0.312 | 1.616 ± 0.366 | 2.898 ± 0.398 | 0.836 ± 0.268 | 1.616 ± 0.25 | 2.452 ± 0.322 | 1.895 ± 0.314 | 2.62 ± 0.394 | 1.784 ± 0.332 | 2.898 ± 0.401 | 2.397 ± 0.327 | 0.613 ± 0.222 | 0.892 ± 0.213 | 0.0 ± 0.0
Lys | 5.741 ± 0.58 | 0.111 ± 0.074 | 2.62 ± 0.506 | 2.341 ± 0.395 | 1.17 ± 0.323 | 2.787 ± 0.449 | 0.725 ± 0.189 | 1.728 ± 0.298 | 2.118 ± 0.578 | 3.734 ± 0.505 | 1.282 ± 0.273 | 1.282 ± 0.275 | 2.954 ± 0.487 | 1.561 ± 0.262 | 2.62 ± 0.426 | 2.174 ± 0.346 | 1.839 ± 0.397 | 2.006 ± 0.35 | 0.502 ± 0.154 | 0.725 ± 0.184 | 0.0 ± 0.0
Leu | 11.481 ± 0.705 | 0.892 ± 0.24 | 6.187 ± 0.501 | 5.685 ± 0.515 | 2.006 ± 0.402 | 7.58 ± 0.604 | 1.338 ± 0.288 | 3.511 ± 0.52 | 3.623 ± 0.464 | 7.524 ± 0.958 | 1.226 ± 0.255 | 3.344 ± 0.414 | 5.183 ± 0.605 | 2.675 ± 0.413 | 5.908 ± 0.656 | 4.96 ± 0.464 | 6.521 ± 0.714 | 5.072 ± 0.54 | 1.226 ± 0.315 | 1.951 ± 0.352 | 0.0 ± 0.0
Met | 2.229 ± 0.334 | 0.334 ± 0.144 | 1.003 ± 0.19 | 0.892 ± 0.176 | 0.892 ± 0.224 | 1.672 ± 0.291 | 0.446 ± 0.154 | 1.17 ± 0.273 | 1.115 ± 0.294 | 2.062 ± 0.309 | 0.613 ± 0.19 | 0.947 ± 0.214 | 1.784 ± 0.293 | 0.78 ± 0.179 | 1.282 ± 0.288 | 1.672 ± 0.305 | 1.784 ± 0.252 | 1.449 ± 0.349 | 0.223 ± 0.106 | 0.167 ± 0.099 | 0.0 ± 0.0
Asn | 3.957 ± 0.585 | 0.167 ± 0.101 | 1.728 ± 0.256 | 1.728 ± 0.308 | 0.613 ± 0.156 | 3.4 ± 0.425 | 0.669 ± 0.169 | 0.947 ± 0.243 | 1.17 ± 0.253 | 3.177 ± 0.466 | 0.725 ± 0.204 | 0.947 ± 0.194 | 2.285 ± 0.419 | 1.17 ± 0.25 | 2.675 ± 0.379 | 1.393 ± 0.272 | 1.449 ± 0.262 | 1.561 ± 0.31 | 0.836 ± 0.231 | 0.78 ± 0.189 | 0.0 ± 0.0
Pro | 5.518 ± 0.756 | 0.613 ± 0.195 | 3.734 ± 0.507 | 4.347 ± 0.578 | 2.229 ± 0.409 | 5.295 ± 0.561 | 1.17 ± 0.331 | 1.561 ± 0.31 | 1.728 ± 0.314 | 3.623 ± 0.489 | 1.003 ± 0.218 | 1.839 ± 0.328 | 2.787 ± 0.49 | 0.892 ± 0.191 | 3.01 ± 0.466 | 2.397 ± 0.373 | 2.731 ± 0.435 | 4.236 ± 0.515 | 0.502 ± 0.204 | 1.059 ± 0.294 | 0.0 ± 0.0
Gln | 4.682 ± 0.761 | 0.167 ± 0.087 | 1.561 ± 0.294 | 2.397 ± 0.404 | 1.449 ± 0.23 | 2.006 ± 0.311 | 0.446 ± 0.15 | 1.784 ± 0.343 | 1.338 ± 0.3 | 2.118 ± 0.462 | 1.226 ± 0.279 | 1.059 ± 0.23 | 1.449 ± 0.305 | 1.059 ± 0.291 | 2.954 ± 0.404 | 1.449 ± 0.244 | 2.285 ± 0.34 | 2.787 ± 0.436 | 0.334 ± 0.122 | 1.059 ± 0.283 | 0.0 ± 0.0
Arg | 7.246 ± 1.02 | 0.446 ± 0.146 | 4.403 ± 0.479 | 5.462 ± 0.587 | 3.288 ± 0.489 | 4.849 ± 0.505 | 1.449 ± 0.36 | 3.01 ± 0.388 | 3.121 ± 0.579 | 7.747 ± 0.572 | 1.616 ± 0.282 | 1.561 ± 0.233 | 3.456 ± 0.537 | 2.842 ± 0.484 | 5.295 ± 0.587 | 3.065 ± 0.446 | 3.567 ± 0.526 | 6.075 ± 0.65 | 1.449 ± 0.319 | 1.616 ± 0.263 | 0.0 ± 0.0
Ser | 5.072 ± 0.761 | 0.334 ± 0.133 | 3.567 ± 0.462 | 3.734 ± 0.415 | 2.341 ± 0.388 | 4.626 ± 0.543 | 0.725 ± 0.242 | 2.675 ± 0.334 | 1.616 ± 0.296 | 4.849 ± 0.41 | 0.836 ± 0.184 | 1.616 ± 0.282 | 1.895 ± 0.348 | 1.226 ± 0.277 | 3.901 ± 0.478 | 2.229 ± 0.353 | 2.898 ± 0.395 | 3.344 ± 0.35 | 0.613 ± 0.177 | 0.947 ± 0.201 | 0.0 ± 0.0
Thr | 5.685 ± 0.525 | 0.223 ± 0.092 | 4.236 ± 0.359 | 3.957 ± 0.382 | 2.118 ± 0.273 | 5.128 ± 0.584 | 1.059 ± 0.267 | 3.623 ± 0.466 | 2.062 ± 0.42 | 5.908 ± 0.595 | 0.947 ± 0.243 | 1.282 ± 0.238 | 3.233 ± 0.54 | 2.452 ± 0.39 | 3.79 ± 0.52 | 2.341 ± 0.478 | 2.842 ± 0.454 | 5.406 ± 0.784 | 0.947 ± 0.231 | 1.784 ± 0.294 | 0.0 ± 0.0
Val | 8.249 ± 0.692 | 0.557 ± 0.19 | 4.18 ± 0.442 | 5.462 ± 0.618 | 2.062 ± 0.318 | 4.069 ± 0.49 | 1.059 ± 0.266 | 1.728 ± 0.309 | 2.675 ± 0.45 | 5.741 ± 0.547 | 1.449 ± 0.297 | 2.118 ± 0.408 | 3.01 ± 0.446 | 2.62 ± 0.293 | 5.685 ± 0.491 | 3.901 ± 0.465 | 4.18 ± 0.541 | 4.905 ± 0.509 | 1.226 ± 0.263 | 2.285 ± 0.371 | 0.0 ± 0.0
Trp | 1.338 ± 0.242 | 0.056 ± 0.055 | 1.449 ± 0.298 | 0.669 ± 0.236 | 0.669 ± 0.23 | 1.338 ± 0.28 | 0.446 ± 0.169 | 0.502 ± 0.165 | 0.446 ± 0.162 | 2.006 ± 0.361 | 0.613 ± 0.149 | 0.502 ± 0.184 | 0.78 ± 0.261 | 0.502 ± 0.182 | 1.226 ± 0.278 | 1.003 ± 0.243 | 1.115 ± 0.262 | 1.003 ± 0.244 | 0.557 ± 0.26 | 0.502 ± 0.153 | 0.0 ± 0.0
Tyr | 3.288 ± 0.5 | 0.111 ± 0.071 | 2.174 ± 0.389 | 1.616 ± 0.287 | 0.39 ± 0.133 | 2.062 ± 0.42 | 0.836 ± 0.235 | 1.282 ± 0.25 | 0.836 ± 0.255 | 2.564 ± 0.482 | 0.502 ± 0.154 | 0.836 ± 0.222 | 1.059 ± 0.258 | 0.947 ± 0.228 | 2.174 ± 0.405 | 1.449 ± 0.246 | 1.449 ± 0.281 | 1.338 ± 0.217 | 0.39 ± 0.153 | 0.557 ± 0.176 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 82 proteins (17943 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level
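To illustrate what a protein-level bootstrap means, the sketch below resamples whole proteins with replacement, recomputes the per mille frequency of one dipeptide in each replicate, and reports the spread of those estimates. It is a hedged illustration only: the source does not say which spread statistic is reported, so the use of the sample standard deviation is an assumption, as are the function names `pair_permille` and `bootstrap_error`, the fixed random seed, and the toy sequences.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Per mille frequency of one dipeptide over overlapping residue pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation of the
    frequency across `rounds` bootstrap replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, one-letter amino acid codes).
toy = ["MAAK", "AAL", "GGGG", "MKAAV"]
print(pair_permille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

Resampling at the protein level keeps each sequence intact, so the estimated error reflects how much the frequency varies from protein to protein rather than from residue to residue.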

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski