Amino acid dipepetide frequency for Gordonia phage Confidence

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.508AlaAla: 15.508 ± 1.743
1.055AlaCys: 1.055 ± 0.252
7.754AlaAsp: 7.754 ± 0.632
8.374AlaGlu: 8.374 ± 0.712
2.915AlaPhe: 2.915 ± 0.739
10.111AlaGly: 10.111 ± 0.98
2.481AlaHis: 2.481 ± 0.467
5.025AlaIle: 5.025 ± 0.651
4.776AlaLys: 4.776 ± 0.475
8.002AlaLeu: 8.002 ± 0.873
3.35AlaMet: 3.35 ± 0.798
3.164AlaAsn: 3.164 ± 0.706
5.087AlaPro: 5.087 ± 0.639
4.218AlaGln: 4.218 ± 0.89
7.568AlaArg: 7.568 ± 0.704
6.389AlaSer: 6.389 ± 0.832
7.568AlaThr: 7.568 ± 0.655
7.258AlaVal: 7.258 ± 0.579
2.233AlaTrp: 2.233 ± 0.277
2.357AlaTyr: 2.357 ± 0.341
0.0AlaXaa: 0.0 ± 0.0
Cys
0.868CysAla: 0.868 ± 0.31
0.124CysCys: 0.124 ± 0.085
0.806CysAsp: 0.806 ± 0.259
0.62CysGlu: 0.62 ± 0.246
0.124CysPhe: 0.124 ± 0.08
1.365CysGly: 1.365 ± 0.359
0.248CysHis: 0.248 ± 0.118
0.496CysIle: 0.496 ± 0.156
0.186CysLys: 0.186 ± 0.107
0.31CysLeu: 0.31 ± 0.126
0.31CysMet: 0.31 ± 0.15
0.372CysAsn: 0.372 ± 0.14
0.682CysPro: 0.682 ± 0.24
0.248CysGln: 0.248 ± 0.146
0.806CysArg: 0.806 ± 0.296
0.62CysSer: 0.62 ± 0.196
0.62CysThr: 0.62 ± 0.194
0.372CysVal: 0.372 ± 0.167
0.124CysTrp: 0.124 ± 0.086
0.558CysTyr: 0.558 ± 0.199
0.0CysXaa: 0.0 ± 0.0
Asp
7.754AspAla: 7.754 ± 0.735
0.744AspCys: 0.744 ± 0.22
5.335AspAsp: 5.335 ± 0.689
3.97AspGlu: 3.97 ± 0.625
2.233AspPhe: 2.233 ± 0.317
7.258AspGly: 7.258 ± 0.787
1.551AspHis: 1.551 ± 0.314
2.605AspIle: 2.605 ± 0.352
1.613AspLys: 1.613 ± 0.249
7.382AspLeu: 7.382 ± 0.718
1.303AspMet: 1.303 ± 0.286
1.613AspAsn: 1.613 ± 0.251
5.211AspPro: 5.211 ± 0.58
3.04AspGln: 3.04 ± 0.458
5.955AspArg: 5.955 ± 0.825
2.791AspSer: 2.791 ± 0.403
3.164AspThr: 3.164 ± 0.462
3.846AspVal: 3.846 ± 0.461
1.613AspTrp: 1.613 ± 0.403
1.551AspTyr: 1.551 ± 0.341
0.0AspXaa: 0.0 ± 0.0
Glu
6.451GluAla: 6.451 ± 0.701
0.496GluCys: 0.496 ± 0.188
3.35GluAsp: 3.35 ± 0.465
2.853GluGlu: 2.853 ± 0.5
2.481GluPhe: 2.481 ± 0.451
2.977GluGly: 2.977 ± 0.429
1.427GluHis: 1.427 ± 0.322
3.474GluIle: 3.474 ± 0.584
1.799GluLys: 1.799 ± 0.359
5.645GluLeu: 5.645 ± 0.69
1.055GluMet: 1.055 ± 0.237
1.489GluAsn: 1.489 ± 0.411
3.226GluPro: 3.226 ± 0.539
2.357GluGln: 2.357 ± 0.347
4.9GluArg: 4.9 ± 0.582
3.04GluSer: 3.04 ± 0.5
3.35GluThr: 3.35 ± 0.439
4.652GluVal: 4.652 ± 0.671
1.489GluTrp: 1.489 ± 0.397
1.799GluTyr: 1.799 ± 0.363
0.0GluXaa: 0.0 ± 0.0
Phe
3.102PheAla: 3.102 ± 0.503
0.186PheCys: 0.186 ± 0.108
2.481PheAsp: 2.481 ± 0.391
1.923PheGlu: 1.923 ± 0.4
0.372PhePhe: 0.372 ± 0.157
2.915PheGly: 2.915 ± 0.603
0.31PheHis: 0.31 ± 0.131
1.551PheIle: 1.551 ± 0.234
0.62PheLys: 0.62 ± 0.189
2.171PheLeu: 2.171 ± 0.341
0.31PheMet: 0.31 ± 0.154
0.434PheAsn: 0.434 ± 0.127
1.489PhePro: 1.489 ± 0.312
0.868PheGln: 0.868 ± 0.281
1.985PheArg: 1.985 ± 0.3
1.799PheSer: 1.799 ± 0.357
2.543PheThr: 2.543 ± 0.464
2.295PheVal: 2.295 ± 0.368
0.31PheTrp: 0.31 ± 0.14
0.62PheTyr: 0.62 ± 0.177
0.0PheXaa: 0.0 ± 0.0
Gly
7.816GlyAla: 7.816 ± 0.936
0.744GlyCys: 0.744 ± 0.235
5.893GlyAsp: 5.893 ± 0.709
4.9GlyGlu: 4.9 ± 0.43
2.791GlyPhe: 2.791 ± 0.489
9.987GlyGly: 9.987 ± 1.546
2.295GlyHis: 2.295 ± 0.452
3.784GlyIle: 3.784 ± 0.504
3.226GlyLys: 3.226 ± 0.44
5.893GlyLeu: 5.893 ± 0.621
1.117GlyMet: 1.117 ± 0.282
2.667GlyAsn: 2.667 ± 0.42
3.722GlyPro: 3.722 ± 0.463
3.97GlyGln: 3.97 ± 0.634
7.009GlyArg: 7.009 ± 0.713
4.776GlySer: 4.776 ± 0.781
5.893GlyThr: 5.893 ± 0.824
6.265GlyVal: 6.265 ± 0.734
1.799GlyTrp: 1.799 ± 0.315
1.861GlyTyr: 1.861 ± 0.331
0.0GlyXaa: 0.0 ± 0.0
His
2.109HisAla: 2.109 ± 0.367
0.434HisCys: 0.434 ± 0.188
1.365HisAsp: 1.365 ± 0.381
1.117HisGlu: 1.117 ± 0.276
0.248HisPhe: 0.248 ± 0.152
1.985HisGly: 1.985 ± 0.493
0.992HisHis: 0.992 ± 0.27
0.744HisIle: 0.744 ± 0.259
0.31HisLys: 0.31 ± 0.122
1.551HisLeu: 1.551 ± 0.335
0.248HisMet: 0.248 ± 0.123
0.806HisAsn: 0.806 ± 0.216
1.365HisPro: 1.365 ± 0.378
0.558HisGln: 0.558 ± 0.195
1.365HisArg: 1.365 ± 0.279
0.496HisSer: 0.496 ± 0.233
1.675HisThr: 1.675 ± 0.281
1.303HisVal: 1.303 ± 0.26
0.186HisTrp: 0.186 ± 0.096
0.806HisTyr: 0.806 ± 0.281
0.0HisXaa: 0.0 ± 0.0
Ile
5.955IleAla: 5.955 ± 0.681
0.372IleCys: 0.372 ± 0.149
4.59IleAsp: 4.59 ± 0.668
3.66IleGlu: 3.66 ± 0.503
0.806IlePhe: 0.806 ± 0.238
3.784IleGly: 3.784 ± 0.483
0.62IleHis: 0.62 ± 0.233
1.241IleIle: 1.241 ± 0.315
1.241IleLys: 1.241 ± 0.383
3.102IleLeu: 3.102 ± 0.442
0.372IleMet: 0.372 ± 0.16
1.179IleAsn: 1.179 ± 0.259
2.977IlePro: 2.977 ± 0.51
2.047IleGln: 2.047 ± 0.375
3.97IleArg: 3.97 ± 0.425
1.923IleSer: 1.923 ± 0.357
3.97IleThr: 3.97 ± 0.568
3.35IleVal: 3.35 ± 0.538
0.558IleTrp: 0.558 ± 0.202
0.806IleTyr: 0.806 ± 0.281
0.0IleXaa: 0.0 ± 0.0
Lys
3.412LysAla: 3.412 ± 0.465
0.31LysCys: 0.31 ± 0.146
1.861LysAsp: 1.861 ± 0.334
1.117LysGlu: 1.117 ± 0.232
1.055LysPhe: 1.055 ± 0.266
2.109LysGly: 2.109 ± 0.42
0.558LysHis: 0.558 ± 0.224
1.551LysIle: 1.551 ± 0.345
1.365LysLys: 1.365 ± 0.365
2.977LysLeu: 2.977 ± 0.464
0.372LysMet: 0.372 ± 0.15
1.241LysAsn: 1.241 ± 0.28
2.047LysPro: 2.047 ± 0.483
0.682LysGln: 0.682 ± 0.257
3.226LysArg: 3.226 ± 0.331
2.047LysSer: 2.047 ± 0.371
2.791LysThr: 2.791 ± 0.495
2.419LysVal: 2.419 ± 0.41
0.62LysTrp: 0.62 ± 0.225
0.868LysTyr: 0.868 ± 0.182
0.0LysXaa: 0.0 ± 0.0
Leu
10.111LeuAla: 10.111 ± 0.974
0.744LeuCys: 0.744 ± 0.23
5.769LeuAsp: 5.769 ± 0.744
4.342LeuGlu: 4.342 ± 0.478
2.791LeuPhe: 2.791 ± 0.494
6.141LeuGly: 6.141 ± 0.866
0.992LeuHis: 0.992 ± 0.245
3.598LeuIle: 3.598 ± 0.577
2.853LeuLys: 2.853 ± 0.538
4.342LeuLeu: 4.342 ± 0.474
1.675LeuMet: 1.675 ± 0.331
2.729LeuAsn: 2.729 ± 0.429
4.9LeuPro: 4.9 ± 0.615
3.04LeuGln: 3.04 ± 0.404
5.335LeuArg: 5.335 ± 0.829
4.59LeuSer: 4.59 ± 0.618
6.141LeuThr: 6.141 ± 0.67
4.28LeuVal: 4.28 ± 0.586
1.613LeuTrp: 1.613 ± 0.374
1.303LeuTyr: 1.303 ± 0.301
0.0LeuXaa: 0.0 ± 0.0
Met
2.543MetAla: 2.543 ± 0.418
0.31MetCys: 0.31 ± 0.145
0.558MetAsp: 0.558 ± 0.214
1.055MetGlu: 1.055 ± 0.25
0.558MetPhe: 0.558 ± 0.176
1.117MetGly: 1.117 ± 0.328
0.248MetHis: 0.248 ± 0.137
0.868MetIle: 0.868 ± 0.253
0.682MetLys: 0.682 ± 0.176
1.737MetLeu: 1.737 ± 0.309
0.248MetMet: 0.248 ± 0.135
0.558MetAsn: 0.558 ± 0.169
1.117MetPro: 1.117 ± 0.242
0.806MetGln: 0.806 ± 0.437
1.799MetArg: 1.799 ± 0.307
1.613MetSer: 1.613 ± 0.36
2.977MetThr: 2.977 ± 0.392
1.055MetVal: 1.055 ± 0.215
0.62MetTrp: 0.62 ± 0.252
0.248MetTyr: 0.248 ± 0.146
0.0MetXaa: 0.0 ± 0.0
Asn
4.032AsnAla: 4.032 ± 0.81
0.248AsnCys: 0.248 ± 0.1
1.613AsnAsp: 1.613 ± 0.292
1.365AsnGlu: 1.365 ± 0.304
0.496AsnPhe: 0.496 ± 0.211
3.536AsnGly: 3.536 ± 0.545
0.62AsnHis: 0.62 ± 0.164
0.93AsnIle: 0.93 ± 0.276
0.806AsnLys: 0.806 ± 0.197
2.419AsnLeu: 2.419 ± 0.502
0.558AsnMet: 0.558 ± 0.163
0.806AsnAsn: 0.806 ± 0.254
1.923AsnPro: 1.923 ± 0.402
1.365AsnGln: 1.365 ± 0.3
2.109AsnArg: 2.109 ± 0.342
1.675AsnSer: 1.675 ± 0.333
2.109AsnThr: 2.109 ± 0.477
1.985AsnVal: 1.985 ± 0.357
0.62AsnTrp: 0.62 ± 0.207
0.806AsnTyr: 0.806 ± 0.222
0.0AsnXaa: 0.0 ± 0.0
Pro
5.583ProAla: 5.583 ± 0.659
0.682ProCys: 0.682 ± 0.263
5.459ProAsp: 5.459 ± 0.847
4.342ProGlu: 4.342 ± 0.496
0.806ProPhe: 0.806 ± 0.182
5.025ProGly: 5.025 ± 0.529
1.303ProHis: 1.303 ± 0.289
2.853ProIle: 2.853 ± 0.53
1.427ProLys: 1.427 ± 0.274
3.536ProLeu: 3.536 ± 0.429
0.992ProMet: 0.992 ± 0.248
1.923ProAsn: 1.923 ± 0.376
2.915ProPro: 2.915 ± 0.563
1.985ProGln: 1.985 ± 0.302
3.35ProArg: 3.35 ± 0.56
3.288ProSer: 3.288 ± 0.471
3.846ProThr: 3.846 ± 0.579
4.094ProVal: 4.094 ± 0.512
1.613ProTrp: 1.613 ± 0.361
1.179ProTyr: 1.179 ± 0.252
0.0ProXaa: 0.0 ± 0.0
Gln
3.908GlnAla: 3.908 ± 0.804
0.31GlnCys: 0.31 ± 0.144
1.985GlnAsp: 1.985 ± 0.258
1.427GlnGlu: 1.427 ± 0.306
1.737GlnPhe: 1.737 ± 0.358
2.667GlnGly: 2.667 ± 0.698
0.806GlnHis: 0.806 ± 0.256
2.543GlnIle: 2.543 ± 0.437
1.613GlnLys: 1.613 ± 0.383
3.412GlnLeu: 3.412 ± 0.671
0.93GlnMet: 0.93 ± 0.221
1.489GlnAsn: 1.489 ± 0.396
1.675GlnPro: 1.675 ± 0.267
2.543GlnGln: 2.543 ± 0.785
2.543GlnArg: 2.543 ± 0.476
1.923GlnSer: 1.923 ± 0.448
1.985GlnThr: 1.985 ± 0.402
2.853GlnVal: 2.853 ± 0.485
0.992GlnTrp: 0.992 ± 0.312
0.744GlnTyr: 0.744 ± 0.199
0.0GlnXaa: 0.0 ± 0.0
Arg
7.196ArgAla: 7.196 ± 0.759
0.682ArgCys: 0.682 ± 0.239
4.156ArgAsp: 4.156 ± 0.473
3.784ArgGlu: 3.784 ± 0.469
2.543ArgPhe: 2.543 ± 0.371
5.087ArgGly: 5.087 ± 0.544
1.427ArgHis: 1.427 ± 0.297
4.156ArgIle: 4.156 ± 0.39
3.35ArgLys: 3.35 ± 0.475
5.955ArgLeu: 5.955 ± 0.548
2.977ArgMet: 2.977 ± 0.37
2.605ArgAsn: 2.605 ± 0.421
4.342ArgPro: 4.342 ± 0.589
2.977ArgGln: 2.977 ± 0.465
5.955ArgArg: 5.955 ± 0.672
3.908ArgSer: 3.908 ± 0.376
4.218ArgThr: 4.218 ± 0.618
5.087ArgVal: 5.087 ± 0.687
2.419ArgTrp: 2.419 ± 0.418
2.047ArgTyr: 2.047 ± 0.465
0.0ArgXaa: 0.0 ± 0.0
Ser
7.258SerAla: 7.258 ± 1.128
0.186SerCys: 0.186 ± 0.097
3.908SerAsp: 3.908 ± 0.407
3.102SerGlu: 3.102 ± 0.413
1.799SerPhe: 1.799 ± 0.344
6.141SerGly: 6.141 ± 0.732
0.868SerHis: 0.868 ± 0.261
2.543SerIle: 2.543 ± 0.295
1.613SerLys: 1.613 ± 0.395
3.536SerLeu: 3.536 ± 0.4
1.241SerMet: 1.241 ± 0.265
1.427SerAsn: 1.427 ± 0.299
2.605SerPro: 2.605 ± 0.381
2.295SerGln: 2.295 ± 0.427
3.412SerArg: 3.412 ± 0.563
3.288SerSer: 3.288 ± 0.446
3.288SerThr: 3.288 ± 0.4
3.412SerVal: 3.412 ± 0.347
1.241SerTrp: 1.241 ± 0.231
1.055SerTyr: 1.055 ± 0.206
0.0SerXaa: 0.0 ± 0.0
Thr
9.181ThrAla: 9.181 ± 0.98
0.682ThrCys: 0.682 ± 0.224
5.149ThrAsp: 5.149 ± 0.626
3.908ThrGlu: 3.908 ± 0.528
1.799ThrPhe: 1.799 ± 0.453
6.017ThrGly: 6.017 ± 0.601
0.868ThrHis: 0.868 ± 0.222
3.908ThrIle: 3.908 ± 0.484
1.985ThrLys: 1.985 ± 0.383
6.451ThrLeu: 6.451 ± 0.554
1.055ThrMet: 1.055 ± 0.26
1.923ThrAsn: 1.923 ± 0.301
4.156ThrPro: 4.156 ± 0.652
1.675ThrGln: 1.675 ± 0.316
4.776ThrArg: 4.776 ± 0.604
3.102ThrSer: 3.102 ± 0.472
5.211ThrThr: 5.211 ± 0.695
5.397ThrVal: 5.397 ± 0.837
1.303ThrTrp: 1.303 ± 0.278
1.179ThrTyr: 1.179 ± 0.257
0.0ThrXaa: 0.0 ± 0.0
Val
7.506ValAla: 7.506 ± 0.714
0.93ValCys: 0.93 ± 0.23
5.335ValAsp: 5.335 ± 0.602
4.094ValGlu: 4.094 ± 0.604
1.489ValPhe: 1.489 ± 0.361
5.335ValGly: 5.335 ± 0.526
0.806ValHis: 0.806 ± 0.224
3.412ValIle: 3.412 ± 0.59
1.799ValLys: 1.799 ± 0.402
4.528ValLeu: 4.528 ± 0.454
1.551ValMet: 1.551 ± 0.324
2.295ValAsn: 2.295 ± 0.376
4.962ValPro: 4.962 ± 0.561
1.923ValGln: 1.923 ± 0.527
4.714ValArg: 4.714 ± 0.583
4.404ValSer: 4.404 ± 0.581
5.769ValThr: 5.769 ± 0.651
4.838ValVal: 4.838 ± 0.646
1.179ValTrp: 1.179 ± 0.263
1.055ValTyr: 1.055 ± 0.274
0.0ValXaa: 0.0 ± 0.0
Trp
2.419TrpAla: 2.419 ± 0.5
0.434TrpCys: 0.434 ± 0.215
1.117TrpAsp: 1.117 ± 0.235
1.117TrpGlu: 1.117 ± 0.255
0.558TrpPhe: 0.558 ± 0.198
1.365TrpGly: 1.365 ± 0.235
0.806TrpHis: 0.806 ± 0.312
0.496TrpIle: 0.496 ± 0.151
0.868TrpLys: 0.868 ± 0.192
2.419TrpLeu: 2.419 ± 0.394
0.558TrpMet: 0.558 ± 0.156
0.744TrpAsn: 0.744 ± 0.228
0.868TrpPro: 0.868 ± 0.248
0.806TrpGln: 0.806 ± 0.198
1.985TrpArg: 1.985 ± 0.324
1.117TrpSer: 1.117 ± 0.238
1.241TrpThr: 1.241 ± 0.257
1.427TrpVal: 1.427 ± 0.254
0.62TrpTrp: 0.62 ± 0.246
0.62TrpTyr: 0.62 ± 0.174
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.481TyrAla: 2.481 ± 0.384
0.186TyrCys: 0.186 ± 0.099
2.047TyrAsp: 2.047 ± 0.262
1.241TyrGlu: 1.241 ± 0.272
0.682TyrPhe: 0.682 ± 0.169
1.489TyrGly: 1.489 ± 0.223
0.434TyrHis: 0.434 ± 0.239
0.744TyrIle: 0.744 ± 0.2
0.496TyrLys: 0.496 ± 0.153
1.923TyrLeu: 1.923 ± 0.313
0.434TyrMet: 0.434 ± 0.188
0.558TyrAsn: 0.558 ± 0.172
0.992TyrPro: 0.992 ± 0.301
0.682TyrGln: 0.682 ± 0.201
2.109TyrArg: 2.109 ± 0.37
1.427TyrSer: 1.427 ± 0.292
1.365TyrThr: 1.365 ± 0.293
1.737TyrVal: 1.737 ± 0.345
0.434TyrTrp: 0.434 ± 0.173
0.496TyrTyr: 0.496 ± 0.166
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (16122 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski