Amino acid dipepetide frequency for Gordonia Phage Zitch

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.944AlaAla: 18.944 ± 1.942
0.864AlaCys: 0.864 ± 0.224
8.582AlaAsp: 8.582 ± 0.888
9.445AlaGlu: 9.445 ± 0.711
2.753AlaPhe: 2.753 ± 0.574
10.255AlaGly: 10.255 ± 1.172
2.105AlaHis: 2.105 ± 0.375
6.315AlaIle: 6.315 ± 0.759
3.724AlaLys: 3.724 ± 0.454
9.283AlaLeu: 9.283 ± 0.977
2.105AlaMet: 2.105 ± 0.319
3.454AlaAsn: 3.454 ± 0.526
5.775AlaPro: 5.775 ± 0.605
5.073AlaGln: 5.073 ± 0.572
8.15AlaArg: 8.15 ± 0.701
4.965AlaSer: 4.965 ± 0.491
8.096AlaThr: 8.096 ± 0.767
8.528AlaVal: 8.528 ± 0.731
2.213AlaTrp: 2.213 ± 0.31
2.861AlaTyr: 2.861 ± 0.336
0.0AlaXaa: 0.0 ± 0.0
Cys
0.918CysAla: 0.918 ± 0.232
0.0CysCys: 0.0 ± 0.0
0.81CysAsp: 0.81 ± 0.256
0.27CysGlu: 0.27 ± 0.135
0.216CysPhe: 0.216 ± 0.102
1.295CysGly: 1.295 ± 0.37
0.108CysHis: 0.108 ± 0.07
0.378CysIle: 0.378 ± 0.189
0.27CysLys: 0.27 ± 0.114
0.324CysLeu: 0.324 ± 0.114
0.054CysMet: 0.054 ± 0.054
0.162CysAsn: 0.162 ± 0.102
0.918CysPro: 0.918 ± 0.248
0.162CysGln: 0.162 ± 0.108
1.133CysArg: 1.133 ± 0.296
0.702CysSer: 0.702 ± 0.184
0.702CysThr: 0.702 ± 0.232
0.27CysVal: 0.27 ± 0.111
0.216CysTrp: 0.216 ± 0.111
0.27CysTyr: 0.27 ± 0.117
0.0CysXaa: 0.0 ± 0.0
Asp
8.096AspAla: 8.096 ± 0.716
0.648AspCys: 0.648 ± 0.164
7.61AspAsp: 7.61 ± 1.358
6.585AspGlu: 6.585 ± 1.09
0.864AspPhe: 0.864 ± 0.269
7.232AspGly: 7.232 ± 0.602
1.511AspHis: 1.511 ± 0.377
1.781AspIle: 1.781 ± 0.299
1.673AspLys: 1.673 ± 0.306
5.559AspLeu: 5.559 ± 0.564
1.889AspMet: 1.889 ± 0.337
2.105AspAsn: 2.105 ± 0.279
5.937AspPro: 5.937 ± 0.594
2.213AspGln: 2.213 ± 0.379
5.019AspArg: 5.019 ± 0.65
2.537AspSer: 2.537 ± 0.437
5.019AspThr: 5.019 ± 0.575
5.073AspVal: 5.073 ± 0.738
1.835AspTrp: 1.835 ± 0.286
1.511AspTyr: 1.511 ± 0.356
0.0AspXaa: 0.0 ± 0.0
Glu
5.775GluAla: 5.775 ± 0.691
0.432GluCys: 0.432 ± 0.211
2.051GluAsp: 2.051 ± 0.413
1.133GluGlu: 1.133 ± 0.276
2.537GluPhe: 2.537 ± 0.321
3.292GluGly: 3.292 ± 0.392
1.781GluHis: 1.781 ± 0.405
2.807GluIle: 2.807 ± 0.385
1.673GluLys: 1.673 ± 0.406
5.505GluLeu: 5.505 ± 0.689
1.457GluMet: 1.457 ± 0.295
1.187GluAsn: 1.187 ± 0.22
3.238GluPro: 3.238 ± 0.523
3.994GluGln: 3.994 ± 0.536
4.264GluArg: 4.264 ± 0.619
2.968GluSer: 2.968 ± 0.48
3.238GluThr: 3.238 ± 0.396
4.372GluVal: 4.372 ± 0.579
1.727GluTrp: 1.727 ± 0.278
1.727GluTyr: 1.727 ± 0.343
0.0GluXaa: 0.0 ± 0.0
Phe
3.292PheAla: 3.292 ± 0.5
0.324PheCys: 0.324 ± 0.161
2.267PheAsp: 2.267 ± 0.348
1.025PheGlu: 1.025 ± 0.306
0.54PhePhe: 0.54 ± 0.293
2.429PheGly: 2.429 ± 0.369
0.378PheHis: 0.378 ± 0.135
1.241PheIle: 1.241 ± 0.247
0.648PheLys: 0.648 ± 0.224
1.403PheLeu: 1.403 ± 0.219
0.702PheMet: 0.702 ± 0.207
0.702PheAsn: 0.702 ± 0.25
1.295PhePro: 1.295 ± 0.289
0.702PheGln: 0.702 ± 0.161
1.457PheArg: 1.457 ± 0.283
0.756PheSer: 0.756 ± 0.225
1.403PheThr: 1.403 ± 0.283
1.025PheVal: 1.025 ± 0.173
0.432PheTrp: 0.432 ± 0.15
0.378PheTyr: 0.378 ± 0.147
0.0PheXaa: 0.0 ± 0.0
Gly
9.661GlyAla: 9.661 ± 1.228
0.702GlyCys: 0.702 ± 0.24
6.099GlyAsp: 6.099 ± 0.703
4.372GlyGlu: 4.372 ± 0.486
1.457GlyPhe: 1.457 ± 0.421
8.15GlyGly: 8.15 ± 1.262
1.943GlyHis: 1.943 ± 0.338
4.372GlyIle: 4.372 ± 0.522
2.915GlyLys: 2.915 ± 0.396
5.127GlyLeu: 5.127 ± 0.555
2.375GlyMet: 2.375 ± 0.378
2.645GlyAsn: 2.645 ± 0.513
3.886GlyPro: 3.886 ± 0.49
3.832GlyGln: 3.832 ± 0.448
6.801GlyArg: 6.801 ± 0.528
4.21GlySer: 4.21 ± 0.501
5.937GlyThr: 5.937 ± 0.768
6.153GlyVal: 6.153 ± 0.559
1.565GlyTrp: 1.565 ± 0.306
1.781GlyTyr: 1.781 ± 0.342
0.0GlyXaa: 0.0 ± 0.0
His
2.213HisAla: 2.213 ± 0.332
0.108HisCys: 0.108 ± 0.078
1.619HisAsp: 1.619 ± 0.367
0.972HisGlu: 0.972 ± 0.26
0.378HisPhe: 0.378 ± 0.133
1.673HisGly: 1.673 ± 0.324
0.648HisHis: 0.648 ± 0.252
0.918HisIle: 0.918 ± 0.224
0.27HisLys: 0.27 ± 0.131
1.781HisLeu: 1.781 ± 0.376
0.324HisMet: 0.324 ± 0.12
0.27HisAsn: 0.27 ± 0.114
1.619HisPro: 1.619 ± 0.37
0.972HisGln: 0.972 ± 0.192
2.699HisArg: 2.699 ± 0.498
0.324HisSer: 0.324 ± 0.12
1.295HisThr: 1.295 ± 0.307
1.727HisVal: 1.727 ± 0.365
0.432HisTrp: 0.432 ± 0.163
0.27HisTyr: 0.27 ± 0.109
0.0HisXaa: 0.0 ± 0.0
Ile
6.099IleAla: 6.099 ± 0.524
0.27IleCys: 0.27 ± 0.129
4.102IleAsp: 4.102 ± 0.435
3.292IleGlu: 3.292 ± 0.461
0.324IlePhe: 0.324 ± 0.138
5.343IleGly: 5.343 ± 0.871
0.81IleHis: 0.81 ± 0.182
1.565IleIle: 1.565 ± 0.425
1.295IleLys: 1.295 ± 0.52
2.807IleLeu: 2.807 ± 0.329
0.81IleMet: 0.81 ± 0.154
1.943IleAsn: 1.943 ± 0.362
2.645IlePro: 2.645 ± 0.329
1.187IleGln: 1.187 ± 0.313
2.861IleArg: 2.861 ± 0.382
1.835IleSer: 1.835 ± 0.371
3.778IleThr: 3.778 ± 0.402
3.67IleVal: 3.67 ± 0.381
0.486IleTrp: 0.486 ± 0.167
1.079IleTyr: 1.079 ± 0.207
0.0IleXaa: 0.0 ± 0.0
Lys
2.591LysAla: 2.591 ± 0.334
0.486LysCys: 0.486 ± 0.163
1.619LysAsp: 1.619 ± 0.359
0.648LysGlu: 0.648 ± 0.188
0.972LysPhe: 0.972 ± 0.31
2.105LysGly: 2.105 ± 0.376
0.486LysHis: 0.486 ± 0.148
1.187LysIle: 1.187 ± 0.26
0.486LysLys: 0.486 ± 0.187
2.861LysLeu: 2.861 ± 0.39
0.81LysMet: 0.81 ± 0.231
1.133LysAsn: 1.133 ± 0.264
1.673LysPro: 1.673 ± 0.283
0.864LysGln: 0.864 ± 0.213
2.321LysArg: 2.321 ± 0.365
1.349LysSer: 1.349 ± 0.276
1.403LysThr: 1.403 ± 0.283
2.159LysVal: 2.159 ± 0.416
0.702LysTrp: 0.702 ± 0.169
0.81LysTyr: 0.81 ± 0.198
0.0LysXaa: 0.0 ± 0.0
Leu
9.445LeuAla: 9.445 ± 0.743
0.432LeuCys: 0.432 ± 0.153
5.613LeuAsp: 5.613 ± 0.576
5.019LeuGlu: 5.019 ± 0.578
1.727LeuPhe: 1.727 ± 0.385
7.34LeuGly: 7.34 ± 0.999
1.241LeuHis: 1.241 ± 0.298
3.832LeuIle: 3.832 ± 0.41
1.511LeuLys: 1.511 ± 0.308
5.775LeuLeu: 5.775 ± 0.569
1.241LeuMet: 1.241 ± 0.286
2.321LeuAsn: 2.321 ± 0.313
5.127LeuPro: 5.127 ± 0.502
2.051LeuGln: 2.051 ± 0.414
5.451LeuArg: 5.451 ± 0.515
3.616LeuSer: 3.616 ± 0.535
5.181LeuThr: 5.181 ± 0.58
6.099LeuVal: 6.099 ± 0.522
1.889LeuTrp: 1.889 ± 0.354
1.133LeuTyr: 1.133 ± 0.26
0.0LeuXaa: 0.0 ± 0.0
Met
3.076MetAla: 3.076 ± 0.422
0.054MetCys: 0.054 ± 0.056
0.81MetAsp: 0.81 ± 0.178
0.594MetGlu: 0.594 ± 0.167
0.486MetPhe: 0.486 ± 0.193
1.079MetGly: 1.079 ± 0.251
0.324MetHis: 0.324 ± 0.128
1.187MetIle: 1.187 ± 0.218
0.378MetLys: 0.378 ± 0.129
1.997MetLeu: 1.997 ± 0.31
0.216MetMet: 0.216 ± 0.098
0.864MetAsn: 0.864 ± 0.245
2.483MetPro: 2.483 ± 0.394
0.648MetGln: 0.648 ± 0.161
1.943MetArg: 1.943 ± 0.398
1.133MetSer: 1.133 ± 0.284
3.886MetThr: 3.886 ± 0.407
1.133MetVal: 1.133 ± 0.252
0.432MetTrp: 0.432 ± 0.146
0.216MetTyr: 0.216 ± 0.109
0.0MetXaa: 0.0 ± 0.0
Asn
3.616AsnAla: 3.616 ± 0.515
0.378AsnCys: 0.378 ± 0.13
2.051AsnAsp: 2.051 ± 0.373
1.781AsnGlu: 1.781 ± 0.254
0.27AsnPhe: 0.27 ± 0.134
3.292AsnGly: 3.292 ± 0.436
0.27AsnHis: 0.27 ± 0.122
0.864AsnIle: 0.864 ± 0.303
0.864AsnLys: 0.864 ± 0.272
2.375AsnLeu: 2.375 ± 0.336
0.432AsnMet: 0.432 ± 0.186
0.702AsnAsn: 0.702 ± 0.231
2.699AsnPro: 2.699 ± 0.414
0.486AsnGln: 0.486 ± 0.171
1.619AsnArg: 1.619 ± 0.272
1.241AsnSer: 1.241 ± 0.323
1.673AsnThr: 1.673 ± 0.301
1.619AsnVal: 1.619 ± 0.321
0.648AsnTrp: 0.648 ± 0.205
0.648AsnTyr: 0.648 ± 0.18
0.0AsnXaa: 0.0 ± 0.0
Pro
8.366ProAla: 8.366 ± 0.61
0.432ProCys: 0.432 ± 0.163
6.693ProAsp: 6.693 ± 0.882
3.292ProGlu: 3.292 ± 0.492
1.619ProPhe: 1.619 ± 0.278
4.965ProGly: 4.965 ± 0.463
1.187ProHis: 1.187 ± 0.28
2.537ProIle: 2.537 ± 0.377
1.889ProLys: 1.889 ± 0.348
3.346ProLeu: 3.346 ± 0.433
1.133ProMet: 1.133 ± 0.273
1.403ProAsn: 1.403 ± 0.248
5.721ProPro: 5.721 ± 1.065
1.727ProGln: 1.727 ± 0.352
4.102ProArg: 4.102 ± 0.479
2.645ProSer: 2.645 ± 0.371
5.721ProThr: 5.721 ± 0.906
5.343ProVal: 5.343 ± 0.52
1.403ProTrp: 1.403 ± 0.295
1.403ProTyr: 1.403 ± 0.263
0.0ProXaa: 0.0 ± 0.0
Gln
4.642GlnAla: 4.642 ± 0.486
0.324GlnCys: 0.324 ± 0.135
1.673GlnAsp: 1.673 ± 0.287
1.187GlnGlu: 1.187 ± 0.265
1.025GlnPhe: 1.025 ± 0.281
2.321GlnGly: 2.321 ± 0.28
1.187GlnHis: 1.187 ± 0.316
2.159GlnIle: 2.159 ± 0.296
0.81GlnLys: 0.81 ± 0.217
4.102GlnLeu: 4.102 ± 0.445
1.079GlnMet: 1.079 ± 0.238
0.972GlnAsn: 0.972 ± 0.225
2.375GlnPro: 2.375 ± 0.375
2.591GlnGln: 2.591 ± 0.436
2.645GlnArg: 2.645 ± 0.43
2.051GlnSer: 2.051 ± 0.622
2.159GlnThr: 2.159 ± 0.33
2.645GlnVal: 2.645 ± 0.359
0.864GlnTrp: 0.864 ± 0.162
0.864GlnTyr: 0.864 ± 0.199
0.0GlnXaa: 0.0 ± 0.0
Arg
9.067ArgAla: 9.067 ± 0.767
0.756ArgCys: 0.756 ± 0.203
4.804ArgAsp: 4.804 ± 0.511
4.156ArgGlu: 4.156 ± 0.656
1.295ArgPhe: 1.295 ± 0.204
4.911ArgGly: 4.911 ± 0.579
1.943ArgHis: 1.943 ± 0.447
3.994ArgIle: 3.994 ± 0.449
2.537ArgLys: 2.537 ± 0.353
5.883ArgLeu: 5.883 ± 0.546
2.321ArgMet: 2.321 ± 0.327
2.159ArgAsn: 2.159 ± 0.374
3.508ArgPro: 3.508 ± 0.484
3.076ArgGln: 3.076 ± 0.404
9.067ArgArg: 9.067 ± 1.042
2.915ArgSer: 2.915 ± 0.423
4.858ArgThr: 4.858 ± 0.542
5.019ArgVal: 5.019 ± 0.704
1.565ArgTrp: 1.565 ± 0.332
2.159ArgTyr: 2.159 ± 0.388
0.0ArgXaa: 0.0 ± 0.0
Ser
4.696SerAla: 4.696 ± 0.918
0.81SerCys: 0.81 ± 0.255
2.915SerAsp: 2.915 ± 0.388
1.727SerGlu: 1.727 ± 0.312
1.133SerPhe: 1.133 ± 0.301
4.48SerGly: 4.48 ± 0.648
0.81SerHis: 0.81 ± 0.231
2.267SerIle: 2.267 ± 0.369
1.079SerLys: 1.079 ± 0.214
3.076SerLeu: 3.076 ± 0.411
1.565SerMet: 1.565 ± 0.294
1.187SerAsn: 1.187 ± 0.247
2.753SerPro: 2.753 ± 0.457
1.619SerGln: 1.619 ± 0.327
3.076SerArg: 3.076 ± 0.458
3.076SerSer: 3.076 ± 0.413
3.562SerThr: 3.562 ± 0.419
3.13SerVal: 3.13 ± 0.433
1.295SerTrp: 1.295 ± 0.251
0.864SerTyr: 0.864 ± 0.209
0.0SerXaa: 0.0 ± 0.0
Thr
9.553ThrAla: 9.553 ± 0.957
0.81ThrCys: 0.81 ± 0.224
6.315ThrAsp: 6.315 ± 0.688
3.562ThrGlu: 3.562 ± 0.468
1.727ThrPhe: 1.727 ± 0.284
5.397ThrGly: 5.397 ± 0.656
1.241ThrHis: 1.241 ± 0.296
3.94ThrIle: 3.94 ± 0.583
1.565ThrLys: 1.565 ± 0.256
5.613ThrLeu: 5.613 ± 0.773
2.213ThrMet: 2.213 ± 0.361
1.349ThrAsn: 1.349 ± 0.264
5.883ThrPro: 5.883 ± 0.665
1.565ThrGln: 1.565 ± 0.269
4.318ThrArg: 4.318 ± 0.566
3.238ThrSer: 3.238 ± 0.391
4.696ThrThr: 4.696 ± 0.675
6.153ThrVal: 6.153 ± 0.767
1.133ThrTrp: 1.133 ± 0.229
1.241ThrTyr: 1.241 ± 0.29
0.0ThrXaa: 0.0 ± 0.0
Val
7.826ValAla: 7.826 ± 0.786
1.025ValCys: 1.025 ± 0.36
6.261ValAsp: 6.261 ± 0.717
4.426ValGlu: 4.426 ± 0.516
1.673ValPhe: 1.673 ± 0.345
5.829ValGly: 5.829 ± 0.607
1.673ValHis: 1.673 ± 0.221
3.4ValIle: 3.4 ± 0.48
1.457ValLys: 1.457 ± 0.327
5.235ValLeu: 5.235 ± 0.454
1.079ValMet: 1.079 ± 0.233
1.997ValAsn: 1.997 ± 0.342
4.534ValPro: 4.534 ± 0.469
3.13ValGln: 3.13 ± 0.401
5.019ValArg: 5.019 ± 0.539
3.076ValSer: 3.076 ± 0.401
5.883ValThr: 5.883 ± 0.74
5.505ValVal: 5.505 ± 0.5
1.565ValTrp: 1.565 ± 0.294
1.997ValTyr: 1.997 ± 0.374
0.0ValXaa: 0.0 ± 0.0
Trp
2.968TrpAla: 2.968 ± 0.386
0.27TrpCys: 0.27 ± 0.112
0.81TrpAsp: 0.81 ± 0.177
0.702TrpGlu: 0.702 ± 0.176
0.864TrpPhe: 0.864 ± 0.192
0.702TrpGly: 0.702 ± 0.208
0.594TrpHis: 0.594 ± 0.161
0.81TrpIle: 0.81 ± 0.218
1.079TrpLys: 1.079 ± 0.263
2.051TrpLeu: 2.051 ± 0.294
0.486TrpMet: 0.486 ± 0.178
0.432TrpAsn: 0.432 ± 0.26
1.457TrpPro: 1.457 ± 0.23
1.025TrpGln: 1.025 ± 0.217
1.619TrpArg: 1.619 ± 0.355
1.565TrpSer: 1.565 ± 0.339
1.565TrpThr: 1.565 ± 0.249
1.457TrpVal: 1.457 ± 0.299
0.486TrpTrp: 0.486 ± 0.178
0.378TrpTyr: 0.378 ± 0.15
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.699TyrAla: 2.699 ± 0.475
0.054TyrCys: 0.054 ± 0.057
1.997TyrAsp: 1.997 ± 0.417
0.972TyrGlu: 0.972 ± 0.247
0.54TyrPhe: 0.54 ± 0.2
1.889TyrGly: 1.889 ± 0.275
0.27TyrHis: 0.27 ± 0.117
0.648TyrIle: 0.648 ± 0.176
0.54TyrLys: 0.54 ± 0.176
1.943TyrLeu: 1.943 ± 0.32
0.594TyrMet: 0.594 ± 0.177
0.54TyrAsn: 0.54 ± 0.183
1.403TyrPro: 1.403 ± 0.334
0.756TyrGln: 0.756 ± 0.18
2.429TyrArg: 2.429 ± 0.381
0.918TyrSer: 0.918 ± 0.191
1.457TyrThr: 1.457 ± 0.29
1.511TyrVal: 1.511 ± 0.305
0.432TyrTrp: 0.432 ± 0.129
0.486TyrTyr: 0.486 ± 0.175
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 91 proteins (18529 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski