Amino acid dipepetide frequency for Gordonia phage Chikenjars

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.06AlaAla: 9.06 ± 1.535
0.312AlaCys: 0.312 ± 0.129
4.947AlaAsp: 4.947 ± 0.488
5.78AlaGlu: 5.78 ± 0.506
3.124AlaPhe: 3.124 ± 0.4
6.821AlaGly: 6.821 ± 0.952
1.666AlaHis: 1.666 ± 0.347
5.519AlaIle: 5.519 ± 0.559
5.832AlaLys: 5.832 ± 0.677
7.394AlaLeu: 7.394 ± 1.23
2.603AlaMet: 2.603 ± 0.451
4.114AlaAsn: 4.114 ± 0.597
1.614AlaPro: 1.614 ± 0.266
2.864AlaGln: 2.864 ± 0.347
4.426AlaArg: 4.426 ± 0.458
5.207AlaSer: 5.207 ± 0.519
5.207AlaThr: 5.207 ± 0.59
7.029AlaVal: 7.029 ± 0.929
0.781AlaTrp: 0.781 ± 0.206
2.291AlaTyr: 2.291 ± 0.33
0.0AlaXaa: 0.0 ± 0.0
Cys
0.417CysAla: 0.417 ± 0.201
0.052CysCys: 0.052 ± 0.048
0.312CysAsp: 0.312 ± 0.096
0.417CysGlu: 0.417 ± 0.192
0.052CysPhe: 0.052 ± 0.045
0.573CysGly: 0.573 ± 0.196
0.104CysHis: 0.104 ± 0.068
0.469CysIle: 0.469 ± 0.166
0.208CysLys: 0.208 ± 0.106
0.729CysLeu: 0.729 ± 0.21
0.0CysMet: 0.0 ± 0.0
0.156CysAsn: 0.156 ± 0.074
0.208CysPro: 0.208 ± 0.095
0.156CysGln: 0.156 ± 0.092
0.312CysArg: 0.312 ± 0.148
0.469CysSer: 0.469 ± 0.161
0.521CysThr: 0.521 ± 0.181
0.469CysVal: 0.469 ± 0.153
0.208CysTrp: 0.208 ± 0.103
0.26CysTyr: 0.26 ± 0.122
0.0CysXaa: 0.0 ± 0.0
Asp
5.467AspAla: 5.467 ± 0.585
0.208AspCys: 0.208 ± 0.104
3.957AspAsp: 3.957 ± 0.63
5.051AspGlu: 5.051 ± 0.524
2.812AspPhe: 2.812 ± 0.369
4.842AspGly: 4.842 ± 0.63
1.093AspHis: 1.093 ± 0.206
3.385AspIle: 3.385 ± 0.444
3.228AspLys: 3.228 ± 0.499
4.842AspLeu: 4.842 ± 0.816
1.77AspMet: 1.77 ± 0.329
2.083AspAsn: 2.083 ± 0.301
3.228AspPro: 3.228 ± 0.451
2.291AspGln: 2.291 ± 0.308
3.332AspArg: 3.332 ± 0.362
3.749AspSer: 3.749 ± 0.325
2.812AspThr: 2.812 ± 0.348
3.645AspVal: 3.645 ± 0.366
1.146AspTrp: 1.146 ± 0.209
1.77AspTyr: 1.77 ± 0.304
0.0AspXaa: 0.0 ± 0.0
Glu
5.519GluAla: 5.519 ± 0.473
0.417GluCys: 0.417 ± 0.152
4.426GluAsp: 4.426 ± 0.577
4.842GluGlu: 4.842 ± 0.602
2.031GluPhe: 2.031 ± 0.34
4.947GluGly: 4.947 ± 0.581
1.406GluHis: 1.406 ± 0.323
4.53GluIle: 4.53 ± 0.471
4.947GluLys: 4.947 ± 0.521
6.405GluLeu: 6.405 ± 0.53
2.291GluMet: 2.291 ± 0.298
2.864GluAsn: 2.864 ± 0.409
2.447GluPro: 2.447 ± 0.451
3.28GluGln: 3.28 ± 0.431
2.864GluArg: 2.864 ± 0.466
3.853GluSer: 3.853 ± 0.395
3.697GluThr: 3.697 ± 0.44
4.686GluVal: 4.686 ± 0.377
1.302GluTrp: 1.302 ± 0.224
2.447GluTyr: 2.447 ± 0.328
0.0GluXaa: 0.0 ± 0.0
Phe
2.76PheAla: 2.76 ± 0.385
0.521PheCys: 0.521 ± 0.179
3.437PheAsp: 3.437 ± 0.4
3.228PheGlu: 3.228 ± 0.457
1.458PhePhe: 1.458 ± 0.284
2.551PheGly: 2.551 ± 0.372
0.417PheHis: 0.417 ± 0.143
1.77PheIle: 1.77 ± 0.35
1.77PheLys: 1.77 ± 0.325
2.708PheLeu: 2.708 ± 0.365
1.198PheMet: 1.198 ± 0.262
1.666PheAsn: 1.666 ± 0.311
1.354PhePro: 1.354 ± 0.318
0.677PheGln: 0.677 ± 0.222
1.77PheArg: 1.77 ± 0.274
2.239PheSer: 2.239 ± 0.449
2.343PheThr: 2.343 ± 0.38
2.187PheVal: 2.187 ± 0.407
0.26PheTrp: 0.26 ± 0.104
0.885PheTyr: 0.885 ± 0.231
0.0PheXaa: 0.0 ± 0.0
Gly
6.3GlyAla: 6.3 ± 0.932
0.521GlyCys: 0.521 ± 0.22
5.051GlyAsp: 5.051 ± 0.432
5.103GlyGlu: 5.103 ± 0.507
2.916GlyPhe: 2.916 ± 0.46
5.988GlyGly: 5.988 ± 0.741
1.406GlyHis: 1.406 ± 0.316
4.79GlyIle: 4.79 ± 1.015
3.437GlyLys: 3.437 ± 0.402
5.832GlyLeu: 5.832 ± 1.074
2.812GlyMet: 2.812 ± 0.302
3.072GlyAsn: 3.072 ± 0.405
2.864GlyPro: 2.864 ± 0.376
2.239GlyGln: 2.239 ± 0.373
3.072GlyArg: 3.072 ± 0.368
5.676GlySer: 5.676 ± 0.637
5.103GlyThr: 5.103 ± 0.467
6.196GlyVal: 6.196 ± 0.707
0.937GlyTrp: 0.937 ± 0.225
2.499GlyTyr: 2.499 ± 0.366
0.0GlyXaa: 0.0 ± 0.0
His
1.51HisAla: 1.51 ± 0.27
0.208HisCys: 0.208 ± 0.101
0.989HisAsp: 0.989 ± 0.234
1.614HisGlu: 1.614 ± 0.314
0.469HisPhe: 0.469 ± 0.163
1.718HisGly: 1.718 ± 0.259
0.729HisHis: 0.729 ± 0.233
1.875HisIle: 1.875 ± 0.36
0.833HisLys: 0.833 ± 0.207
1.614HisLeu: 1.614 ± 0.29
0.469HisMet: 0.469 ± 0.158
0.781HisAsn: 0.781 ± 0.2
1.041HisPro: 1.041 ± 0.244
0.677HisGln: 0.677 ± 0.18
1.718HisArg: 1.718 ± 0.343
1.146HisSer: 1.146 ± 0.232
0.729HisThr: 0.729 ± 0.199
1.354HisVal: 1.354 ± 0.282
0.104HisTrp: 0.104 ± 0.071
1.302HisTyr: 1.302 ± 0.237
0.0HisXaa: 0.0 ± 0.0
Ile
5.103IleAla: 5.103 ± 0.555
0.469IleCys: 0.469 ± 0.167
3.645IleAsp: 3.645 ± 0.409
4.478IleGlu: 4.478 ± 0.597
2.135IlePhe: 2.135 ± 0.318
5.103IleGly: 5.103 ± 0.824
1.562IleHis: 1.562 ± 0.313
3.437IleIle: 3.437 ± 0.559
3.28IleLys: 3.28 ± 0.43
5.207IleLeu: 5.207 ± 0.576
0.833IleMet: 0.833 ± 0.236
2.031IleAsn: 2.031 ± 0.383
2.291IlePro: 2.291 ± 0.277
1.822IleGln: 1.822 ± 0.303
2.916IleArg: 2.916 ± 0.456
3.385IleSer: 3.385 ± 0.501
3.645IleThr: 3.645 ± 0.45
4.895IleVal: 4.895 ± 0.478
0.677IleTrp: 0.677 ± 0.145
1.614IleTyr: 1.614 ± 0.247
0.0IleXaa: 0.0 ± 0.0
Lys
5.519LysAla: 5.519 ± 0.489
0.104LysCys: 0.104 ± 0.078
3.437LysAsp: 3.437 ± 0.537
4.218LysGlu: 4.218 ± 0.412
1.927LysPhe: 1.927 ± 0.362
4.114LysGly: 4.114 ± 0.508
1.718LysHis: 1.718 ± 0.307
3.228LysIle: 3.228 ± 0.433
4.009LysLys: 4.009 ± 0.63
4.426LysLeu: 4.426 ± 0.552
1.562LysMet: 1.562 ± 0.256
2.708LysAsn: 2.708 ± 0.338
3.02LysPro: 3.02 ± 0.476
2.343LysGln: 2.343 ± 0.378
2.603LysArg: 2.603 ± 0.36
3.541LysSer: 3.541 ± 0.371
3.593LysThr: 3.593 ± 0.395
4.686LysVal: 4.686 ± 0.539
0.833LysTrp: 0.833 ± 0.22
1.718LysTyr: 1.718 ± 0.322
0.0LysXaa: 0.0 ± 0.0
Leu
7.446LeuAla: 7.446 ± 1.423
0.625LeuCys: 0.625 ± 0.208
4.947LeuAsp: 4.947 ± 0.565
4.166LeuGlu: 4.166 ± 0.464
2.812LeuPhe: 2.812 ± 0.416
6.196LeuGly: 6.196 ± 1.036
1.458LeuHis: 1.458 ± 0.323
4.166LeuIle: 4.166 ± 0.548
5.624LeuLys: 5.624 ± 0.509
4.947LeuLeu: 4.947 ± 0.516
2.395LeuMet: 2.395 ± 0.392
4.842LeuAsn: 4.842 ± 0.547
2.499LeuPro: 2.499 ± 0.436
2.343LeuGln: 2.343 ± 0.334
4.114LeuArg: 4.114 ± 0.416
6.092LeuSer: 6.092 ± 0.649
6.561LeuThr: 6.561 ± 0.538
5.832LeuVal: 5.832 ± 0.617
1.198LeuTrp: 1.198 ± 0.226
2.708LeuTyr: 2.708 ± 0.443
0.0LeuXaa: 0.0 ± 0.0
Met
2.76MetAla: 2.76 ± 0.415
0.156MetCys: 0.156 ± 0.086
1.614MetAsp: 1.614 ± 0.228
1.77MetGlu: 1.77 ± 0.306
0.885MetPhe: 0.885 ± 0.176
1.875MetGly: 1.875 ± 0.241
0.521MetHis: 0.521 ± 0.152
1.822MetIle: 1.822 ± 0.253
1.354MetLys: 1.354 ± 0.217
2.499MetLeu: 2.499 ± 0.367
0.833MetMet: 0.833 ± 0.198
1.041MetAsn: 1.041 ± 0.235
1.146MetPro: 1.146 ± 0.28
0.937MetGln: 0.937 ± 0.183
1.458MetArg: 1.458 ± 0.278
2.343MetSer: 2.343 ± 0.361
2.499MetThr: 2.499 ± 0.304
1.458MetVal: 1.458 ± 0.269
0.156MetTrp: 0.156 ± 0.088
0.469MetTyr: 0.469 ± 0.162
0.0MetXaa: 0.0 ± 0.0
Asn
3.385AsnAla: 3.385 ± 0.408
0.417AsnCys: 0.417 ± 0.178
2.291AsnAsp: 2.291 ± 0.362
3.072AsnGlu: 3.072 ± 0.339
1.77AsnPhe: 1.77 ± 0.297
3.176AsnGly: 3.176 ± 0.492
1.25AsnHis: 1.25 ± 0.241
1.77AsnIle: 1.77 ± 0.268
2.447AsnLys: 2.447 ± 0.449
3.801AsnLeu: 3.801 ± 0.464
1.093AsnMet: 1.093 ± 0.221
1.822AsnAsn: 1.822 ± 0.349
2.968AsnPro: 2.968 ± 0.462
1.77AsnGln: 1.77 ± 0.311
2.812AsnArg: 2.812 ± 0.31
3.28AsnSer: 3.28 ± 0.505
2.656AsnThr: 2.656 ± 0.332
2.447AsnVal: 2.447 ± 0.392
0.417AsnTrp: 0.417 ± 0.14
1.302AsnTyr: 1.302 ± 0.261
0.0AsnXaa: 0.0 ± 0.0
Pro
2.864ProAla: 2.864 ± 0.346
0.312ProCys: 0.312 ± 0.124
2.812ProAsp: 2.812 ± 0.439
3.332ProGlu: 3.332 ± 0.473
1.614ProPhe: 1.614 ± 0.233
3.385ProGly: 3.385 ± 0.45
0.781ProHis: 0.781 ± 0.211
1.927ProIle: 1.927 ± 0.29
2.395ProLys: 2.395 ± 0.336
2.395ProLeu: 2.395 ± 0.331
1.146ProMet: 1.146 ± 0.234
1.875ProAsn: 1.875 ± 0.33
2.603ProPro: 2.603 ± 0.49
1.875ProGln: 1.875 ± 0.3
1.822ProArg: 1.822 ± 0.315
2.968ProSer: 2.968 ± 0.479
2.76ProThr: 2.76 ± 0.457
3.176ProVal: 3.176 ± 0.355
0.469ProTrp: 0.469 ± 0.134
1.562ProTyr: 1.562 ± 0.3
0.0ProXaa: 0.0 ± 0.0
Gln
3.072GlnAla: 3.072 ± 0.517
0.104GlnCys: 0.104 ± 0.077
1.562GlnAsp: 1.562 ± 0.281
1.718GlnGlu: 1.718 ± 0.271
0.989GlnPhe: 0.989 ± 0.229
2.239GlnGly: 2.239 ± 0.353
1.093GlnHis: 1.093 ± 0.233
2.031GlnIle: 2.031 ± 0.299
1.979GlnLys: 1.979 ± 0.365
3.957GlnLeu: 3.957 ± 0.432
0.937GlnMet: 0.937 ± 0.192
1.562GlnAsn: 1.562 ± 0.251
1.354GlnPro: 1.354 ± 0.45
1.25GlnGln: 1.25 ± 0.208
1.927GlnArg: 1.927 ± 0.306
2.135GlnSer: 2.135 ± 0.362
1.927GlnThr: 1.927 ± 0.254
3.02GlnVal: 3.02 ± 0.357
0.417GlnTrp: 0.417 ± 0.185
1.302GlnTyr: 1.302 ± 0.274
0.0GlnXaa: 0.0 ± 0.0
Arg
3.905ArgAla: 3.905 ± 0.488
0.26ArgCys: 0.26 ± 0.097
2.343ArgAsp: 2.343 ± 0.255
3.489ArgGlu: 3.489 ± 0.547
1.666ArgPhe: 1.666 ± 0.262
2.968ArgGly: 2.968 ± 0.392
1.198ArgHis: 1.198 ± 0.244
2.551ArgIle: 2.551 ± 0.348
3.332ArgLys: 3.332 ± 0.468
4.322ArgLeu: 4.322 ± 0.58
1.25ArgMet: 1.25 ± 0.211
2.447ArgAsn: 2.447 ± 0.34
2.76ArgPro: 2.76 ± 0.439
2.239ArgGln: 2.239 ± 0.481
2.968ArgArg: 2.968 ± 0.443
2.864ArgSer: 2.864 ± 0.484
3.124ArgThr: 3.124 ± 0.418
3.228ArgVal: 3.228 ± 0.379
0.729ArgTrp: 0.729 ± 0.202
1.77ArgTyr: 1.77 ± 0.26
0.0ArgXaa: 0.0 ± 0.0
Ser
5.103SerAla: 5.103 ± 0.524
0.364SerCys: 0.364 ± 0.158
4.061SerAsp: 4.061 ± 0.359
5.103SerGlu: 5.103 ± 0.492
1.718SerPhe: 1.718 ± 0.223
5.363SerGly: 5.363 ± 0.634
1.093SerHis: 1.093 ± 0.231
4.009SerIle: 4.009 ± 0.344
4.322SerLys: 4.322 ± 0.564
5.519SerLeu: 5.519 ± 0.577
1.875SerMet: 1.875 ± 0.291
2.968SerAsn: 2.968 ± 0.408
2.239SerPro: 2.239 ± 0.382
2.083SerGln: 2.083 ± 0.3
2.343SerArg: 2.343 ± 0.357
4.999SerSer: 4.999 ± 0.531
4.842SerThr: 4.842 ± 0.472
4.478SerVal: 4.478 ± 0.422
1.302SerTrp: 1.302 ± 0.284
2.656SerTyr: 2.656 ± 0.317
0.0SerXaa: 0.0 ± 0.0
Thr
5.363ThrAla: 5.363 ± 0.798
0.208ThrCys: 0.208 ± 0.112
3.957ThrAsp: 3.957 ± 0.43
3.697ThrGlu: 3.697 ± 0.434
2.291ThrPhe: 2.291 ± 0.319
5.259ThrGly: 5.259 ± 0.462
1.354ThrHis: 1.354 ± 0.292
4.009ThrIle: 4.009 ± 0.491
3.593ThrLys: 3.593 ± 0.351
5.207ThrLeu: 5.207 ± 0.449
1.77ThrMet: 1.77 ± 0.285
2.447ThrAsn: 2.447 ± 0.353
3.541ThrPro: 3.541 ± 0.474
2.187ThrGln: 2.187 ± 0.294
2.76ThrArg: 2.76 ± 0.458
4.322ThrSer: 4.322 ± 0.392
4.895ThrThr: 4.895 ± 0.562
5.051ThrVal: 5.051 ± 0.565
0.677ThrTrp: 0.677 ± 0.19
2.291ThrTyr: 2.291 ± 0.416
0.0ThrXaa: 0.0 ± 0.0
Val
7.081ValAla: 7.081 ± 0.62
0.364ValCys: 0.364 ± 0.151
4.061ValAsp: 4.061 ± 0.465
5.103ValGlu: 5.103 ± 0.598
2.499ValPhe: 2.499 ± 0.355
5.259ValGly: 5.259 ± 0.581
1.146ValHis: 1.146 ± 0.221
4.738ValIle: 4.738 ± 0.534
4.009ValLys: 4.009 ± 0.405
5.467ValLeu: 5.467 ± 0.496
1.822ValMet: 1.822 ± 0.257
2.916ValAsn: 2.916 ± 0.4
3.228ValPro: 3.228 ± 0.367
2.083ValGln: 2.083 ± 0.336
3.801ValArg: 3.801 ± 0.568
4.686ValSer: 4.686 ± 0.487
5.051ValThr: 5.051 ± 0.541
4.999ValVal: 4.999 ± 0.647
0.989ValTrp: 0.989 ± 0.227
2.656ValTyr: 2.656 ± 0.462
0.0ValXaa: 0.0 ± 0.0
Trp
1.146TrpAla: 1.146 ± 0.197
0.208TrpCys: 0.208 ± 0.105
0.677TrpAsp: 0.677 ± 0.186
1.146TrpGlu: 1.146 ± 0.218
0.625TrpPhe: 0.625 ± 0.191
0.781TrpGly: 0.781 ± 0.166
0.417TrpHis: 0.417 ± 0.124
0.677TrpIle: 0.677 ± 0.195
0.469TrpLys: 0.469 ± 0.156
1.093TrpLeu: 1.093 ± 0.277
0.364TrpMet: 0.364 ± 0.142
1.146TrpAsn: 1.146 ± 0.234
0.573TrpPro: 0.573 ± 0.205
0.26TrpGln: 0.26 ± 0.088
0.625TrpArg: 0.625 ± 0.159
0.937TrpSer: 0.937 ± 0.219
0.833TrpThr: 0.833 ± 0.191
0.625TrpVal: 0.625 ± 0.149
0.052TrpTrp: 0.052 ± 0.054
0.417TrpTyr: 0.417 ± 0.137
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.812TyrAla: 2.812 ± 0.53
0.26TyrCys: 0.26 ± 0.112
2.187TyrAsp: 2.187 ± 0.392
1.875TyrGlu: 1.875 ± 0.381
1.406TyrPhe: 1.406 ± 0.288
2.656TyrGly: 2.656 ± 0.397
0.469TyrHis: 0.469 ± 0.155
1.822TyrIle: 1.822 ± 0.356
2.291TyrLys: 2.291 ± 0.444
2.603TyrLeu: 2.603 ± 0.344
0.521TyrMet: 0.521 ± 0.179
1.406TyrAsn: 1.406 ± 0.246
1.093TyrPro: 1.093 ± 0.229
1.198TyrGln: 1.198 ± 0.243
1.77TyrArg: 1.77 ± 0.284
2.551TyrSer: 2.551 ± 0.398
2.083TyrThr: 2.083 ± 0.396
2.499TyrVal: 2.499 ± 0.274
0.417TyrTrp: 0.417 ± 0.164
0.833TyrTyr: 0.833 ± 0.23
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 92 proteins (19206 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski