Amino acid dipeptide frequency for Aliarcobacter faecis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
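As a rough illustration of the calculation described above, here is a minimal Python sketch. The function names `dipeptide_per_mille` and `classify` are hypothetical, and the sketch assumes that overlapping residue pairs are counted within each protein and normalised by the total number of pairs; the database's exact counting and normalisation are not stated here and may differ.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} for all 441 pairs (20 + Xaa)."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs are counted inside each protein, never across proteins.
        for first, second in zip(seq, seq[1:]):
            # Any residue outside the standard alphabet is folded into X (Xaa).
            first = first if first in alphabet else "X"
            second = second if second in alphabet else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    letters = alphabet + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(letters, repeat=2)}


def classify(freq_per_mille, n_possible=400):
    """Compare a frequency with the uniform expectation (2.5 per mille for 400)."""
    expected = 1000.0 / n_possible
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input (not real proteome data): pairs are MK, KK, KL, MK, KL.
freqs = dipeptide_per_mille(["MKKL", "MKL"])
print(freqs["MK"], classify(freqs["MK"]))  # 400.0 over
```

With the table values, `classify(11.63)` (LysIle) gives "over" and `classify(0.053)` (TrpPro) gives "under", which corresponds to the red and blue marking described above.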

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
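The unit conversion can be sketched in a few lines of Python. The helper names below are hypothetical and are used only for illustration; the example values are taken from the table (LysIle) and from the uniform expectation of 1/400.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1 percent = 10 per mille)."""
    return value_per_mille / 10.0


def fraction_to_per_mille(fraction):
    """Convert a plain fraction to per mille."""
    return fraction * 1000.0


# LysIle is listed as 11.63 per mille, i.e. a fraction of about 0.01163.
print(per_mille_to_fraction(11.63))
# The uniform expectation 1/400 is 2.5 per mille, i.e. 0.25 percent.
print(fraction_to_per_mille(1 / 400), per_mille_to_percent(2.5))
```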

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.351 ± 0.214
AlaCys: 0.371 ± 0.08
AlaAsp: 1.616 ± 0.223
AlaGlu: 1.695 ± 0.213
AlaPhe: 2.225 ± 0.244
AlaGly: 1.801 ± 0.237
AlaHis: 0.662 ± 0.125
AlaIle: 4.583 ± 0.432
AlaLys: 3.921 ± 0.305
AlaLeu: 4.954 ± 0.383
AlaMet: 1.06 ± 0.172
AlaAsn: 2.755 ± 0.265
AlaPro: 0.954 ± 0.164
AlaGln: 1.007 ± 0.177
AlaArg: 1.298 ± 0.221
AlaSer: 2.411 ± 0.232
AlaThr: 2.278 ± 0.309
AlaVal: 1.854 ± 0.251
AlaTrp: 0.185 ± 0.073
AlaTyr: 2.225 ± 0.221
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.238 ± 0.079
CysCys: 0.079 ± 0.039
CysAsp: 0.821 ± 0.154
CysGlu: 0.795 ± 0.124
CysPhe: 0.583 ± 0.122
CysGly: 0.583 ± 0.119
CysHis: 0.132 ± 0.056
CysIle: 0.503 ± 0.139
CysLys: 0.954 ± 0.165
CysLeu: 0.53 ± 0.12
CysMet: 0.106 ± 0.054
CysAsn: 0.742 ± 0.134
CysPro: 0.371 ± 0.112
CysGln: 0.371 ± 0.103
CysArg: 0.344 ± 0.091
CysSer: 0.636 ± 0.131
CysThr: 0.371 ± 0.103
CysVal: 0.238 ± 0.072
CysTrp: 0.079 ± 0.043
CysTyr: 0.424 ± 0.097
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.066 ± 0.273
AspCys: 0.477 ± 0.102
AspAsp: 2.994 ± 0.271
AspGlu: 5.484 ± 0.428
AspPhe: 4.239 ± 0.351
AspGly: 2.411 ± 0.319
AspHis: 0.503 ± 0.098
AspIle: 7.285 ± 0.449
AspLys: 6.649 ± 0.401
AspLeu: 4.954 ± 0.364
AspMet: 1.431 ± 0.166
AspAsn: 4.689 ± 0.294
AspPro: 0.927 ± 0.146
AspGln: 0.848 ± 0.142
AspArg: 1.589 ± 0.206
AspSer: 3.629 ± 0.279
AspThr: 2.305 ± 0.277
AspVal: 2.119 ± 0.231
AspTrp: 0.45 ± 0.124
AspTyr: 2.941 ± 0.293
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.994 ± 0.261
GluCys: 0.609 ± 0.11
GluAsp: 4.742 ± 0.36
GluGlu: 5.828 ± 0.475
GluPhe: 4.027 ± 0.305
GluGly: 2.384 ± 0.253
GluHis: 2.093 ± 0.284
GluIle: 8.398 ± 0.467
GluLys: 8.822 ± 0.507
GluLeu: 7.788 ± 0.817
GluMet: 1.378 ± 0.187
GluAsn: 6.411 ± 0.368
GluPro: 1.537 ± 0.165
GluGln: 3.179 ± 0.384
GluArg: 2.782 ± 0.282
GluSer: 3.444 ± 0.311
GluThr: 3.02 ± 0.242
GluVal: 3.841 ± 0.309
GluTrp: 0.477 ± 0.119
GluTyr: 3.947 ± 0.359
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.384 ± 0.28
PheCys: 0.503 ± 0.109
PheAsp: 4.186 ± 0.305
PheGlu: 4.451 ± 0.321
PhePhe: 2.967 ± 0.297
PheGly: 2.676 ± 0.297
PheHis: 0.689 ± 0.142
PheIle: 5.51 ± 0.477
PheLys: 5.219 ± 0.382
PheLeu: 5.669 ± 0.396
PheMet: 1.06 ± 0.145
PheAsn: 3.629 ± 0.273
PhePro: 1.139 ± 0.147
PheGln: 1.722 ± 0.198
PheArg: 1.298 ± 0.148
PheSer: 4.027 ± 0.333
PheThr: 2.676 ± 0.272
PheVal: 2.013 ± 0.214
PheTrp: 0.689 ± 0.162
PheTyr: 2.967 ± 0.283
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.119 ± 0.309
GlyCys: 0.583 ± 0.109
GlyAsp: 1.881 ± 0.213
GlyGlu: 2.225 ± 0.199
GlyPhe: 2.172 ± 0.262
GlyGly: 2.013 ± 0.289
GlyHis: 0.662 ± 0.138
GlyIle: 3.788 ± 0.264
GlyLys: 4.212 ± 0.361
GlyLeu: 3.1 ± 0.334
GlyMet: 0.742 ± 0.153
GlyAsn: 2.808 ± 0.279
GlyPro: 0.344 ± 0.098
GlyGln: 0.901 ± 0.139
GlyArg: 1.219 ± 0.211
GlySer: 2.464 ± 0.347
GlyThr: 2.358 ± 0.275
GlyVal: 2.278 ± 0.298
GlyTrp: 0.265 ± 0.077
GlyTyr: 1.987 ± 0.266
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.715 ± 0.141
HisCys: 0.079 ± 0.043
HisAsp: 0.874 ± 0.168
HisGlu: 1.272 ± 0.192
HisPhe: 1.219 ± 0.196
HisGly: 0.503 ± 0.113
HisHis: 0.45 ± 0.127
HisIle: 1.722 ± 0.174
HisLys: 1.669 ± 0.25
HisLeu: 1.934 ± 0.259
HisMet: 0.344 ± 0.094
HisAsn: 1.06 ± 0.132
HisPro: 0.503 ± 0.101
HisGln: 0.583 ± 0.157
HisArg: 0.503 ± 0.134
HisSer: 0.927 ± 0.207
HisThr: 0.795 ± 0.141
HisVal: 0.45 ± 0.114
HisTrp: 0.238 ± 0.068
HisTyr: 1.06 ± 0.184
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.292 ± 0.319
IleCys: 1.219 ± 0.156
IleAsp: 6.994 ± 0.374
IleGlu: 9.087 ± 0.447
IlePhe: 5.272 ± 0.429
IleGly: 4.027 ± 0.404
IleHis: 1.484 ± 0.245
IleIle: 8.451 ± 0.65
IleLys: 10.967 ± 0.724
IleLeu: 10.199 ± 0.571
IleMet: 1.51 ± 0.207
IleAsn: 8.424 ± 0.428
IlePro: 3.047 ± 0.347
IleGln: 3.205 ± 0.304
IleArg: 3.047 ± 0.353
IleSer: 7.471 ± 0.42
IleThr: 5.245 ± 0.257
IleVal: 4.583 ± 0.341
IleTrp: 0.689 ± 0.127
IleTyr: 4.08 ± 0.329
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.921 ± 0.332
LysCys: 0.715 ± 0.133
LysAsp: 6.861 ± 0.442
LysGlu: 10.517 ± 0.675
LysPhe: 4.398 ± 0.407
LysGly: 3.152 ± 0.308
LysHis: 1.589 ± 0.191
LysIle: 11.63 ± 0.504
LysLys: 10.12 ± 0.656
LysLeu: 9.775 ± 0.459
LysMet: 1.748 ± 0.206
LysAsn: 8.451 ± 0.542
LysPro: 2.146 ± 0.316
LysGln: 3.815 ± 0.394
LysArg: 3.02 ± 0.249
LysSer: 6.861 ± 0.437
LysThr: 5.033 ± 0.375
LysVal: 4.61 ± 0.367
LysTrp: 0.742 ± 0.143
LysTyr: 5.987 ± 0.421
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.947 ± 0.382
LeuCys: 0.848 ± 0.156
LeuAsp: 5.563 ± 0.358
LeuGlu: 8.848 ± 0.626
LeuPhe: 5.934 ± 0.569
LeuGly: 3.603 ± 0.327
LeuHis: 1.616 ± 0.182
LeuIle: 9.246 ± 0.47
LeuLys: 11.391 ± 0.552
LeuLeu: 9.457 ± 0.566
LeuMet: 1.722 ± 0.186
LeuAsn: 8.08 ± 0.483
LeuPro: 2.596 ± 0.236
LeuGln: 3.788 ± 0.48
LeuArg: 2.649 ± 0.296
LeuSer: 7.153 ± 0.516
LeuThr: 4.61 ± 0.397
LeuVal: 4.689 ± 0.427
LeuTrp: 0.662 ± 0.108
LeuTyr: 4.239 ± 0.37
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.954 ± 0.171
MetCys: 0.159 ± 0.067
MetAsp: 0.954 ± 0.169
MetGlu: 1.219 ± 0.183
MetPhe: 1.007 ± 0.195
MetGly: 0.742 ± 0.126
MetHis: 0.318 ± 0.096
MetIle: 2.013 ± 0.244
MetLys: 1.722 ± 0.205
MetLeu: 1.801 ± 0.224
MetMet: 0.212 ± 0.08
MetAsn: 1.192 ± 0.194
MetPro: 0.397 ± 0.115
MetGln: 0.715 ± 0.144
MetArg: 0.742 ± 0.121
MetSer: 1.166 ± 0.164
MetThr: 0.901 ± 0.167
MetVal: 0.954 ± 0.167
MetTrp: 0.159 ± 0.063
MetTyr: 0.636 ± 0.12
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.755 ± 0.265
AsnCys: 0.742 ± 0.142
AsnAsp: 4.159 ± 0.364
AsnGlu: 6.384 ± 0.475
AsnPhe: 4.027 ± 0.314
AsnGly: 2.994 ± 0.294
AsnHis: 1.192 ± 0.21
AsnIle: 9.299 ± 0.662
AsnLys: 8.133 ± 0.495
AsnLeu: 7.577 ± 0.495
AsnMet: 1.166 ± 0.17
AsnAsn: 6.384 ± 0.485
AsnPro: 2.331 ± 0.216
AsnGln: 2.358 ± 0.281
AsnArg: 2.543 ± 0.268
AsnSer: 5.563 ± 0.366
AsnThr: 2.914 ± 0.203
AsnVal: 2.941 ± 0.293
AsnTrp: 0.662 ± 0.123
AsnTyr: 3.285 ± 0.346
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.795 ± 0.139
ProCys: 0.106 ± 0.049
ProAsp: 1.457 ± 0.196
ProGlu: 1.695 ± 0.197
ProPhe: 1.563 ± 0.212
ProGly: 0.503 ± 0.103
ProHis: 0.397 ± 0.106
ProIle: 2.464 ± 0.306
ProLys: 2.331 ± 0.22
ProLeu: 2.384 ± 0.303
ProMet: 0.291 ± 0.075
ProAsn: 1.775 ± 0.226
ProPro: 0.715 ± 0.108
ProGln: 0.53 ± 0.118
ProArg: 0.636 ± 0.125
ProSer: 1.96 ± 0.252
ProThr: 1.351 ± 0.197
ProVal: 1.007 ± 0.163
ProTrp: 0.159 ± 0.066
ProTyr: 1.192 ± 0.167
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.113 ± 0.181
GlnCys: 0.265 ± 0.094
GlnAsp: 1.695 ± 0.166
GlnGlu: 2.702 ± 0.294
GlnPhe: 1.616 ± 0.251
GlnGly: 1.272 ± 0.19
GlnHis: 0.477 ± 0.104
GlnIle: 3.179 ± 0.307
GlnLys: 4.053 ± 0.364
GlnLeu: 3.152 ± 0.338
GlnMet: 0.477 ± 0.117
GlnAsn: 2.861 ± 0.271
GlnPro: 0.424 ± 0.108
GlnGln: 1.139 ± 0.184
GlnArg: 1.298 ± 0.195
GlnSer: 1.934 ± 0.249
GlnThr: 1.854 ± 0.237
GlnVal: 1.272 ± 0.207
GlnTrp: 0.318 ± 0.086
GlnTyr: 1.378 ± 0.211
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.537 ± 0.202
ArgCys: 0.238 ± 0.083
ArgAsp: 1.669 ± 0.24
ArgGlu: 2.596 ± 0.247
ArgPhe: 1.669 ± 0.231
ArgGly: 1.192 ± 0.171
ArgHis: 0.609 ± 0.12
ArgIle: 2.543 ± 0.243
ArgLys: 2.225 ± 0.242
ArgLeu: 3.841 ± 0.289
ArgMet: 0.609 ± 0.11
ArgAsn: 2.146 ± 0.225
ArgPro: 0.583 ± 0.133
ArgGln: 0.795 ± 0.148
ArgArg: 0.848 ± 0.166
ArgSer: 1.616 ± 0.231
ArgThr: 1.378 ± 0.196
ArgVal: 2.013 ± 0.234
ArgTrp: 0.318 ± 0.075
ArgTyr: 1.589 ± 0.211
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.04 ± 0.241
SerCys: 0.556 ± 0.15
SerAsp: 3.47 ± 0.291
SerGlu: 3.656 ± 0.303
SerPhe: 4.53 ± 0.335
SerGly: 2.623 ± 0.28
SerHis: 1.219 ± 0.147
SerIle: 7.153 ± 0.435
SerLys: 7.55 ± 0.547
SerLeu: 7.656 ± 0.517
SerMet: 1.272 ± 0.16
SerAsn: 5.669 ± 0.33
SerPro: 1.431 ± 0.17
SerGln: 2.331 ± 0.208
SerArg: 1.722 ± 0.227
SerSer: 5.696 ± 0.436
SerThr: 3.391 ± 0.293
SerVal: 2.702 ± 0.248
SerTrp: 0.503 ± 0.093
SerTyr: 3.285 ± 0.316
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.722 ± 0.237
ThrCys: 0.371 ± 0.109
ThrAsp: 2.305 ± 0.227
ThrGlu: 2.331 ± 0.293
ThrPhe: 2.172 ± 0.239
ThrGly: 1.907 ± 0.229
ThrHis: 0.715 ± 0.139
ThrIle: 5.616 ± 0.389
ThrLys: 5.537 ± 0.353
ThrLeu: 5.192 ± 0.336
ThrMet: 0.768 ± 0.128
ThrAsn: 3.603 ± 0.262
ThrPro: 1.51 ± 0.219
ThrGln: 1.934 ± 0.247
ThrArg: 1.351 ± 0.168
ThrSer: 3.391 ± 0.309
ThrThr: 2.861 ± 0.284
ThrVal: 1.987 ± 0.242
ThrTrp: 0.265 ± 0.083
ThrTyr: 2.278 ± 0.255
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.252 ± 0.278
ValCys: 0.397 ± 0.111
ValAsp: 2.57 ± 0.242
ValGlu: 2.888 ± 0.247
ValPhe: 2.305 ± 0.213
ValGly: 1.616 ± 0.202
ValHis: 0.901 ± 0.157
ValIle: 4.318 ± 0.266
ValLys: 3.947 ± 0.33
ValLeu: 4.61 ± 0.358
ValMet: 0.848 ± 0.144
ValAsn: 3.02 ± 0.293
ValPro: 1.245 ± 0.197
ValGln: 1.298 ± 0.202
ValArg: 1.325 ± 0.208
ValSer: 3.603 ± 0.375
ValThr: 2.013 ± 0.239
ValVal: 1.96 ± 0.27
ValTrp: 0.344 ± 0.084
ValTyr: 1.828 ± 0.208
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.291 ± 0.079
TrpCys: 0.079 ± 0.044
TrpAsp: 0.424 ± 0.101
TrpGlu: 0.53 ± 0.134
TrpPhe: 0.45 ± 0.112
TrpGly: 0.291 ± 0.105
TrpHis: 0.185 ± 0.082
TrpIle: 0.821 ± 0.131
TrpLys: 0.636 ± 0.104
TrpLeu: 0.636 ± 0.117
TrpMet: 0.212 ± 0.074
TrpAsn: 0.503 ± 0.098
TrpPro: 0.053 ± 0.031
TrpGln: 0.265 ± 0.068
TrpArg: 0.424 ± 0.099
TrpSer: 0.583 ± 0.145
TrpThr: 0.424 ± 0.1
TrpVal: 0.265 ± 0.085
TrpTrp: 0.132 ± 0.054
TrpTyr: 0.424 ± 0.106
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.722 ± 0.21
TyrCys: 0.503 ± 0.111
TyrAsp: 2.755 ± 0.322
TyrGlu: 3.523 ± 0.266
TyrPhe: 3.1 ± 0.38
TyrGly: 1.828 ± 0.203
TyrHis: 1.007 ± 0.154
TyrIle: 4.768 ± 0.426
TyrLys: 5.033 ± 0.345
TyrLeu: 5.431 ± 0.373
TyrMet: 0.98 ± 0.155
TyrAsn: 3.205 ± 0.357
TyrPro: 1.06 ± 0.156
TyrGln: 1.695 ± 0.223
TyrArg: 1.378 ± 0.205
TyrSer: 3.921 ± 0.37
TyrThr: 2.04 ± 0.22
TyrVal: 1.51 ± 0.173
TyrTrp: 0.291 ± 0.076
TyrTyr: 2.57 ± 0.294
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 124 proteins (37749 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
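A protein-level bootstrap of this kind could look roughly like the Python sketch below. The note only states that bootstrapping was done 100 times at the protein level; resampling whole proteins with replacement and reporting the standard deviation of the replicate estimates are assumptions made here, and the names `pair_per_mille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Count overlapping residue pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over n_boot protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input (not real proteome data).
toy = ["MKIKL", "MKLLE", "MEIKK", "MLKIE"]
print(pair_per_mille(toy, "KI"), bootstrap_error(toy, "KI"))
```

Resampling proteins rather than individual dipeptides keeps the residue pairs of one protein together, so the estimate reflects variation between proteins.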

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski