Amino acid dipepetide frequency for Ipomoea nil (Japanese morning glory) (Pharbitis nil)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.514AlaAla: 4.514 ± 0.605
0.815AlaCys: 0.815 ± 0.182
1.787AlaAsp: 1.787 ± 0.269
2.978AlaGlu: 2.978 ± 0.341
3.479AlaPhe: 3.479 ± 0.413
4.263AlaGly: 4.263 ± 0.467
1.912AlaHis: 1.912 ± 0.349
4.796AlaIle: 4.796 ± 0.436
1.975AlaLys: 1.975 ± 0.256
6.331AlaLeu: 6.331 ± 0.522
1.661AlaMet: 1.661 ± 0.307
1.881AlaAsn: 1.881 ± 0.194
2.946AlaPro: 2.946 ± 0.412
1.975AlaGln: 1.975 ± 0.322
4.137AlaArg: 4.137 ± 0.501
4.137AlaSer: 4.137 ± 0.404
3.009AlaThr: 3.009 ± 0.32
3.667AlaVal: 3.667 ± 0.383
1.066AlaTrp: 1.066 ± 0.232
1.943AlaTyr: 1.943 ± 0.283
0.0AlaXaa: 0.0 ± 0.0
Cys
0.533CysAla: 0.533 ± 0.14
0.125CysCys: 0.125 ± 0.064
0.47CysAsp: 0.47 ± 0.128
0.721CysGlu: 0.721 ± 0.149
1.003CysPhe: 1.003 ± 0.19
0.972CysGly: 0.972 ± 0.201
0.282CysHis: 0.282 ± 0.1
1.661CysIle: 1.661 ± 0.215
0.502CysLys: 0.502 ± 0.124
1.066CysLeu: 1.066 ± 0.184
0.188CysMet: 0.188 ± 0.065
0.502CysAsn: 0.502 ± 0.13
0.47CysPro: 0.47 ± 0.134
0.313CysGln: 0.313 ± 0.098
0.658CysArg: 0.658 ± 0.159
1.034CysSer: 1.034 ± 0.184
0.407CysThr: 0.407 ± 0.099
0.784CysVal: 0.784 ± 0.191
0.094CysTrp: 0.094 ± 0.062
0.627CysTyr: 0.627 ± 0.162
0.0CysXaa: 0.0 ± 0.0
Asp
1.943AspAla: 1.943 ± 0.249
0.219AspCys: 0.219 ± 0.072
1.442AspAsp: 1.442 ± 0.272
1.536AspGlu: 1.536 ± 0.214
1.943AspPhe: 1.943 ± 0.262
2.1AspGly: 2.1 ± 0.278
0.721AspHis: 0.721 ± 0.175
3.009AspIle: 3.009 ± 0.284
1.818AspLys: 1.818 ± 0.333
4.231AspLeu: 4.231 ± 0.357
0.815AspMet: 0.815 ± 0.156
1.536AspAsn: 1.536 ± 0.26
3.04AspPro: 3.04 ± 0.38
2.037AspGln: 2.037 ± 0.177
2.633AspArg: 2.633 ± 0.262
3.04AspSer: 3.04 ± 0.474
2.257AspThr: 2.257 ± 0.281
2.006AspVal: 2.006 ± 0.285
0.47AspTrp: 0.47 ± 0.113
1.003AspTyr: 1.003 ± 0.184
0.0AspXaa: 0.0 ± 0.0
Glu
3.197GluAla: 3.197 ± 0.338
0.439GluCys: 0.439 ± 0.145
1.599GluAsp: 1.599 ± 0.233
3.605GluGlu: 3.605 ± 0.67
2.194GluPhe: 2.194 ± 0.227
2.758GluGly: 2.758 ± 0.348
0.909GluHis: 0.909 ± 0.144
4.733GluIle: 4.733 ± 0.419
4.043GluLys: 4.043 ± 0.621
5.14GluLeu: 5.14 ± 0.424
1.724GluMet: 1.724 ± 0.237
2.1GluAsn: 2.1 ± 0.292
1.442GluPro: 1.442 ± 0.204
2.1GluGln: 2.1 ± 0.327
3.416GluArg: 3.416 ± 0.442
3.291GluSer: 3.291 ± 0.306
3.479GluThr: 3.479 ± 0.84
2.633GluVal: 2.633 ± 0.286
0.596GluTrp: 0.596 ± 0.158
1.724GluTyr: 1.724 ± 0.209
0.0GluXaa: 0.0 ± 0.0
Phe
3.26PheAla: 3.26 ± 0.341
1.034PheCys: 1.034 ± 0.153
2.194PheAsp: 2.194 ± 0.248
2.539PheGlu: 2.539 ± 0.302
5.266PhePhe: 5.266 ± 0.548
3.887PheGly: 3.887 ± 0.524
2.194PheHis: 2.194 ± 0.298
4.231PheIle: 4.231 ± 0.396
1.254PheLys: 1.254 ± 0.213
9.341PheLeu: 9.341 ± 0.953
1.254PheMet: 1.254 ± 0.211
1.755PheAsn: 1.755 ± 0.252
3.072PhePro: 3.072 ± 0.336
1.943PheGln: 1.943 ± 0.206
3.354PheArg: 3.354 ± 0.263
4.482PheSer: 4.482 ± 0.507
2.633PheThr: 2.633 ± 0.311
3.385PheVal: 3.385 ± 0.44
0.94PheTrp: 0.94 ± 0.178
2.319PheTyr: 2.319 ± 0.256
0.0PheXaa: 0.0 ± 0.0
Gly
4.294GlyAla: 4.294 ± 0.47
0.69GlyCys: 0.69 ± 0.144
2.257GlyAsp: 2.257 ± 0.251
2.946GlyGlu: 2.946 ± 0.361
4.106GlyPhe: 4.106 ± 0.362
4.514GlyGly: 4.514 ± 0.555
1.63GlyHis: 1.63 ± 0.315
6.394GlyIle: 6.394 ± 0.584
3.636GlyLys: 3.636 ± 0.355
6.927GlyLeu: 6.927 ± 0.617
1.818GlyMet: 1.818 ± 0.231
2.602GlyAsn: 2.602 ± 0.245
2.413GlyPro: 2.413 ± 0.308
2.445GlyGln: 2.445 ± 0.335
4.702GlyArg: 4.702 ± 0.684
5.485GlySer: 5.485 ± 0.452
3.918GlyThr: 3.918 ± 0.344
4.514GlyVal: 4.514 ± 0.403
1.254GlyTrp: 1.254 ± 0.25
2.037GlyTyr: 2.037 ± 0.268
0.0GlyXaa: 0.0 ± 0.0
His
1.222HisAla: 1.222 ± 0.256
0.063HisCys: 0.063 ± 0.046
0.658HisAsp: 0.658 ± 0.138
0.69HisGlu: 0.69 ± 0.176
2.163HisPhe: 2.163 ± 0.259
1.661HisGly: 1.661 ± 0.21
1.003HisHis: 1.003 ± 0.229
1.881HisIle: 1.881 ± 0.241
1.003HisLys: 1.003 ± 0.2
2.696HisLeu: 2.696 ± 0.421
0.784HisMet: 0.784 ± 0.168
1.16HisAsn: 1.16 ± 0.23
0.878HisPro: 0.878 ± 0.196
0.752HisGln: 0.752 ± 0.146
1.254HisArg: 1.254 ± 0.228
1.975HisSer: 1.975 ± 0.311
1.16HisThr: 1.16 ± 0.151
1.285HisVal: 1.285 ± 0.213
0.251HisTrp: 0.251 ± 0.078
1.034HisTyr: 1.034 ± 0.159
0.0HisXaa: 0.0 ± 0.0
Ile
5.015IleAla: 5.015 ± 0.482
1.505IleCys: 1.505 ± 0.224
3.511IleAsp: 3.511 ± 0.349
4.231IleGlu: 4.231 ± 0.427
5.172IlePhe: 5.172 ± 0.432
6.112IleGly: 6.112 ± 0.406
2.602IleHis: 2.602 ± 0.282
4.952IleIle: 4.952 ± 0.414
2.727IleLys: 2.727 ± 0.312
9.309IleLeu: 9.309 ± 0.697
0.878IleMet: 0.878 ± 0.208
2.508IleAsn: 2.508 ± 0.364
4.858IlePro: 4.858 ± 0.316
3.103IleGln: 3.103 ± 0.324
4.89IleArg: 4.89 ± 0.533
7.084IleSer: 7.084 ± 0.562
4.608IleThr: 4.608 ± 0.443
4.451IleVal: 4.451 ± 0.351
1.316IleTrp: 1.316 ± 0.194
3.322IleTyr: 3.322 ± 0.373
0.0IleXaa: 0.0 ± 0.0
Lys
2.131LysAla: 2.131 ± 0.307
0.564LysCys: 0.564 ± 0.117
2.351LysAsp: 2.351 ± 0.286
3.981LysGlu: 3.981 ± 0.673
1.787LysPhe: 1.787 ± 0.253
3.072LysGly: 3.072 ± 0.317
0.94LysHis: 0.94 ± 0.172
4.294LysIle: 4.294 ± 0.396
6.206LysLys: 6.206 ± 1.031
3.605LysLeu: 3.605 ± 0.337
1.191LysMet: 1.191 ± 0.181
3.228LysAsn: 3.228 ± 0.451
1.912LysPro: 1.912 ± 0.272
1.787LysGln: 1.787 ± 0.276
4.075LysArg: 4.075 ± 0.471
3.73LysSer: 3.73 ± 0.475
2.257LysThr: 2.257 ± 0.488
2.131LysVal: 2.131 ± 0.233
0.533LysTrp: 0.533 ± 0.173
1.912LysTyr: 1.912 ± 0.262
0.0LysXaa: 0.0 ± 0.0
Leu
7.272LeuAla: 7.272 ± 0.66
1.505LeuCys: 1.505 ± 0.307
3.479LeuAsp: 3.479 ± 0.412
4.733LeuGlu: 4.733 ± 0.496
7.961LeuPhe: 7.961 ± 0.79
8.118LeuGly: 8.118 ± 0.69
2.1LeuHis: 2.1 ± 0.262
8.181LeuIle: 8.181 ± 0.524
3.824LeuLys: 3.824 ± 0.363
11.597LeuLeu: 11.597 ± 0.716
2.476LeuMet: 2.476 ± 0.259
4.357LeuAsn: 4.357 ± 0.432
4.514LeuPro: 4.514 ± 0.413
2.852LeuGln: 2.852 ± 0.287
5.799LeuArg: 5.799 ± 0.445
8.776LeuSer: 8.776 ± 0.742
4.733LeuThr: 4.733 ± 0.398
7.084LeuVal: 7.084 ± 0.619
1.41LeuTrp: 1.41 ± 0.237
3.636LeuTyr: 3.636 ± 0.356
0.0LeuXaa: 0.0 ± 0.0
Met
1.63MetAla: 1.63 ± 0.247
0.157MetCys: 0.157 ± 0.067
0.878MetAsp: 0.878 ± 0.146
1.191MetGlu: 1.191 ± 0.199
1.191MetPhe: 1.191 ± 0.191
1.849MetGly: 1.849 ± 0.263
0.752MetHis: 0.752 ± 0.207
2.194MetIle: 2.194 ± 0.342
1.191MetLys: 1.191 ± 0.176
2.037MetLeu: 2.037 ± 0.271
0.721MetMet: 0.721 ± 0.143
1.316MetAsn: 1.316 ± 0.204
1.254MetPro: 1.254 ± 0.185
0.69MetGln: 0.69 ± 0.161
0.94MetArg: 0.94 ± 0.175
2.1MetSer: 2.1 ± 0.254
1.348MetThr: 1.348 ± 0.192
1.128MetVal: 1.128 ± 0.181
0.407MetTrp: 0.407 ± 0.102
0.784MetTyr: 0.784 ± 0.153
0.0MetXaa: 0.0 ± 0.0
Asn
1.975AsnAla: 1.975 ± 0.274
0.533AsnCys: 0.533 ± 0.141
1.881AsnAsp: 1.881 ± 0.302
2.413AsnGlu: 2.413 ± 0.371
2.225AsnPhe: 2.225 ± 0.24
2.319AsnGly: 2.319 ± 0.282
1.16AsnHis: 1.16 ± 0.166
2.727AsnIle: 2.727 ± 0.355
2.476AsnLys: 2.476 ± 0.316
4.294AsnLeu: 4.294 ± 0.402
0.909AsnMet: 0.909 ± 0.197
1.63AsnAsn: 1.63 ± 0.259
2.696AsnPro: 2.696 ± 0.305
2.037AsnGln: 2.037 ± 0.385
3.354AsnArg: 3.354 ± 0.284
3.824AsnSer: 3.824 ± 0.406
2.037AsnThr: 2.037 ± 0.261
2.069AsnVal: 2.069 ± 0.299
0.752AsnTrp: 0.752 ± 0.186
1.316AsnTyr: 1.316 ± 0.206
0.0AsnXaa: 0.0 ± 0.0
Pro
2.194ProAla: 2.194 ± 0.278
0.376ProCys: 0.376 ± 0.112
1.285ProAsp: 1.285 ± 0.202
2.633ProGlu: 2.633 ± 0.28
3.04ProPhe: 3.04 ± 0.306
3.072ProGly: 3.072 ± 0.277
0.94ProHis: 0.94 ± 0.167
4.67ProIle: 4.67 ± 0.363
2.163ProLys: 2.163 ± 0.258
4.576ProLeu: 4.576 ± 0.394
1.097ProMet: 1.097 ± 0.187
2.131ProAsn: 2.131 ± 0.3
1.755ProPro: 1.755 ± 0.26
1.442ProGln: 1.442 ± 0.201
2.288ProArg: 2.288 ± 0.281
2.915ProSer: 2.915 ± 0.295
2.57ProThr: 2.57 ± 0.353
2.852ProVal: 2.852 ± 0.388
1.066ProTrp: 1.066 ± 0.148
1.473ProTyr: 1.473 ± 0.183
0.0ProXaa: 0.0 ± 0.0
Gln
2.1GlnAla: 2.1 ± 0.336
0.282GlnCys: 0.282 ± 0.108
1.661GlnAsp: 1.661 ± 0.232
2.225GlnGlu: 2.225 ± 0.246
1.599GlnPhe: 1.599 ± 0.222
2.1GlnGly: 2.1 ± 0.245
0.47GlnHis: 0.47 ± 0.102
3.354GlnIle: 3.354 ± 0.326
2.413GlnLys: 2.413 ± 0.343
3.103GlnLeu: 3.103 ± 0.287
0.909GlnMet: 0.909 ± 0.151
1.505GlnAsn: 1.505 ± 0.243
1.41GlnPro: 1.41 ± 0.197
1.473GlnGln: 1.473 ± 0.194
2.163GlnArg: 2.163 ± 0.27
2.351GlnSer: 2.351 ± 0.246
2.131GlnThr: 2.131 ± 0.307
1.943GlnVal: 1.943 ± 0.308
0.533GlnTrp: 0.533 ± 0.147
1.128GlnTyr: 1.128 ± 0.216
0.0GlnXaa: 0.0 ± 0.0
Arg
3.887ArgAla: 3.887 ± 0.436
0.815ArgCys: 0.815 ± 0.184
2.57ArgAsp: 2.57 ± 0.306
3.385ArgGlu: 3.385 ± 0.375
2.758ArgPhe: 2.758 ± 0.311
4.325ArgGly: 4.325 ± 0.474
0.815ArgHis: 0.815 ± 0.183
6.081ArgIle: 6.081 ± 0.507
4.294ArgLys: 4.294 ± 0.416
4.921ArgLeu: 4.921 ± 0.399
1.599ArgMet: 1.599 ± 0.218
3.385ArgAsn: 3.385 ± 0.382
1.881ArgPro: 1.881 ± 0.209
2.194ArgGln: 2.194 ± 0.291
4.639ArgArg: 4.639 ± 0.479
5.642ArgSer: 5.642 ± 0.654
3.166ArgThr: 3.166 ± 0.314
3.385ArgVal: 3.385 ± 0.359
1.003ArgTrp: 1.003 ± 0.156
1.661ArgTyr: 1.661 ± 0.205
0.0ArgXaa: 0.0 ± 0.0
Ser
4.075SerAla: 4.075 ± 0.356
0.909SerCys: 0.909 ± 0.225
3.072SerAsp: 3.072 ± 0.394
3.918SerGlu: 3.918 ± 0.535
5.109SerPhe: 5.109 ± 0.503
5.14SerGly: 5.14 ± 0.473
1.63SerHis: 1.63 ± 0.214
7.209SerIle: 7.209 ± 0.627
4.043SerLys: 4.043 ± 0.577
8.588SerLeu: 8.588 ± 0.466
2.037SerMet: 2.037 ± 0.316
3.949SerAsn: 3.949 ± 0.58
3.354SerPro: 3.354 ± 0.303
2.57SerGln: 2.57 ± 0.332
4.545SerArg: 4.545 ± 0.317
7.24SerSer: 7.24 ± 0.614
4.514SerThr: 4.514 ± 0.36
4.733SerVal: 4.733 ± 0.348
1.63SerTrp: 1.63 ± 0.227
2.884SerTyr: 2.884 ± 0.381
0.0SerXaa: 0.0 ± 0.0
Thr
2.978ThrAla: 2.978 ± 0.33
0.815ThrCys: 0.815 ± 0.149
2.037ThrAsp: 2.037 ± 0.451
2.037ThrGlu: 2.037 ± 0.235
2.946ThrPhe: 2.946 ± 0.353
3.887ThrGly: 3.887 ± 0.343
1.128ThrHis: 1.128 ± 0.192
4.2ThrIle: 4.2 ± 0.315
3.04ThrLys: 3.04 ± 0.532
4.984ThrLeu: 4.984 ± 0.385
1.191ThrMet: 1.191 ± 0.204
2.852ThrAsn: 2.852 ± 0.431
2.194ThrPro: 2.194 ± 0.273
1.724ThrGln: 1.724 ± 0.215
3.166ThrArg: 3.166 ± 0.346
4.984ThrSer: 4.984 ± 0.381
2.758ThrThr: 2.758 ± 0.315
2.884ThrVal: 2.884 ± 0.357
0.658ThrTrp: 0.658 ± 0.152
1.787ThrTyr: 1.787 ± 0.198
0.0ThrXaa: 0.0 ± 0.0
Val
4.169ValAla: 4.169 ± 0.458
0.784ValCys: 0.784 ± 0.16
2.413ValAsp: 2.413 ± 0.283
2.57ValGlu: 2.57 ± 0.259
3.448ValPhe: 3.448 ± 0.296
4.42ValGly: 4.42 ± 0.488
1.316ValHis: 1.316 ± 0.247
3.636ValIle: 3.636 ± 0.39
2.664ValLys: 2.664 ± 0.331
5.799ValLeu: 5.799 ± 0.431
1.316ValMet: 1.316 ± 0.239
2.1ValAsn: 2.1 ± 0.248
2.696ValPro: 2.696 ± 0.372
1.693ValGln: 1.693 ± 0.231
3.416ValArg: 3.416 ± 0.433
4.545ValSer: 4.545 ± 0.384
2.978ValThr: 2.978 ± 0.355
3.605ValVal: 3.605 ± 0.4
1.222ValTrp: 1.222 ± 0.26
1.943ValTyr: 1.943 ± 0.249
0.0ValXaa: 0.0 ± 0.0
Trp
1.316TrpAla: 1.316 ± 0.276
0.188TrpCys: 0.188 ± 0.077
0.721TrpAsp: 0.721 ± 0.146
0.752TrpGlu: 0.752 ± 0.183
0.94TrpPhe: 0.94 ± 0.17
1.097TrpGly: 1.097 ± 0.167
0.533TrpHis: 0.533 ± 0.144
1.599TrpIle: 1.599 ± 0.252
0.846TrpLys: 0.846 ± 0.152
1.912TrpLeu: 1.912 ± 0.282
0.376TrpMet: 0.376 ± 0.116
0.721TrpAsn: 0.721 ± 0.149
0.376TrpPro: 0.376 ± 0.125
0.439TrpGln: 0.439 ± 0.122
0.627TrpArg: 0.627 ± 0.108
1.505TrpSer: 1.505 ± 0.194
0.658TrpThr: 0.658 ± 0.157
0.784TrpVal: 0.784 ± 0.165
0.313TrpTrp: 0.313 ± 0.125
0.47TrpTyr: 0.47 ± 0.101
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.567TyrAla: 1.567 ± 0.167
0.596TyrCys: 0.596 ± 0.11
1.536TyrAsp: 1.536 ± 0.209
2.037TyrGlu: 2.037 ± 0.227
2.006TyrPhe: 2.006 ± 0.269
2.915TyrGly: 2.915 ± 0.33
0.533TyrHis: 0.533 ± 0.114
2.351TyrIle: 2.351 ± 0.256
1.567TyrLys: 1.567 ± 0.243
3.949TyrLeu: 3.949 ± 0.423
0.784TyrMet: 0.784 ± 0.206
1.41TyrAsn: 1.41 ± 0.171
1.379TyrPro: 1.379 ± 0.226
1.285TyrGln: 1.285 ± 0.177
2.351TyrArg: 2.351 ± 0.252
3.04TyrSer: 3.04 ± 0.357
1.63TyrThr: 1.63 ± 0.223
1.41TyrVal: 1.41 ± 0.2
0.658TyrTrp: 0.658 ± 0.115
1.16TyrTyr: 1.16 ± 0.191
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 109 proteins (31905 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski