Amino acid dipeptide frequency for Dialister sp. CAG:357

Besides single amino acid frequencies, one can also calculate amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent amino acids occurs.
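The calculation can be sketched as follows. This is a minimal illustration, not the database's actual pipeline: it assumes one-letter sequences, overlapping windows of two residues, pairs that never span two proteins, and normalization by the total number of dipeptides. The function name `dipeptide_permille` and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input: MAAK gives MA, AA, AK; AAL gives AA, AL.
toy = ["MAAK", "AAL"]
freqs = dipeptide_permille(toy)
print(freqs["AA"])  # 2 of 5 dipeptides
```

Because the pairs are ordered, AlaCys and CysAla are counted separately, which is why the table is not symmetric.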

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all over-represented dipeptides are marked in red and all under-represented ones in blue in the table.
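The arithmetic behind the expected value, and the resulting over/under split, can be sketched like this. The sketch assumes the colouring threshold is simply the uniform expectation of 2.5‰; the page does not state the exact rule, and the names `EXPECTED_PERMILLE` and `classify` are hypothetical.

```python
# Uniform expectation: every ordered pair of the 20 standard amino acids
# is equally likely.
EXPECTED_PERMILLE = 1000.0 / 20 ** 2   # 2.5 per mille, i.e. 0.25 %
EXPECTED_WITH_XAA = 1000.0 / 21 ** 2   # about 2.27 per mille if Xaa is counted


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the assumed uniform expectation."""
    if value_permille > expected:
        return "over"    # would be marked red under this assumption
    if value_permille < expected:
        return "under"   # would be marked blue under this assumption
    return "expected"


# Two values taken from the table on this page.
print(classify(9.958))  # AlaAla
print(classify(0.215))  # CysCys
```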

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
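As a small worked example of the unit conversion, using the AlaAla value from the table (the helper names are hypothetical):

```python
def permille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0


ala_ala = 9.958  # AlaAla, in per mille, as listed in the table
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)
print(f"{fraction:.6f} {percent:.4f}%")
```

Read this way, roughly one in every hundred dipeptides in this proteome is AlaAla.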

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.958AlaAla: 9.958 ± 0.21
1.075AlaCys: 1.075 ± 0.052
5.105AlaAsp: 5.105 ± 0.103
6.158AlaGlu: 6.158 ± 0.11
3.612AlaPhe: 3.612 ± 0.096
7.627AlaGly: 7.627 ± 0.134
1.463AlaHis: 1.463 ± 0.049
5.388AlaIle: 5.388 ± 0.122
5.096AlaLys: 5.096 ± 0.11
8.27AlaLeu: 8.27 ± 0.136
2.594AlaMet: 2.594 ± 0.086
2.393AlaAsn: 2.393 ± 0.071
2.974AlaPro: 2.974 ± 0.087
2.235AlaGln: 2.235 ± 0.075
3.932AlaArg: 3.932 ± 0.093
4.967AlaSer: 4.967 ± 0.111
2.989AlaThr: 2.989 ± 0.082
7.04AlaVal: 7.04 ± 0.133
0.675AlaTrp: 0.675 ± 0.031
2.706AlaTyr: 2.706 ± 0.068
0.0AlaXaa: 0.0 ± 0.0
Cys
0.991CysAla: 0.991 ± 0.043
0.215CysCys: 0.215 ± 0.021
0.629CysAsp: 0.629 ± 0.034
0.665CysGlu: 0.665 ± 0.034
0.557CysPhe: 0.557 ± 0.035
1.162CysGly: 1.162 ± 0.056
0.366CysHis: 0.366 ± 0.031
0.912CysIle: 0.912 ± 0.045
0.504CysLys: 0.504 ± 0.03
1.132CysLeu: 1.132 ± 0.046
0.287CysMet: 0.287 ± 0.022
0.338CysAsn: 0.338 ± 0.024
0.585CysPro: 0.585 ± 0.039
0.331CysGln: 0.331 ± 0.027
0.669CysArg: 0.669 ± 0.039
0.623CysSer: 0.623 ± 0.037
0.647CysThr: 0.647 ± 0.033
0.768CysVal: 0.768 ± 0.036
0.107CysTrp: 0.107 ± 0.014
0.406CysTyr: 0.406 ± 0.03
0.0CysXaa: 0.0 ± 0.0
Asp
4.678AspAla: 4.678 ± 0.095
0.667AspCys: 0.667 ± 0.035
3.267AspAsp: 3.267 ± 0.089
4.338AspGlu: 4.338 ± 0.09
2.761AspPhe: 2.761 ± 0.077
4.493AspGly: 4.493 ± 0.101
1.382AspHis: 1.382 ± 0.05
4.449AspIle: 4.449 ± 0.087
3.517AspLys: 3.517 ± 0.088
5.255AspLeu: 5.255 ± 0.118
1.98AspMet: 1.98 ± 0.064
1.95AspAsn: 1.95 ± 0.066
2.601AspPro: 2.601 ± 0.075
1.542AspGln: 1.542 ± 0.053
2.79AspArg: 2.79 ± 0.087
3.006AspSer: 3.006 ± 0.078
3.132AspThr: 3.132 ± 0.083
4.158AspVal: 4.158 ± 0.092
0.592AspTrp: 0.592 ± 0.032
2.324AspTyr: 2.324 ± 0.065
0.0AspXaa: 0.0 ± 0.0
Glu
6.173GluAla: 6.173 ± 0.129
0.599GluCys: 0.599 ± 0.035
4.182GluAsp: 4.182 ± 0.098
6.546GluGlu: 6.546 ± 0.159
2.193GluPhe: 2.193 ± 0.057
5.189GluGly: 5.189 ± 0.112
1.283GluHis: 1.283 ± 0.054
5.0GluIle: 5.0 ± 0.116
6.331GluLys: 6.331 ± 0.131
5.522GluLeu: 5.522 ± 0.121
2.421GluMet: 2.421 ± 0.066
3.199GluAsn: 3.199 ± 0.09
1.818GluPro: 1.818 ± 0.062
1.811GluGln: 1.811 ± 0.056
3.392GluArg: 3.392 ± 0.092
3.344GluSer: 3.344 ± 0.086
3.715GluThr: 3.715 ± 0.088
4.176GluVal: 4.176 ± 0.092
0.625GluTrp: 0.625 ± 0.034
2.112GluTyr: 2.112 ± 0.058
0.0GluXaa: 0.0 ± 0.0
Phe
3.248PheAla: 3.248 ± 0.087
0.601PheCys: 0.601 ± 0.033
2.399PheAsp: 2.399 ± 0.07
2.118PheGlu: 2.118 ± 0.061
2.151PhePhe: 2.151 ± 0.075
3.246PheGly: 3.246 ± 0.078
1.094PheHis: 1.094 ± 0.047
2.945PheIle: 2.945 ± 0.087
2.04PheLys: 2.04 ± 0.06
4.399PheLeu: 4.399 ± 0.114
1.351PheMet: 1.351 ± 0.051
1.522PheAsn: 1.522 ± 0.057
1.737PhePro: 1.737 ± 0.059
1.07PheGln: 1.07 ± 0.045
2.013PheArg: 2.013 ± 0.064
2.803PheSer: 2.803 ± 0.079
2.329PheThr: 2.329 ± 0.076
2.583PheVal: 2.583 ± 0.071
0.463PheTrp: 0.463 ± 0.033
1.485PheTyr: 1.485 ± 0.055
0.0PheXaa: 0.0 ± 0.0
Gly
6.075GlyAla: 6.075 ± 0.131
1.121GlyCys: 1.121 ± 0.049
4.14GlyAsp: 4.14 ± 0.088
4.733GlyGlu: 4.733 ± 0.103
3.246GlyPhe: 3.246 ± 0.088
5.667GlyGly: 5.667 ± 0.166
1.77GlyHis: 1.77 ± 0.061
6.447GlyIle: 6.447 ± 0.13
5.691GlyLys: 5.691 ± 0.109
6.61GlyLeu: 6.61 ± 0.125
2.621GlyMet: 2.621 ± 0.069
3.167GlyAsn: 3.167 ± 0.107
1.873GlyPro: 1.873 ± 0.063
1.91GlyGln: 1.91 ± 0.057
3.919GlyArg: 3.919 ± 0.09
4.596GlySer: 4.596 ± 0.128
4.75GlyThr: 4.75 ± 0.12
5.235GlyVal: 5.235 ± 0.123
0.814GlyTrp: 0.814 ± 0.042
2.932GlyTyr: 2.932 ± 0.081
0.0GlyXaa: 0.0 ± 0.0
His
1.594HisAla: 1.594 ± 0.062
0.318HisCys: 0.318 ± 0.032
1.219HisAsp: 1.219 ± 0.051
1.301HisGlu: 1.301 ± 0.052
1.035HisPhe: 1.035 ± 0.045
1.612HisGly: 1.612 ± 0.058
0.662HisHis: 0.662 ± 0.076
1.524HisIle: 1.524 ± 0.063
1.081HisLys: 1.081 ± 0.042
2.057HisLeu: 2.057 ± 0.07
0.682HisMet: 0.682 ± 0.035
0.792HisAsn: 0.792 ± 0.033
1.221HisPro: 1.221 ± 0.048
0.642HisGln: 0.642 ± 0.033
0.934HisArg: 0.934 ± 0.044
1.055HisSer: 1.055 ± 0.044
1.101HisThr: 1.101 ± 0.04
1.616HisVal: 1.616 ± 0.06
0.243HisTrp: 0.243 ± 0.02
0.765HisTyr: 0.765 ± 0.039
0.0HisXaa: 0.0 ± 0.0
Ile
5.994IleAla: 5.994 ± 0.113
1.064IleCys: 1.064 ± 0.044
4.193IleAsp: 4.193 ± 0.095
4.112IleGlu: 4.112 ± 0.084
2.893IlePhe: 2.893 ± 0.095
5.437IleGly: 5.437 ± 0.125
1.812IleHis: 1.812 ± 0.064
4.722IleIle: 4.722 ± 0.097
3.327IleLys: 3.327 ± 0.079
7.064IleLeu: 7.064 ± 0.128
1.842IleMet: 1.842 ± 0.058
2.404IleAsn: 2.404 ± 0.07
3.528IlePro: 3.528 ± 0.079
1.943IleGln: 1.943 ± 0.06
3.858IleArg: 3.858 ± 0.087
4.728IleSer: 4.728 ± 0.098
3.824IleThr: 3.824 ± 0.081
4.8IleVal: 4.8 ± 0.091
0.618IleTrp: 0.618 ± 0.039
2.237IleTyr: 2.237 ± 0.076
0.0IleXaa: 0.0 ± 0.0
Lys
5.914LysAla: 5.914 ± 0.127
0.487LysCys: 0.487 ± 0.032
4.597LysAsp: 4.597 ± 0.091
6.324LysGlu: 6.324 ± 0.136
1.726LysPhe: 1.726 ± 0.062
4.772LysGly: 4.772 ± 0.105
0.967LysHis: 0.967 ± 0.042
4.081LysIle: 4.081 ± 0.092
5.678LysLys: 5.678 ± 0.124
4.982LysLeu: 4.982 ± 0.101
2.226LysMet: 2.226 ± 0.068
3.015LysAsn: 3.015 ± 0.083
2.162LysPro: 2.162 ± 0.076
1.689LysGln: 1.689 ± 0.06
3.057LysArg: 3.057 ± 0.085
3.272LysSer: 3.272 ± 0.08
3.447LysThr: 3.447 ± 0.088
4.116LysVal: 4.116 ± 0.093
0.596LysTrp: 0.596 ± 0.038
2.154LysTyr: 2.154 ± 0.069
0.0LysXaa: 0.0 ± 0.0
Leu
8.437LeuAla: 8.437 ± 0.145
1.175LeuCys: 1.175 ± 0.055
5.119LeuAsp: 5.119 ± 0.097
5.423LeuGlu: 5.423 ± 0.107
4.088LeuPhe: 4.088 ± 0.1
6.608LeuGly: 6.608 ± 0.138
2.035LeuHis: 2.035 ± 0.06
5.888LeuIle: 5.888 ± 0.119
5.801LeuLys: 5.801 ± 0.12
8.463LeuLeu: 8.463 ± 0.189
2.686LeuMet: 2.686 ± 0.071
3.055LeuAsn: 3.055 ± 0.08
4.169LeuPro: 4.169 ± 0.104
2.415LeuGln: 2.415 ± 0.072
4.064LeuArg: 4.064 ± 0.096
6.509LeuSer: 6.509 ± 0.118
5.325LeuThr: 5.325 ± 0.115
5.336LeuVal: 5.336 ± 0.111
0.857LeuTrp: 0.857 ± 0.042
3.125LeuTyr: 3.125 ± 0.084
0.0LeuXaa: 0.0 ± 0.0
Met
3.105MetAla: 3.105 ± 0.082
0.268MetCys: 0.268 ± 0.021
2.029MetAsp: 2.029 ± 0.065
2.327MetGlu: 2.327 ± 0.069
0.851MetPhe: 0.851 ± 0.042
2.368MetGly: 2.368 ± 0.075
0.54MetHis: 0.54 ± 0.03
1.998MetIle: 1.998 ± 0.061
2.772MetLys: 2.772 ± 0.074
2.454MetLeu: 2.454 ± 0.073
1.006MetMet: 1.006 ± 0.049
1.498MetAsn: 1.498 ± 0.054
1.182MetPro: 1.182 ± 0.047
0.932MetGln: 0.932 ± 0.035
1.443MetArg: 1.443 ± 0.053
1.772MetSer: 1.772 ± 0.064
1.943MetThr: 1.943 ± 0.053
1.789MetVal: 1.789 ± 0.063
0.195MetTrp: 0.195 ± 0.018
0.868MetTyr: 0.868 ± 0.041
0.0MetXaa: 0.0 ± 0.0
Asn
2.947AsnAla: 2.947 ± 0.072
0.403AsnCys: 0.403 ± 0.034
2.116AsnAsp: 2.116 ± 0.063
2.395AsnGlu: 2.395 ± 0.068
1.382AsnPhe: 1.382 ± 0.05
3.07AsnGly: 3.07 ± 0.084
0.917AsnHis: 0.917 ± 0.043
2.737AsnIle: 2.737 ± 0.077
2.239AsnLys: 2.239 ± 0.066
3.263AsnLeu: 3.263 ± 0.077
1.184AsnMet: 1.184 ± 0.051
1.292AsnAsn: 1.292 ± 0.056
2.066AsnPro: 2.066 ± 0.066
1.206AsnGln: 1.206 ± 0.053
1.642AsnArg: 1.642 ± 0.056
1.818AsnSer: 1.818 ± 0.071
2.013AsnThr: 2.013 ± 0.1
2.629AsnVal: 2.629 ± 0.074
0.406AsnTrp: 0.406 ± 0.028
1.3AsnTyr: 1.3 ± 0.052
0.0AsnXaa: 0.0 ± 0.0
Pro
3.368ProAla: 3.368 ± 0.099
0.43ProCys: 0.43 ± 0.026
2.805ProAsp: 2.805 ± 0.074
3.743ProGlu: 3.743 ± 0.1
1.926ProPhe: 1.926 ± 0.054
2.949ProGly: 2.949 ± 0.07
0.789ProHis: 0.789 ± 0.04
2.311ProIle: 2.311 ± 0.055
2.119ProLys: 2.119 ± 0.075
3.279ProLeu: 3.279 ± 0.076
1.022ProMet: 1.022 ± 0.046
1.195ProAsn: 1.195 ± 0.048
1.094ProPro: 1.094 ± 0.047
1.118ProGln: 1.118 ± 0.045
1.316ProArg: 1.316 ± 0.05
2.408ProSer: 2.408 ± 0.073
1.601ProThr: 1.601 ± 0.06
3.693ProVal: 3.693 ± 0.084
0.425ProTrp: 0.425 ± 0.031
1.483ProTyr: 1.483 ± 0.049
0.0ProXaa: 0.0 ± 0.0
Gln
2.46GlnAla: 2.46 ± 0.061
0.261GlnCys: 0.261 ± 0.025
1.518GlnAsp: 1.518 ± 0.058
2.035GlnGlu: 2.035 ± 0.069
1.033GlnPhe: 1.033 ± 0.042
1.825GlnGly: 1.825 ± 0.059
0.533GlnHis: 0.533 ± 0.031
2.04GlnIle: 2.04 ± 0.057
2.279GlnLys: 2.279 ± 0.068
2.221GlnLeu: 2.221 ± 0.064
1.011GlnMet: 1.011 ± 0.038
1.116GlnAsn: 1.116 ± 0.047
0.976GlnPro: 0.976 ± 0.042
0.993GlnGln: 0.993 ± 0.053
1.318GlnArg: 1.318 ± 0.052
1.456GlnSer: 1.456 ± 0.051
1.362GlnThr: 1.362 ± 0.046
1.77GlnVal: 1.77 ± 0.062
0.272GlnTrp: 0.272 ± 0.023
1.015GlnTyr: 1.015 ± 0.048
0.0GlnXaa: 0.0 ± 0.0
Arg
3.542ArgAla: 3.542 ± 0.089
0.493ArgCys: 0.493 ± 0.03
2.653ArgAsp: 2.653 ± 0.061
3.776ArgGlu: 3.776 ± 0.091
2.064ArgPhe: 2.064 ± 0.065
2.937ArgGly: 2.937 ± 0.08
1.007ArgHis: 1.007 ± 0.047
3.561ArgIle: 3.561 ± 0.074
3.647ArgLys: 3.647 ± 0.085
4.248ArgLeu: 4.248 ± 0.078
1.654ArgMet: 1.654 ± 0.055
2.059ArgAsn: 2.059 ± 0.069
1.798ArgPro: 1.798 ± 0.071
1.684ArgGln: 1.684 ± 0.06
2.678ArgArg: 2.678 ± 0.075
2.54ArgSer: 2.54 ± 0.068
2.377ArgThr: 2.377 ± 0.068
2.904ArgVal: 2.904 ± 0.075
0.513ArgTrp: 0.513 ± 0.031
1.687ArgTyr: 1.687 ± 0.055
0.0ArgXaa: 0.0 ± 0.0
Ser
4.923SerAla: 4.923 ± 0.115
0.717SerCys: 0.717 ± 0.042
3.384SerAsp: 3.384 ± 0.085
3.548SerGlu: 3.548 ± 0.082
2.879SerPhe: 2.879 ± 0.072
5.421SerGly: 5.421 ± 0.152
1.3SerHis: 1.3 ± 0.054
3.923SerIle: 3.923 ± 0.089
2.949SerLys: 2.949 ± 0.075
5.737SerLeu: 5.737 ± 0.106
1.75SerMet: 1.75 ± 0.062
1.735SerAsn: 1.735 ± 0.061
2.083SerPro: 2.083 ± 0.067
1.638SerGln: 1.638 ± 0.059
2.82SerArg: 2.82 ± 0.067
3.608SerSer: 3.608 ± 0.107
2.785SerThr: 2.785 ± 0.08
4.221SerVal: 4.221 ± 0.088
0.66SerTrp: 0.66 ± 0.037
2.213SerTyr: 2.213 ± 0.065
0.0SerXaa: 0.0 ± 0.0
Thr
4.82ThrAla: 4.82 ± 0.111
0.542ThrCys: 0.542 ± 0.032
2.982ThrAsp: 2.982 ± 0.078
3.314ThrGlu: 3.314 ± 0.082
2.044ThrPhe: 2.044 ± 0.066
4.892ThrGly: 4.892 ± 0.109
0.98ThrHis: 0.98 ± 0.041
4.132ThrIle: 4.132 ± 0.086
2.949ThrLys: 2.949 ± 0.072
4.871ThrLeu: 4.871 ± 0.095
1.562ThrMet: 1.562 ± 0.053
1.715ThrAsn: 1.715 ± 0.06
2.443ThrPro: 2.443 ± 0.072
1.182ThrGln: 1.182 ± 0.051
2.267ThrArg: 2.267 ± 0.066
2.746ThrSer: 2.746 ± 0.073
2.559ThrThr: 2.559 ± 0.079
4.233ThrVal: 4.233 ± 0.105
0.502ThrTrp: 0.502 ± 0.032
1.875ThrTyr: 1.875 ± 0.065
0.0ThrXaa: 0.0 ± 0.0
Val
4.785ValAla: 4.785 ± 0.11
0.884ValCys: 0.884 ± 0.043
3.653ValAsp: 3.653 ± 0.078
4.033ValGlu: 4.033 ± 0.103
3.037ValPhe: 3.037 ± 0.074
4.364ValGly: 4.364 ± 0.106
1.463ValHis: 1.463 ± 0.06
5.292ValIle: 5.292 ± 0.104
4.511ValLys: 4.511 ± 0.103
6.682ValLeu: 6.682 ± 0.115
2.055ValMet: 2.055 ± 0.07
2.715ValAsn: 2.715 ± 0.07
3.085ValPro: 3.085 ± 0.075
1.768ValGln: 1.768 ± 0.054
3.426ValArg: 3.426 ± 0.081
4.776ValSer: 4.776 ± 0.112
4.289ValThr: 4.289 ± 0.106
4.717ValVal: 4.717 ± 0.127
0.656ValTrp: 0.656 ± 0.033
2.403ValTyr: 2.403 ± 0.073
0.0ValXaa: 0.0 ± 0.0
Trp
0.647TrpAla: 0.647 ± 0.036
0.123TrpCys: 0.123 ± 0.015
0.579TrpAsp: 0.579 ± 0.036
0.588TrpGlu: 0.588 ± 0.034
0.414TrpPhe: 0.414 ± 0.026
0.66TrpGly: 0.66 ± 0.046
0.259TrpHis: 0.259 ± 0.025
0.752TrpIle: 0.752 ± 0.039
0.803TrpLys: 0.803 ± 0.041
0.881TrpLeu: 0.881 ± 0.039
0.386TrpMet: 0.386 ± 0.03
0.55TrpAsn: 0.55 ± 0.033
0.274TrpPro: 0.274 ± 0.023
0.366TrpGln: 0.366 ± 0.026
0.434TrpArg: 0.434 ± 0.027
0.485TrpSer: 0.485 ± 0.031
0.469TrpThr: 0.469 ± 0.027
0.517TrpVal: 0.517 ± 0.032
0.112TrpTrp: 0.112 ± 0.015
0.375TrpTyr: 0.375 ± 0.028
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.599TyrAla: 2.599 ± 0.072
0.449TyrCys: 0.449 ± 0.027
2.314TyrAsp: 2.314 ± 0.068
2.153TyrGlu: 2.153 ± 0.074
1.704TyrPhe: 1.704 ± 0.067
3.02TyrGly: 3.02 ± 0.091
0.829TyrHis: 0.829 ± 0.042
2.303TyrIle: 2.303 ± 0.065
1.926TyrLys: 1.926 ± 0.063
3.189TyrLeu: 3.189 ± 0.078
1.018TyrMet: 1.018 ± 0.04
1.312TyrAsn: 1.312 ± 0.045
1.406TyrPro: 1.406 ± 0.047
1.02TyrGln: 1.02 ± 0.05
1.851TyrArg: 1.851 ± 0.051
1.779TyrSer: 1.779 ± 0.064
1.954TyrThr: 1.954 ± 0.059
2.292TyrVal: 2.292 ± 0.072
0.344TyrTrp: 0.344 ± 0.024
1.336TyrTyr: 1.336 ± 0.061
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1809 proteins (544003 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski