Amino acid dipeptide frequency for Porphyromonas cangingivalis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
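A minimal sketch of how such frequencies could be computed. It assumes one-letter sequences, overlapping adjacent pairs that never span two proteins, and normalization by the total number of dipeptides; the page does not state the exact procedure, and the function name and toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never cross protein boundaries.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome (not taken from the database).
toy_proteome = ["MKLLA", "MLLK"]
freqs = dipeptide_frequencies(toy_proteome)
```

In the toy input there are 7 dipeptides in total and "LL" occurs twice, so its frequency is 2/7 of the total, expressed in per mille.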

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
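The uniform expectation and the over/under classification can be sketched as follows. This assumes the colouring threshold is simply the uniform expectation over the 400 standard dipeptides, which the page implies but does not spell out; the function name is hypothetical, and the observed values are copied from the table.

```python
N_STANDARD_DIPEPTIDES = 20 * 20  # 400 ordered pairs of standard amino acids

# Uniform expectation, in per mille: 1000 / 400 = 2.5
expected_per_mille = 1000 / N_STANDARD_DIPEPTIDES


def representation(observed_per_mille, expected=expected_per_mille):
    """Classify a dipeptide relative to the uniform expectation."""
    if observed_per_mille > expected:
        return "over"    # would be marked red
    if observed_per_mille < expected:
        return "under"   # would be marked blue
    return "as expected"


# A few observed frequencies (per mille) from the table.
observed = {"LeuLeu": 9.759, "AlaAla": 4.503, "TrpTrp": 0.144, "CysCys": 0.117}
labels = {pair: representation(value) for pair, value in observed.items()}
```

Under this assumption LeuLeu and AlaAla come out as overrepresented, while TrpTrp and CysCys come out as underrepresented.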

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
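As a small worked example of the unit conversion (the helper names are hypothetical), a table value in per mille can be turned into a fraction or a percentage like this:

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10


leu_leu = 9.759  # LeuLeu frequency in per mille, taken from the table
fraction = per_mille_to_fraction(leu_leu)
percent = per_mille_to_percent(leu_leu)
```

So the LeuLeu value of 9.759‰ corresponds to a fraction of roughly 0.0098, or just under 1%.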

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first residue; within each group the second residue runs in the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.
Ala
AlaAla: 4.503 ± 0.122
AlaCys: 0.778 ± 0.037
AlaAsp: 4.18 ± 0.086
AlaGlu: 5.01 ± 0.098
AlaPhe: 3.263 ± 0.077
AlaGly: 4.86 ± 0.098
AlaHis: 1.595 ± 0.046
AlaIle: 5.012 ± 0.098
AlaLys: 4.556 ± 0.096
AlaLeu: 7.992 ± 0.128
AlaMet: 2.1 ± 0.069
AlaAsn: 2.666 ± 0.07
AlaPro: 2.806 ± 0.082
AlaGln: 2.762 ± 0.071
AlaArg: 3.457 ± 0.073
AlaSer: 4.304 ± 0.08
AlaThr: 3.972 ± 0.091
AlaVal: 4.737 ± 0.082
AlaTrp: 0.69 ± 0.033
AlaTyr: 2.811 ± 0.075
AlaXaa: 0.002 ± 0.002
Cys
CysAla: 0.667 ± 0.036
CysCys: 0.117 ± 0.017
CysAsp: 0.566 ± 0.031
CysGlu: 0.558 ± 0.036
CysPhe: 0.436 ± 0.027
CysGly: 0.832 ± 0.039
CysHis: 0.302 ± 0.025
CysIle: 0.724 ± 0.034
CysLys: 0.502 ± 0.031
CysLeu: 0.835 ± 0.038
CysMet: 0.204 ± 0.016
CysAsn: 0.393 ± 0.029
CysPro: 0.498 ± 0.03
CysGln: 0.271 ± 0.019
CysArg: 0.555 ± 0.032
CysSer: 0.625 ± 0.031
CysThr: 0.548 ± 0.029
CysVal: 0.701 ± 0.037
CysTrp: 0.1 ± 0.014
CysTyr: 0.349 ± 0.025
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.972 ± 0.082
AspCys: 0.509 ± 0.031
AspAsp: 2.757 ± 0.07
AspGlu: 4.144 ± 0.087
AspPhe: 2.945 ± 0.063
AspGly: 3.848 ± 0.093
AspHis: 1.171 ± 0.049
AspIle: 4.503 ± 0.085
AspLys: 4.113 ± 0.098
AspLeu: 5.508 ± 0.096
AspMet: 1.537 ± 0.049
AspAsn: 2.361 ± 0.068
AspPro: 2.328 ± 0.05
AspGln: 1.313 ± 0.044
AspArg: 3.126 ± 0.071
AspSer: 2.896 ± 0.069
AspThr: 3.085 ± 0.075
AspVal: 3.799 ± 0.083
AspTrp: 0.609 ± 0.034
AspTyr: 2.54 ± 0.069
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.583 ± 0.111
GluCys: 0.547 ± 0.032
GluAsp: 3.76 ± 0.077
GluGlu: 5.379 ± 0.109
GluPhe: 2.215 ± 0.068
GluGly: 4.651 ± 0.098
GluHis: 1.454 ± 0.052
GluIle: 4.937 ± 0.093
GluLys: 4.568 ± 0.103
GluLeu: 6.124 ± 0.105
GluMet: 2.046 ± 0.061
GluAsn: 2.486 ± 0.082
GluPro: 1.78 ± 0.052
GluGln: 2.403 ± 0.062
GluArg: 3.992 ± 0.091
GluSer: 3.33 ± 0.074
GluThr: 3.565 ± 0.077
GluVal: 5.113 ± 0.099
GluTrp: 0.767 ± 0.037
GluTyr: 2.666 ± 0.066
GluXaa: 0.002 ± 0.002
Phe
PheAla: 3.32 ± 0.073
PheCys: 0.44 ± 0.029
PheAsp: 2.853 ± 0.073
PheGlu: 2.744 ± 0.065
PhePhe: 2.104 ± 0.064
PheGly: 3.436 ± 0.085
PheHis: 0.734 ± 0.031
PheIle: 2.892 ± 0.074
PheLys: 2.338 ± 0.062
PheLeu: 3.64 ± 0.078
PheMet: 1.165 ± 0.046
PheAsn: 1.873 ± 0.058
PhePro: 1.594 ± 0.045
PheGln: 0.972 ± 0.037
PheArg: 2.155 ± 0.06
PheSer: 3.3 ± 0.072
PheThr: 2.478 ± 0.06
PheVal: 3.558 ± 0.068
PheTrp: 0.442 ± 0.028
PheTyr: 1.605 ± 0.052
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.979 ± 0.109
GlyCys: 0.759 ± 0.039
GlyAsp: 3.664 ± 0.075
GlyGlu: 4.431 ± 0.083
GlyPhe: 3.054 ± 0.074
GlyGly: 5.069 ± 0.112
GlyHis: 1.573 ± 0.06
GlyIle: 5.242 ± 0.106
GlyLys: 4.873 ± 0.089
GlyLeu: 6.648 ± 0.109
GlyMet: 1.976 ± 0.063
GlyAsn: 2.671 ± 0.066
GlyPro: 1.238 ± 0.051
GlyGln: 2.137 ± 0.059
GlyArg: 3.594 ± 0.076
GlySer: 4.002 ± 0.093
GlyThr: 3.886 ± 0.081
GlyVal: 5.219 ± 0.105
GlyTrp: 0.777 ± 0.037
GlyTyr: 3.103 ± 0.078
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.222 ± 0.047
HisCys: 0.261 ± 0.025
HisAsp: 1.07 ± 0.043
HisGlu: 1.139 ± 0.042
HisPhe: 1.09 ± 0.04
HisGly: 1.383 ± 0.047
HisHis: 0.581 ± 0.031
HisIle: 1.695 ± 0.043
HisLys: 1.29 ± 0.044
HisLeu: 2.313 ± 0.07
HisMet: 0.44 ± 0.026
HisAsn: 0.949 ± 0.04
HisPro: 1.263 ± 0.055
HisGln: 0.628 ± 0.037
HisArg: 1.336 ± 0.052
HisSer: 1.375 ± 0.053
HisThr: 1.217 ± 0.042
HisVal: 1.142 ± 0.048
HisTrp: 0.197 ± 0.019
HisTyr: 0.878 ± 0.037
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.463 ± 0.1
IleCys: 0.667 ± 0.036
IleAsp: 4.566 ± 0.09
IleGlu: 4.987 ± 0.098
IlePhe: 2.894 ± 0.073
IleGly: 4.71 ± 0.097
IleHis: 1.579 ± 0.058
IleIle: 4.873 ± 0.102
IleLys: 4.273 ± 0.091
IleLeu: 6.434 ± 0.114
IleMet: 1.506 ± 0.05
IleAsn: 2.946 ± 0.077
IlePro: 3.4 ± 0.082
IleGln: 1.971 ± 0.062
IleArg: 3.76 ± 0.091
IleSer: 5.253 ± 0.108
IleThr: 4.1 ± 0.094
IleVal: 4.716 ± 0.1
IleTrp: 0.463 ± 0.026
IleTyr: 2.476 ± 0.074
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.922 ± 0.101
LysCys: 0.419 ± 0.025
LysAsp: 4.059 ± 0.095
LysGlu: 5.286 ± 0.121
LysPhe: 2.109 ± 0.064
LysGly: 4.749 ± 0.089
LysHis: 1.323 ± 0.047
LysIle: 4.191 ± 0.085
LysLys: 4.441 ± 0.11
LysLeu: 4.995 ± 0.098
LysMet: 2.127 ± 0.065
LysAsn: 2.573 ± 0.071
LysPro: 2.069 ± 0.063
LysGln: 2.008 ± 0.061
LysArg: 3.331 ± 0.073
LysSer: 3.79 ± 0.073
LysThr: 3.574 ± 0.08
LysVal: 4.7 ± 0.092
LysTrp: 0.633 ± 0.033
LysTyr: 2.688 ± 0.066
LysXaa: 0.003 ± 0.002
Leu
LeuAla: 6.574 ± 0.107
LeuCys: 1.135 ± 0.045
LeuAsp: 5.056 ± 0.09
LeuGlu: 5.238 ± 0.094
LeuPhe: 4.684 ± 0.097
LeuGly: 6.346 ± 0.114
LeuHis: 1.927 ± 0.051
LeuIle: 6.322 ± 0.122
LeuLys: 5.976 ± 0.098
LeuLeu: 9.759 ± 0.176
LeuMet: 2.535 ± 0.067
LeuAsn: 3.961 ± 0.092
LeuPro: 4.476 ± 0.095
LeuGln: 2.866 ± 0.069
LeuArg: 5.444 ± 0.125
LeuSer: 8.333 ± 0.128
LeuThr: 5.346 ± 0.094
LeuVal: 5.501 ± 0.111
LeuTrp: 0.91 ± 0.043
LeuTyr: 3.609 ± 0.079
LeuXaa: 0.002 ± 0.002
Met
MetAla: 2.181 ± 0.068
MetCys: 0.233 ± 0.02
MetAsp: 1.605 ± 0.042
MetGlu: 1.651 ± 0.048
MetPhe: 0.863 ± 0.039
MetGly: 2.091 ± 0.057
MetHis: 0.488 ± 0.029
MetIle: 1.87 ± 0.057
MetLys: 2.139 ± 0.056
MetLeu: 2.406 ± 0.064
MetMet: 0.83 ± 0.039
MetAsn: 1.25 ± 0.046
MetPro: 1.183 ± 0.046
MetGln: 0.85 ± 0.036
MetArg: 1.507 ± 0.059
MetSer: 1.884 ± 0.05
MetThr: 1.656 ± 0.055
MetVal: 1.524 ± 0.052
MetTrp: 0.232 ± 0.022
MetTyr: 0.827 ± 0.036
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.012 ± 0.074
AsnCys: 0.294 ± 0.024
AsnAsp: 2.122 ± 0.066
AsnGlu: 2.525 ± 0.067
AsnPhe: 1.744 ± 0.053
AsnGly: 2.723 ± 0.076
AsnHis: 0.825 ± 0.036
AsnIle: 3.237 ± 0.074
AsnLys: 2.829 ± 0.081
AsnLeu: 3.69 ± 0.079
AsnMet: 1.121 ± 0.043
AsnAsn: 1.964 ± 0.06
AsnPro: 2.176 ± 0.059
AsnGln: 1.134 ± 0.044
AsnArg: 1.953 ± 0.061
AsnSer: 2.199 ± 0.056
AsnThr: 2.403 ± 0.07
AsnVal: 2.579 ± 0.068
AsnTrp: 0.434 ± 0.026
AsnTyr: 1.628 ± 0.055
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.61 ± 0.06
ProCys: 0.3 ± 0.024
ProAsp: 2.527 ± 0.067
ProGlu: 3.418 ± 0.076
ProPhe: 1.853 ± 0.056
ProGly: 2.349 ± 0.068
ProHis: 0.935 ± 0.042
ProIle: 2.764 ± 0.06
ProLys: 2.439 ± 0.071
ProLeu: 3.6 ± 0.08
ProMet: 1.126 ± 0.046
ProAsn: 1.594 ± 0.053
ProPro: 1.093 ± 0.046
ProGln: 1.431 ± 0.045
ProArg: 1.594 ± 0.049
ProSer: 2.767 ± 0.073
ProThr: 2.372 ± 0.068
ProVal: 2.566 ± 0.069
ProTrp: 0.37 ± 0.024
ProTyr: 1.698 ± 0.047
ProXaa: 0.002 ± 0.002
Gln
GlnAla: 2.419 ± 0.075
GlnCys: 0.24 ± 0.021
GlnAsp: 1.64 ± 0.057
GlnGlu: 2.286 ± 0.067
GlnPhe: 1.015 ± 0.038
GlnGly: 2.18 ± 0.065
GlnHis: 0.628 ± 0.032
GlnIle: 2.098 ± 0.059
GlnLys: 2.253 ± 0.068
GlnLeu: 2.75 ± 0.083
GlnMet: 0.953 ± 0.039
GlnAsn: 1.356 ± 0.047
GlnPro: 1.039 ± 0.043
GlnGln: 1.108 ± 0.062
GlnArg: 1.666 ± 0.052
GlnSer: 1.945 ± 0.063
GlnThr: 1.684 ± 0.053
GlnVal: 2.114 ± 0.061
GlnTrp: 0.343 ± 0.021
GlnTyr: 1.16 ± 0.05
GlnXaa: 0.002 ± 0.002
Arg
ArgAla: 3.52 ± 0.084
ArgCys: 0.445 ± 0.025
ArgAsp: 2.697 ± 0.064
ArgGlu: 3.842 ± 0.085
ArgPhe: 2.232 ± 0.063
ArgGly: 3.016 ± 0.078
ArgHis: 1.286 ± 0.049
ArgIle: 3.817 ± 0.074
ArgLys: 3.555 ± 0.078
ArgLeu: 5.542 ± 0.114
ArgMet: 1.666 ± 0.056
ArgAsn: 2.083 ± 0.056
ArgPro: 2.013 ± 0.052
ArgGln: 1.896 ± 0.058
ArgArg: 3.046 ± 0.08
ArgSer: 3.116 ± 0.074
ArgThr: 2.812 ± 0.079
ArgVal: 3.137 ± 0.066
ArgTrp: 0.61 ± 0.033
ArgTyr: 2.246 ± 0.065
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.721 ± 0.077
SerCys: 0.721 ± 0.038
SerAsp: 3.806 ± 0.072
SerGlu: 4.047 ± 0.078
SerPhe: 3.186 ± 0.088
SerGly: 4.734 ± 0.092
SerHis: 1.377 ± 0.049
SerIle: 4.749 ± 0.088
SerLys: 3.798 ± 0.073
SerLeu: 6.545 ± 0.121
SerMet: 1.594 ± 0.055
SerAsn: 2.387 ± 0.056
SerPro: 2.814 ± 0.078
SerGln: 1.948 ± 0.06
SerArg: 3.15 ± 0.077
SerSer: 4.304 ± 0.105
SerThr: 3.459 ± 0.08
SerVal: 4.277 ± 0.092
SerTrp: 0.6 ± 0.031
SerTyr: 2.773 ± 0.071
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.985 ± 0.072
ThrCys: 0.488 ± 0.029
ThrAsp: 3.158 ± 0.076
ThrGlu: 3.299 ± 0.08
ThrPhe: 2.736 ± 0.067
ThrGly: 3.819 ± 0.082
ThrHis: 1.312 ± 0.054
ThrIle: 3.953 ± 0.081
ThrLys: 3.183 ± 0.072
ThrLeu: 6.333 ± 0.107
ThrMet: 1.264 ± 0.046
ThrAsn: 2.134 ± 0.064
ThrPro: 3.108 ± 0.072
ThrGln: 1.666 ± 0.057
ThrArg: 2.323 ± 0.064
ThrSer: 3.507 ± 0.087
ThrThr: 3.251 ± 0.081
ThrVal: 3.78 ± 0.095
ThrTrp: 0.509 ± 0.032
ThrTyr: 2.325 ± 0.06
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.954 ± 0.1
ValCys: 0.866 ± 0.042
ValAsp: 3.842 ± 0.097
ValGlu: 4.346 ± 0.097
ValPhe: 2.861 ± 0.077
ValGly: 4.583 ± 0.103
ValHis: 1.199 ± 0.045
ValIle: 4.777 ± 0.087
ValLys: 3.927 ± 0.095
ValLeu: 6.149 ± 0.124
ValMet: 1.821 ± 0.06
ValAsn: 2.48 ± 0.068
ValPro: 2.638 ± 0.073
ValGln: 1.932 ± 0.061
ValArg: 3.744 ± 0.074
ValSer: 5.075 ± 0.089
ValThr: 3.708 ± 0.087
ValVal: 4.896 ± 0.098
ValTrp: 0.675 ± 0.034
ValTyr: 2.504 ± 0.072
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.653 ± 0.03
TrpCys: 0.122 ± 0.014
TrpAsp: 0.618 ± 0.034
TrpGlu: 0.595 ± 0.031
TrpPhe: 0.411 ± 0.025
TrpGly: 0.773 ± 0.037
TrpHis: 0.287 ± 0.021
TrpIle: 0.718 ± 0.033
TrpLys: 0.542 ± 0.029
TrpLeu: 1.008 ± 0.04
TrpMet: 0.336 ± 0.027
TrpAsn: 0.424 ± 0.026
TrpPro: 0.153 ± 0.017
TrpGln: 0.439 ± 0.025
TrpArg: 0.516 ± 0.029
TrpSer: 0.576 ± 0.029
TrpThr: 0.532 ± 0.031
TrpVal: 0.612 ± 0.035
TrpTrp: 0.144 ± 0.015
TrpTyr: 0.398 ± 0.024
TrpXaa: 0.002 ± 0.001
Tyr
TyrAla: 2.793 ± 0.062
TyrCys: 0.44 ± 0.028
TyrAsp: 2.516 ± 0.071
TyrGlu: 2.441 ± 0.068
TyrPhe: 1.949 ± 0.054
TyrGly: 2.739 ± 0.072
TyrHis: 0.896 ± 0.04
TyrIle: 2.664 ± 0.069
TyrLys: 2.4 ± 0.075
TyrLeu: 3.728 ± 0.083
TyrMet: 0.922 ± 0.035
TyrAsn: 2.042 ± 0.065
TyrPro: 1.739 ± 0.056
TyrGln: 1.14 ± 0.048
TyrArg: 2.33 ± 0.064
TyrSer: 2.41 ± 0.067
TyrThr: 2.483 ± 0.063
TyrVal: 2.302 ± 0.064
TyrTrp: 0.354 ± 0.022
TyrTyr: 1.724 ± 0.064
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.003 ± 0.003
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.002 ± 0.002
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.002 ± 0.002
XaaArg: 0.0 ± 0.0
XaaSer: 0.002 ± 0.001
XaaThr: 0.002 ± 0.002
XaaVal: 0.002 ± 0.002
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.002 ± 0.002
XaaXaa: 0.0 ± 0.0
Statistics based on 1769 proteins (612983 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
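One way a protein-level bootstrap could look is sketched below. The page does not say which statistic is reported as the error, so this sketch assumes the standard deviation of the per mille frequency across 100 resamples of whole proteins drawn with replacement. The function name, the seed, and the toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of a dipeptide frequency (per mille) over resamples."""
    rng = random.Random(seed)
    # Count dipeptides once per protein; whole proteins are the resampling unit.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy proteome (not taken from the database).
toy_proteome = ["MKLLA", "MLLK", "MAAG", "MKKLLLA"]
error_ll = bootstrap_error(toy_proteome, "LL")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within a protein together, so the estimate reflects variation between proteins.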

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski