Amino acid dipeptide frequency for Glaciecola nitratireducens (strain JCM 12485 / KCTC 12276 / FR1064)

In addition to single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
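As an illustration of how such frequencies could be obtained, the following is a minimal sketch that counts overlapping residue pairs within each protein and expresses them in per mille. It is not the code used to build this page: the function name `dipeptide_permille` and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs (0,1), (1,2), ...; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from this proteome.
toy = ["MKAAL", "AAG"]
freqs = dipeptide_permille(toy)

# Baseline if all 400 standard dipeptides were equally likely: 2.5 per mille.
uniform = 1000.0 / 400
```

In this toy input, "AA" is counted twice among six pairs, so its frequency is far above the 2.5‰ baseline; real proteomes give much smaller deviations.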

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
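The unit conversion and the comparison with the equal-probability baseline can be sketched as follows. The helper `describe` is hypothetical, and using 1/400 (2.5‰) as the threshold for over- or underrepresentation is an assumption based on the description above, not a confirmed detail of how the page colors its cells.

```python
PER_MILLE = 1e-3                  # 1 per mille = 0.001
UNIFORM_PERMILLE = 1000.0 / 400   # assumed baseline: 2.5 per mille


def describe(value_permille):
    """Return (fraction, ratio to the uniform baseline, label) for a table value."""
    fraction = value_permille * PER_MILLE
    ratio = value_permille / UNIFORM_PERMILLE
    if ratio > 1:
        label = "over"
    elif ratio < 1:
        label = "under"
    else:
        label = "as expected"
    return fraction, ratio, label


# Values copied from the table: AlaAla = 7.527, CysCys = 0.139 (both in per mille).
ala_ala = describe(7.527)
cys_cys = describe(0.139)
```

Under this assumption, AlaAla is roughly three times more frequent than the baseline, while CysCys is well below it.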

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second (for example, row Ala, column Cys is AlaCys). Each cell shows frequency ± error in ‰.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 7.527 ± 0.1 | 0.927 ± 0.028 | 4.852 ± 0.067 | 5.421 ± 0.072 | 3.696 ± 0.06 | 5.626 ± 0.07 | 1.548 ± 0.038 | 6.144 ± 0.079 | 5.416 ± 0.082 | 9.349 ± 0.092 | 2.436 ± 0.047 | 4.117 ± 0.061 | 2.804 ± 0.055 | 3.642 ± 0.06 | 3.375 ± 0.053 | 5.902 ± 0.07 | 4.416 ± 0.069 | 5.836 ± 0.077 | 0.848 ± 0.029 | 2.393 ± 0.048 | 0.0 ± 0.0 |
| Cys | 0.811 ± 0.027 | 0.139 ± 0.012 | 0.572 ± 0.022 | 0.587 ± 0.026 | 0.473 ± 0.021 | 0.721 ± 0.027 | 0.276 ± 0.015 | 0.675 ± 0.027 | 0.47 ± 0.023 | 0.892 ± 0.028 | 0.225 ± 0.013 | 0.357 ± 0.017 | 0.395 ± 0.019 | 0.378 ± 0.018 | 0.385 ± 0.018 | 0.666 ± 0.021 | 0.411 ± 0.017 | 0.643 ± 0.026 | 0.1 ± 0.009 | 0.315 ± 0.016 | 0.0 ± 0.0 |
| Asp | 4.85 ± 0.07 | 0.482 ± 0.024 | 3.55 ± 0.063 | 4.045 ± 0.069 | 2.722 ± 0.051 | 3.797 ± 0.07 | 0.951 ± 0.033 | 4.533 ± 0.063 | 3.469 ± 0.056 | 5.301 ± 0.072 | 1.526 ± 0.037 | 2.652 ± 0.045 | 2.038 ± 0.045 | 1.762 ± 0.04 | 2.137 ± 0.047 | 3.585 ± 0.062 | 2.943 ± 0.054 | 4.154 ± 0.061 | 0.822 ± 0.026 | 2.035 ± 0.042 | 0.0 ± 0.0 |
| Glu | 5.149 ± 0.069 | 0.455 ± 0.018 | 3.01 ± 0.057 | 3.627 ± 0.069 | 2.631 ± 0.047 | 3.429 ± 0.059 | 1.475 ± 0.037 | 4.073 ± 0.063 | 3.959 ± 0.072 | 6.748 ± 0.08 | 1.633 ± 0.037 | 3.132 ± 0.053 | 1.848 ± 0.041 | 3.679 ± 0.066 | 3.117 ± 0.061 | 4.076 ± 0.056 | 3.121 ± 0.056 | 4.061 ± 0.066 | 0.689 ± 0.025 | 1.836 ± 0.043 | 0.0 ± 0.0 |
| Phe | 3.92 ± 0.066 | 0.488 ± 0.021 | 3.045 ± 0.051 | 2.817 ± 0.05 | 1.959 ± 0.046 | 3.204 ± 0.056 | 0.795 ± 0.026 | 3.073 ± 0.058 | 2.213 ± 0.048 | 3.642 ± 0.055 | 1.047 ± 0.032 | 2.118 ± 0.041 | 1.396 ± 0.04 | 1.231 ± 0.036 | 1.478 ± 0.04 | 3.706 ± 0.061 | 2.573 ± 0.046 | 3.072 ± 0.052 | 0.511 ± 0.023 | 1.38 ± 0.036 | 0.0 ± 0.0 |
| Gly | 5.01 ± 0.086 | 0.779 ± 0.026 | 3.627 ± 0.062 | 3.892 ± 0.055 | 3.368 ± 0.056 | 4.476 ± 0.085 | 1.408 ± 0.037 | 4.728 ± 0.065 | 4.033 ± 0.063 | 6.747 ± 0.079 | 1.824 ± 0.04 | 2.785 ± 0.062 | 1.632 ± 0.039 | 2.441 ± 0.046 | 2.706 ± 0.045 | 4.091 ± 0.062 | 3.352 ± 0.062 | 4.803 ± 0.072 | 0.782 ± 0.026 | 2.296 ± 0.049 | 0.0 ± 0.0 |
| His | 1.64 ± 0.035 | 0.289 ± 0.018 | 1.058 ± 0.031 | 1.169 ± 0.03 | 1.017 ± 0.026 | 1.364 ± 0.031 | 0.571 ± 0.025 | 1.407 ± 0.037 | 1.118 ± 0.028 | 2.016 ± 0.042 | 0.46 ± 0.021 | 0.843 ± 0.027 | 1.01 ± 0.03 | 0.98 ± 0.028 | 0.888 ± 0.026 | 1.362 ± 0.036 | 0.988 ± 0.03 | 1.291 ± 0.033 | 0.325 ± 0.017 | 0.761 ± 0.027 | 0.0 ± 0.0 |
| Ile | 6.671 ± 0.079 | 0.71 ± 0.02 | 4.753 ± 0.069 | 4.982 ± 0.076 | 2.546 ± 0.05 | 4.723 ± 0.07 | 1.188 ± 0.032 | 4.113 ± 0.063 | 3.794 ± 0.064 | 5.607 ± 0.081 | 1.427 ± 0.032 | 3.636 ± 0.064 | 2.604 ± 0.044 | 2.481 ± 0.044 | 2.716 ± 0.046 | 5.069 ± 0.066 | 3.852 ± 0.055 | 4.621 ± 0.06 | 0.619 ± 0.026 | 1.782 ± 0.042 | 0.0 ± 0.0 |
| Lys | 5.099 ± 0.075 | 0.367 ± 0.019 | 2.91 ± 0.053 | 3.414 ± 0.06 | 1.833 ± 0.035 | 3.284 ± 0.053 | 1.354 ± 0.039 | 3.466 ± 0.05 | 3.485 ± 0.066 | 5.813 ± 0.063 | 1.56 ± 0.035 | 2.747 ± 0.05 | 2.371 ± 0.044 | 3.29 ± 0.059 | 2.861 ± 0.048 | 3.705 ± 0.05 | 3.393 ± 0.06 | 3.973 ± 0.055 | 0.631 ± 0.024 | 1.599 ± 0.033 | 0.0 ± 0.0 |
| Leu | 9.21 ± 0.089 | 1.025 ± 0.036 | 5.538 ± 0.068 | 5.642 ± 0.072 | 4.356 ± 0.069 | 6.508 ± 0.078 | 2.019 ± 0.047 | 6.501 ± 0.089 | 5.384 ± 0.079 | 10.377 ± 0.143 | 2.544 ± 0.041 | 4.883 ± 0.053 | 4.465 ± 0.072 | 4.003 ± 0.065 | 4.236 ± 0.065 | 8.517 ± 0.088 | 5.683 ± 0.071 | 6.906 ± 0.075 | 0.892 ± 0.03 | 2.616 ± 0.047 | 0.0 ± 0.0 |
| Met | 2.254 ± 0.04 | 0.189 ± 0.012 | 1.181 ± 0.031 | 1.094 ± 0.027 | 0.951 ± 0.03 | 1.657 ± 0.04 | 0.602 ± 0.021 | 1.48 ± 0.032 | 1.477 ± 0.032 | 2.912 ± 0.062 | 0.705 ± 0.029 | 1.156 ± 0.035 | 1.256 ± 0.03 | 1.401 ± 0.035 | 1.189 ± 0.03 | 1.972 ± 0.043 | 1.626 ± 0.036 | 1.529 ± 0.033 | 0.217 ± 0.013 | 0.548 ± 0.022 | 0.0 ± 0.0 |
| Asn | 4.186 ± 0.066 | 0.43 ± 0.021 | 2.697 ± 0.047 | 2.92 ± 0.054 | 1.801 ± 0.041 | 3.051 ± 0.06 | 0.838 ± 0.024 | 3.365 ± 0.058 | 2.941 ± 0.05 | 4.321 ± 0.068 | 1.181 ± 0.03 | 2.428 ± 0.054 | 2.053 ± 0.04 | 2.085 ± 0.043 | 2.044 ± 0.04 | 3.063 ± 0.059 | 2.637 ± 0.046 | 3.074 ± 0.057 | 0.607 ± 0.023 | 1.446 ± 0.036 | 0.0 ± 0.0 |
| Pro | 2.88 ± 0.058 | 0.278 ± 0.016 | 2.395 ± 0.052 | 2.88 ± 0.054 | 1.769 ± 0.038 | 2.133 ± 0.046 | 0.768 ± 0.024 | 2.696 ± 0.045 | 2.041 ± 0.041 | 3.699 ± 0.051 | 0.924 ± 0.029 | 1.913 ± 0.045 | 1.08 ± 0.033 | 1.439 ± 0.037 | 1.274 ± 0.032 | 2.637 ± 0.043 | 2.007 ± 0.041 | 2.707 ± 0.052 | 0.444 ± 0.019 | 1.108 ± 0.031 | 0.0 ± 0.0 |
| Gln | 3.989 ± 0.059 | 0.315 ± 0.016 | 2.051 ± 0.041 | 2.338 ± 0.054 | 1.787 ± 0.039 | 2.574 ± 0.042 | 1.147 ± 0.029 | 2.687 ± 0.04 | 2.473 ± 0.043 | 4.737 ± 0.07 | 1.056 ± 0.034 | 1.903 ± 0.043 | 1.499 ± 0.034 | 2.725 ± 0.054 | 2.183 ± 0.04 | 2.972 ± 0.049 | 2.343 ± 0.045 | 2.848 ± 0.048 | 0.549 ± 0.024 | 1.333 ± 0.035 | 0.0 ± 0.0 |
| Arg | 3.314 ± 0.058 | 0.344 ± 0.018 | 2.334 ± 0.042 | 2.787 ± 0.05 | 2.155 ± 0.038 | 2.384 ± 0.052 | 0.88 ± 0.025 | 3.006 ± 0.047 | 2.427 ± 0.055 | 4.55 ± 0.066 | 1.065 ± 0.026 | 1.964 ± 0.041 | 1.44 ± 0.036 | 1.933 ± 0.042 | 2.04 ± 0.04 | 2.563 ± 0.047 | 2.049 ± 0.042 | 3.001 ± 0.054 | 0.535 ± 0.024 | 1.492 ± 0.037 | 0.0 ± 0.0 |
| Ser | 6.166 ± 0.078 | 0.586 ± 0.02 | 4.152 ± 0.063 | 4.327 ± 0.064 | 3.113 ± 0.054 | 5.009 ± 0.082 | 1.363 ± 0.032 | 5.069 ± 0.07 | 3.932 ± 0.059 | 7.26 ± 0.084 | 1.88 ± 0.039 | 3.127 ± 0.057 | 2.559 ± 0.042 | 2.918 ± 0.049 | 2.749 ± 0.05 | 5.007 ± 0.072 | 3.756 ± 0.053 | 4.893 ± 0.068 | 0.74 ± 0.027 | 2.057 ± 0.039 | 0.0 ± 0.0 |
| Thr | 4.449 ± 0.072 | 0.493 ± 0.022 | 3.027 ± 0.054 | 3.263 ± 0.052 | 2.306 ± 0.044 | 3.925 ± 0.066 | 1.135 ± 0.031 | 3.615 ± 0.057 | 2.74 ± 0.05 | 5.995 ± 0.076 | 1.211 ± 0.031 | 2.343 ± 0.049 | 2.459 ± 0.054 | 2.45 ± 0.044 | 2.097 ± 0.043 | 3.723 ± 0.061 | 2.944 ± 0.064 | 3.671 ± 0.062 | 0.593 ± 0.022 | 1.512 ± 0.033 | 0.0 ± 0.0 |
| Val | 5.976 ± 0.077 | 0.722 ± 0.026 | 4.406 ± 0.062 | 4.318 ± 0.073 | 3.016 ± 0.058 | 4.309 ± 0.057 | 1.267 ± 0.033 | 4.902 ± 0.061 | 3.785 ± 0.056 | 6.666 ± 0.08 | 1.767 ± 0.037 | 3.469 ± 0.056 | 2.552 ± 0.047 | 2.228 ± 0.044 | 2.661 ± 0.05 | 5.226 ± 0.07 | 3.935 ± 0.066 | 5.112 ± 0.072 | 0.695 ± 0.023 | 1.844 ± 0.041 | 0.0 ± 0.0 |
| Trp | 0.722 ± 0.027 | 0.128 ± 0.009 | 0.552 ± 0.022 | 0.537 ± 0.022 | 0.595 ± 0.022 | 0.656 ± 0.025 | 0.312 ± 0.015 | 0.662 ± 0.027 | 0.483 ± 0.023 | 1.499 ± 0.04 | 0.301 ± 0.018 | 0.416 ± 0.022 | 0.404 ± 0.018 | 0.82 ± 0.03 | 0.61 ± 0.021 | 0.687 ± 0.026 | 0.529 ± 0.022 | 0.778 ± 0.022 | 0.169 ± 0.012 | 0.348 ± 0.016 | 0.0 ± 0.0 |
| Tyr | 2.454 ± 0.047 | 0.343 ± 0.017 | 1.645 ± 0.043 | 1.689 ± 0.041 | 1.576 ± 0.037 | 1.95 ± 0.04 | 0.66 ± 0.022 | 1.698 ± 0.039 | 1.511 ± 0.039 | 3.177 ± 0.056 | 0.621 ± 0.025 | 1.177 ± 0.035 | 1.209 ± 0.029 | 1.625 ± 0.035 | 1.52 ± 0.039 | 2.132 ± 0.045 | 1.391 ± 0.037 | 1.883 ± 0.04 | 0.439 ± 0.018 | 0.997 ± 0.031 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 3653 proteins (1214097 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
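A protein-level bootstrap of this kind could look like the sketch below. It is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the sample standard deviation of those estimates is reported as the error. The page does not state which spread measure was used, and the function names are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the per-mille error of one dipeptide by resampling proteins."""
    rng = random.Random(seed)
    # Precompute (hits, total pairs) per protein so each round is cheap.
    per_protein = []
    for seq in sequences:
        counts = pair_counts(seq)
        per_protein.append((counts[pair], sum(counts.values())))
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (assumed scheme).
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy proteome; real input would be the full set of protein sequences.
toy = ["MKAAL", "AAG", "GLLKA", "MAAAL"]
err = bootstrap_error(toy, "AA")
```

Resampling at the protein level rather than at the residue-pair level keeps pairs from the same protein together, so the estimate reflects variation between proteins.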

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski