Amino acid dipeptide frequency for Novosphingobium sp. AAP1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. the frequency of each ordered pair of adjacent amino acids.

There are 400 possibilities (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random and with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
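The uniform baseline described above can be sketched in a few lines. This is only an illustration: the function name `classify` and the strict greater-than/less-than colouring rule are assumptions, since the page does not state exactly how "more than expected" is decided. The two observed values are taken from the table (in ‰).

```python
# Twenty standard amino acids (three-letter codes), as used in the table.
AMINO_ACIDS = [
    "Ala", "Cys", "Asp", "Glu", "Phe", "Gly", "His", "Ile", "Lys", "Leu",
    "Met", "Asn", "Pro", "Gln", "Arg", "Ser", "Thr", "Val", "Trp", "Tyr",
]

n_dipeptides = len(AMINO_ACIDS) ** 2            # 20 * 20 ordered pairs
n_dipeptides_with_xaa = (len(AMINO_ACIDS) + 1) ** 2  # with Xaa as 21st residue

# Expected frequency if every dipeptide were equally likely, in per mille.
expected_permille = 1000 / n_dipeptides


def classify(observed_permille, expected=expected_permille):
    """Hypothetical colouring rule: red if above the baseline, blue if below."""
    if observed_permille > expected:
        return "red"
    if observed_permille < expected:
        return "blue"
    return "neutral"


# Values from the table for this proteome (per mille).
print("AlaAla", classify(21.669))
print("CysCys", classify(0.098))
```

Note that this baseline ignores the unequal abundance of single amino acids; a stricter comparison would use the product of the two single-residue frequencies instead.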

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
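As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage. The helper names are illustrative only; the AlaAla value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10


ala_ala_permille = 21.669  # AlaAla frequency from the table
fraction = permille_to_fraction(ala_ala_permille)
percent = permille_to_percent(ala_ala_permille)
print(f"AlaAla: {fraction:.6f} as a fraction, {percent:.4f} %")
```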

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 21.669 ± 0.187
AlaCys: 1.145 ± 0.031
AlaAsp: 8.364 ± 0.097
AlaGlu: 6.864 ± 0.092
AlaPhe: 4.41 ± 0.063
AlaGly: 12.507 ± 0.118
AlaHis: 2.874 ± 0.052
AlaIle: 6.423 ± 0.074
AlaLys: 3.693 ± 0.063
AlaLeu: 16.241 ± 0.174
AlaMet: 4.374 ± 0.057
AlaAsn: 3.245 ± 0.047
AlaPro: 7.665 ± 0.104
AlaGln: 6.372 ± 0.085
AlaArg: 10.916 ± 0.114
AlaSer: 6.542 ± 0.083
AlaThr: 7.268 ± 0.077
AlaVal: 9.46 ± 0.102
AlaTrp: 2.13 ± 0.05
AlaTyr: 2.651 ± 0.049
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.125 ± 0.033
CysCys: 0.098 ± 0.009
CysAsp: 0.469 ± 0.019
CysGlu: 0.326 ± 0.015
CysPhe: 0.27 ± 0.016
CysGly: 0.856 ± 0.024
CysHis: 0.236 ± 0.013
CysIle: 0.326 ± 0.016
CysLys: 0.144 ± 0.01
CysLeu: 0.774 ± 0.025
CysMet: 0.146 ± 0.01
CysAsn: 0.212 ± 0.014
CysPro: 0.503 ± 0.024
CysGln: 0.255 ± 0.015
CysArg: 0.521 ± 0.019
CysSer: 0.396 ± 0.017
CysThr: 0.416 ± 0.018
CysVal: 0.582 ± 0.021
CysTrp: 0.133 ± 0.01
CysTyr: 0.167 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.09 ± 0.095
AspCys: 0.48 ± 0.021
AspAsp: 3.094 ± 0.054
AspGlu: 2.855 ± 0.052
AspPhe: 2.033 ± 0.041
AspGly: 5.641 ± 0.069
AspHis: 1.475 ± 0.038
AspIle: 2.445 ± 0.05
AspLys: 1.595 ± 0.037
AspLeu: 5.878 ± 0.064
AspMet: 1.341 ± 0.032
AspAsn: 1.327 ± 0.036
AspPro: 3.785 ± 0.059
AspGln: 1.82 ± 0.038
AspArg: 4.294 ± 0.06
AspSer: 2.247 ± 0.046
AspThr: 2.712 ± 0.048
AspVal: 4.128 ± 0.057
AspTrp: 1.117 ± 0.029
AspTyr: 1.55 ± 0.036
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.825 ± 0.095
GluCys: 0.328 ± 0.017
GluAsp: 2.373 ± 0.047
GluGlu: 2.382 ± 0.054
GluPhe: 1.365 ± 0.034
GluGly: 3.893 ± 0.053
GluHis: 1.056 ± 0.031
GluIle: 2.346 ± 0.049
GluLys: 1.456 ± 0.043
GluLeu: 4.226 ± 0.063
GluMet: 1.163 ± 0.032
GluAsn: 1.187 ± 0.031
GluPro: 2.225 ± 0.044
GluGln: 1.991 ± 0.041
GluArg: 4.058 ± 0.067
GluSer: 1.927 ± 0.039
GluThr: 2.819 ± 0.047
GluVal: 3.186 ± 0.055
GluTrp: 0.755 ± 0.025
GluTyr: 0.85 ± 0.029
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.211 ± 0.062
PheCys: 0.31 ± 0.015
PheAsp: 2.566 ± 0.045
PheGlu: 1.601 ± 0.035
PhePhe: 1.204 ± 0.03
PheGly: 3.517 ± 0.065
PheHis: 0.771 ± 0.024
PheIle: 1.314 ± 0.033
PheLys: 0.781 ± 0.028
PheLeu: 2.981 ± 0.046
PheMet: 0.711 ± 0.024
PheAsn: 1.102 ± 0.031
PhePro: 1.539 ± 0.04
PheGln: 0.857 ± 0.025
PheArg: 1.999 ± 0.037
PheSer: 1.906 ± 0.041
PheThr: 2.009 ± 0.039
PheVal: 2.475 ± 0.046
PheTrp: 0.516 ± 0.022
PheTyr: 0.954 ± 0.036
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.987 ± 0.108
GlyCys: 0.804 ± 0.027
GlyAsp: 4.6 ± 0.061
GlyGlu: 4.186 ± 0.055
GlyPhe: 3.595 ± 0.059
GlyGly: 8.017 ± 0.123
GlyHis: 2.201 ± 0.045
GlyIle: 4.138 ± 0.067
GlyLys: 3.071 ± 0.054
GlyLeu: 9.485 ± 0.098
GlyMet: 2.408 ± 0.047
GlyAsn: 2.398 ± 0.052
GlyPro: 3.909 ± 0.056
GlyGln: 3.753 ± 0.06
GlyArg: 6.188 ± 0.077
GlySer: 4.573 ± 0.068
GlyThr: 5.205 ± 0.082
GlyVal: 6.581 ± 0.074
GlyTrp: 1.694 ± 0.046
GlyTyr: 2.292 ± 0.046
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.055 ± 0.054
HisCys: 0.24 ± 0.013
HisAsp: 1.395 ± 0.033
HisGlu: 0.963 ± 0.027
HisPhe: 0.906 ± 0.029
HisGly: 2.418 ± 0.045
HisHis: 0.618 ± 0.024
HisIle: 0.918 ± 0.025
HisLys: 0.457 ± 0.022
HisLeu: 2.079 ± 0.038
HisMet: 0.497 ± 0.019
HisAsn: 0.502 ± 0.021
HisPro: 1.408 ± 0.034
HisGln: 0.628 ± 0.023
HisArg: 1.478 ± 0.034
HisSer: 0.946 ± 0.026
HisThr: 0.897 ± 0.023
HisVal: 1.701 ± 0.039
HisTrp: 0.402 ± 0.016
HisTyr: 0.621 ± 0.02
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.391 ± 0.079
IleCys: 0.387 ± 0.018
IleAsp: 3.693 ± 0.057
IleGlu: 2.788 ± 0.046
IlePhe: 1.21 ± 0.031
IleGly: 4.515 ± 0.059
IleHis: 0.852 ± 0.025
IleIle: 1.572 ± 0.042
IleLys: 1.151 ± 0.031
IleLeu: 3.459 ± 0.055
IleMet: 0.874 ± 0.029
IleAsn: 1.351 ± 0.037
IlePro: 1.976 ± 0.04
IleGln: 1.055 ± 0.028
IleArg: 2.682 ± 0.044
IleSer: 2.196 ± 0.044
IleThr: 2.683 ± 0.042
IleVal: 3.724 ± 0.051
IleTrp: 0.499 ± 0.024
IleTyr: 0.968 ± 0.029
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.859 ± 0.075
LysCys: 0.119 ± 0.009
LysAsp: 1.345 ± 0.04
LysGlu: 0.975 ± 0.033
LysPhe: 0.699 ± 0.025
LysGly: 2.416 ± 0.041
LysHis: 0.452 ± 0.019
LysIle: 1.21 ± 0.03
LysLys: 0.78 ± 0.035
LysLeu: 2.744 ± 0.056
LysMet: 0.57 ± 0.023
LysAsn: 0.655 ± 0.026
LysPro: 1.784 ± 0.035
LysGln: 0.789 ± 0.025
LysArg: 1.777 ± 0.044
LysSer: 1.228 ± 0.032
LysThr: 1.456 ± 0.036
LysVal: 2.091 ± 0.052
LysTrp: 0.314 ± 0.018
LysTyr: 0.508 ± 0.02
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 17.248 ± 0.188
LeuCys: 0.823 ± 0.026
LeuAsp: 6.31 ± 0.079
LeuGlu: 4.315 ± 0.069
LeuPhe: 3.445 ± 0.058
LeuGly: 9.032 ± 0.085
LeuHis: 2.126 ± 0.041
LeuIle: 4.336 ± 0.058
LeuLys: 2.591 ± 0.049
LeuLeu: 9.471 ± 0.103
LeuMet: 2.069 ± 0.041
LeuAsn: 2.244 ± 0.047
LeuPro: 6.176 ± 0.074
LeuGln: 2.509 ± 0.044
LeuArg: 7.194 ± 0.084
LeuSer: 5.595 ± 0.077
LeuThr: 5.454 ± 0.076
LeuVal: 7.917 ± 0.083
LeuTrp: 1.301 ± 0.037
LeuTyr: 1.933 ± 0.039
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.867 ± 0.06
MetCys: 0.138 ± 0.01
MetAsp: 1.11 ± 0.028
MetGlu: 0.906 ± 0.028
MetPhe: 0.616 ± 0.02
MetGly: 1.938 ± 0.041
MetHis: 0.461 ± 0.018
MetIle: 1.222 ± 0.03
MetLys: 0.735 ± 0.026
MetLeu: 2.532 ± 0.053
MetMet: 0.585 ± 0.022
MetAsn: 0.612 ± 0.023
MetPro: 1.552 ± 0.035
MetGln: 0.744 ± 0.027
MetArg: 1.64 ± 0.039
MetSer: 1.181 ± 0.03
MetThr: 1.64 ± 0.04
MetVal: 1.781 ± 0.038
MetTrp: 0.185 ± 0.011
MetTyr: 0.237 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.404 ± 0.058
AsnCys: 0.225 ± 0.013
AsnAsp: 1.316 ± 0.034
AsnGlu: 0.914 ± 0.027
AsnPhe: 0.941 ± 0.033
AsnGly: 2.463 ± 0.057
AsnHis: 0.507 ± 0.017
AsnIle: 1.091 ± 0.031
AsnLys: 0.543 ± 0.021
AsnLeu: 2.524 ± 0.042
AsnMet: 0.45 ± 0.018
AsnAsn: 0.692 ± 0.03
AsnPro: 1.863 ± 0.04
AsnGln: 0.831 ± 0.024
AsnArg: 1.772 ± 0.036
AsnSer: 1.187 ± 0.037
AsnThr: 1.31 ± 0.04
AsnVal: 1.817 ± 0.043
AsnTrp: 0.416 ± 0.019
AsnTyr: 0.755 ± 0.034
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.719 ± 0.118
ProCys: 0.373 ± 0.018
ProAsp: 3.789 ± 0.061
ProGlu: 3.051 ± 0.056
ProPhe: 1.934 ± 0.043
ProGly: 5.372 ± 0.062
ProHis: 1.234 ± 0.031
ProIle: 2.275 ± 0.044
ProLys: 1.177 ± 0.034
ProLeu: 5.54 ± 0.072
ProMet: 1.215 ± 0.032
ProAsn: 1.156 ± 0.03
ProPro: 2.933 ± 0.063
ProGln: 2.166 ± 0.042
ProArg: 3.328 ± 0.06
ProSer: 2.554 ± 0.045
ProThr: 2.671 ± 0.044
ProVal: 4.421 ± 0.061
ProTrp: 0.813 ± 0.026
ProTyr: 1.117 ± 0.032
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.659 ± 0.081
GlnCys: 0.237 ± 0.013
GlnAsp: 1.675 ± 0.038
GlnGlu: 1.286 ± 0.033
GlnPhe: 1.204 ± 0.035
GlnGly: 3.264 ± 0.058
GlnHis: 0.728 ± 0.022
GlnIle: 1.736 ± 0.035
GlnLys: 0.762 ± 0.023
GlnLeu: 3.169 ± 0.052
GlnMet: 0.811 ± 0.023
GlnAsn: 0.79 ± 0.029
GlnPro: 2.079 ± 0.04
GlnGln: 1.319 ± 0.037
GlnArg: 2.657 ± 0.05
GlnSer: 1.714 ± 0.042
GlnThr: 1.898 ± 0.039
GlnVal: 2.888 ± 0.052
GlnTrp: 0.627 ± 0.019
GlnTyr: 0.606 ± 0.022
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.336 ± 0.099
ArgCys: 0.5 ± 0.019
ArgAsp: 3.944 ± 0.056
ArgGlu: 3.584 ± 0.073
ArgPhe: 2.989 ± 0.053
ArgGly: 5.006 ± 0.07
ArgHis: 1.935 ± 0.043
ArgIle: 3.593 ± 0.054
ArgLys: 1.842 ± 0.036
ArgLeu: 8.376 ± 0.105
ArgMet: 1.793 ± 0.035
ArgAsn: 1.712 ± 0.034
ArgPro: 3.737 ± 0.058
ArgGln: 2.866 ± 0.048
ArgArg: 5.47 ± 0.072
ArgSer: 3.17 ± 0.053
ArgThr: 3.353 ± 0.054
ArgVal: 4.895 ± 0.068
ArgTrp: 1.197 ± 0.029
ArgTyr: 1.837 ± 0.046
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.518 ± 0.075
SerCys: 0.379 ± 0.02
SerAsp: 2.75 ± 0.053
SerGlu: 1.976 ± 0.043
SerPhe: 1.859 ± 0.044
SerGly: 5.136 ± 0.08
SerHis: 1.033 ± 0.031
SerIle: 2.121 ± 0.045
SerLys: 1.115 ± 0.033
SerLeu: 5.121 ± 0.078
SerMet: 0.992 ± 0.03
SerAsn: 1.324 ± 0.038
SerPro: 2.774 ± 0.045
SerGln: 1.591 ± 0.035
SerArg: 3.131 ± 0.051
SerSer: 2.446 ± 0.055
SerThr: 2.454 ± 0.041
SerVal: 3.458 ± 0.054
SerTrp: 0.753 ± 0.022
SerTyr: 1.294 ± 0.041
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.963 ± 0.079
ThrCys: 0.403 ± 0.018
ThrAsp: 2.621 ± 0.058
ThrGlu: 1.951 ± 0.043
ThrPhe: 1.821 ± 0.038
ThrGly: 5.323 ± 0.066
ThrHis: 1.036 ± 0.031
ThrIle: 2.797 ± 0.055
ThrLys: 1.092 ± 0.035
ThrLeu: 6.223 ± 0.085
ThrMet: 1.252 ± 0.031
ThrAsn: 1.301 ± 0.041
ThrPro: 3.646 ± 0.058
ThrGln: 1.776 ± 0.036
ThrArg: 3.585 ± 0.053
ThrSer: 2.541 ± 0.053
ThrThr: 2.962 ± 0.072
ThrVal: 4.33 ± 0.06
ThrTrp: 0.788 ± 0.027
ThrTyr: 1.214 ± 0.039
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.513 ± 0.105
ValCys: 0.611 ± 0.023
ValAsp: 4.222 ± 0.058
ValGlu: 3.987 ± 0.064
ValPhe: 2.384 ± 0.052
ValGly: 5.635 ± 0.076
ValHis: 1.495 ± 0.036
ValIle: 3.734 ± 0.06
ValLys: 1.826 ± 0.043
ValLeu: 7.309 ± 0.073
ValMet: 1.714 ± 0.039
ValAsn: 2.021 ± 0.044
ValPro: 4.411 ± 0.057
ValGln: 2.193 ± 0.046
ValArg: 5.101 ± 0.076
ValSer: 3.935 ± 0.053
ValThr: 4.452 ± 0.064
ValVal: 5.884 ± 0.092
ValTrp: 0.95 ± 0.03
ValTyr: 1.37 ± 0.032
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.547 ± 0.035
TrpCys: 0.143 ± 0.01
TrpAsp: 0.773 ± 0.027
TrpGlu: 0.531 ± 0.021
TrpPhe: 0.592 ± 0.022
TrpGly: 1.011 ± 0.029
TrpHis: 0.477 ± 0.018
TrpIle: 0.671 ± 0.022
TrpLys: 0.434 ± 0.019
TrpLeu: 1.923 ± 0.043
TrpMet: 0.348 ± 0.017
TrpAsn: 0.519 ± 0.021
TrpPro: 0.825 ± 0.029
TrpGln: 0.834 ± 0.026
TrpArg: 1.474 ± 0.035
TrpSer: 0.84 ± 0.027
TrpThr: 0.799 ± 0.025
TrpVal: 0.857 ± 0.029
TrpTrp: 0.302 ± 0.016
TrpTyr: 0.32 ± 0.016
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.827 ± 0.053
TyrCys: 0.21 ± 0.012
TyrAsp: 1.498 ± 0.048
TyrGlu: 0.968 ± 0.028
TyrPhe: 0.849 ± 0.028
TyrGly: 2.121 ± 0.045
TyrHis: 0.564 ± 0.022
TyrIle: 0.76 ± 0.024
TyrLys: 0.528 ± 0.024
TyrLeu: 2.033 ± 0.041
TyrMet: 0.371 ± 0.018
TyrAsn: 0.672 ± 0.031
TyrPro: 1.049 ± 0.033
TyrGln: 0.76 ± 0.026
TyrArg: 1.872 ± 0.038
TyrSer: 1.101 ± 0.034
TyrThr: 1.163 ± 0.041
TyrVal: 1.571 ± 0.039
TyrTrp: 0.329 ± 0.016
TyrTyr: 0.637 ± 0.024
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3896 proteins (1332680 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
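A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual pipeline: the function names, the use of one-letter toy sequences, counting overlapping dipeptides within each protein, and reporting the standard deviation across resamples as the error are all assumptions.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_counts(sequence):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(sequence[i:i + 2] for i in range(len(sequence) - 1))


def frequency_permille(counts_per_protein, dipeptide):
    """Frequency of one dipeptide over a set of proteins, in per mille."""
    total = sum(sum(c.values()) for c in counts_per_protein)
    hits = sum(c[dipeptide] for c in counts_per_protein)
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and report (frequency, error)."""
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Each resample draws as many proteins as the original set contains.
        sample = rng.choices(per_protein, k=len(per_protein))
        estimates.append(frequency_permille(sample, dipeptide))
    return frequency_permille(per_protein, dipeptide), stdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes).
proteins = ["MAALAG", "MKAAL", "MGAVAA", "MLLAAR"]
freq, err = bootstrap_error(proteins, "AA")
print(f"AA: {freq:.3f} ± {err:.3f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between neighbouring residues within a protein intact, which is presumably why the error is estimated at the protein level.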

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski