Amino acid dipeptide frequency for Pedobacter sp. ok626

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred uniformly at random in proteins, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
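As an illustration of the calculation described above, the following is a minimal Python sketch, not the database's actual pipeline. It assumes that dipeptides are counted as overlapping pairs of adjacent residues within each protein, that sequences use one-letter codes, and that any non-standard letter is mapped to X (Xaa). The function names `dipeptide_per_mille` and `classify` are hypothetical, and the uniform expectation of 2.5‰ is taken from the text.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: overlapping adjacent pairs are counted within each sequence,
    and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Map every letter outside the standard alphabet to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input (made-up sequences, not taken from the proteome).
toy_freqs = dipeptide_per_mille(["MKLLA", "LLK"])
```

With values from the table, `classify(9.242)` (LeuLeu) gives "over" and `classify(0.105)` (CysCys) gives "under".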

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
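The unit conversion can be written out as a short Python sketch. The helper names are hypothetical; the example value is the LeuLeu entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


leu_leu = 9.242  # LeuLeu frequency from the table, in per mille
leu_leu_fraction = per_mille_to_fraction(leu_leu)  # about 0.009242
leu_leu_percent = per_mille_to_percent(leu_leu)    # about 0.9242 %
```

Expressed in percent, LeuLeu can be compared directly with the 0.25% uniform expectation mentioned above.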

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.41 ± 0.079
AlaCys: 0.592 ± 0.017
AlaAsp: 4.303 ± 0.04
AlaGlu: 4.4 ± 0.051
AlaPhe: 3.414 ± 0.038
AlaGly: 5.495 ± 0.086
AlaHis: 1.09 ± 0.027
AlaIle: 5.412 ± 0.061
AlaLys: 5.028 ± 0.052
AlaLeu: 6.897 ± 0.056
AlaMet: 1.648 ± 0.032
AlaAsn: 4.13 ± 0.045
AlaPro: 2.352 ± 0.041
AlaGln: 2.905 ± 0.039
AlaArg: 2.318 ± 0.037
AlaSer: 4.718 ± 0.061
AlaThr: 4.458 ± 0.09
AlaVal: 4.738 ± 0.049
AlaTrp: 0.751 ± 0.019
AlaTyr: 3.192 ± 0.039
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.472 ± 0.017
CysCys: 0.105 ± 0.007
CysAsp: 0.343 ± 0.012
CysGlu: 0.359 ± 0.015
CysPhe: 0.456 ± 0.016
CysGly: 0.565 ± 0.018
CysHis: 0.172 ± 0.01
CysIle: 0.553 ± 0.016
CysLys: 0.553 ± 0.018
CysLeu: 0.716 ± 0.022
CysMet: 0.164 ± 0.011
CysAsn: 0.349 ± 0.013
CysPro: 0.25 ± 0.011
CysGln: 0.194 ± 0.01
CysArg: 0.304 ± 0.014
CysSer: 0.479 ± 0.017
CysThr: 0.411 ± 0.014
CysVal: 0.402 ± 0.013
CysTrp: 0.07 ± 0.006
CysTyr: 0.311 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.887 ± 0.048
AspCys: 0.359 ± 0.012
AspAsp: 2.387 ± 0.036
AspGlu: 3.21 ± 0.044
AspPhe: 3.13 ± 0.042
AspGly: 3.906 ± 0.056
AspHis: 1.031 ± 0.024
AspIle: 3.766 ± 0.041
AspLys: 3.984 ± 0.049
AspLeu: 5.321 ± 0.055
AspMet: 1.175 ± 0.024
AspAsn: 2.721 ± 0.039
AspPro: 2.22 ± 0.036
AspGln: 2.123 ± 0.031
AspArg: 2.139 ± 0.032
AspSer: 2.59 ± 0.042
AspThr: 2.311 ± 0.03
AspVal: 3.484 ± 0.039
AspTrp: 0.813 ± 0.021
AspTyr: 2.558 ± 0.038
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.107 ± 0.056
GluCys: 0.306 ± 0.013
GluAsp: 2.628 ± 0.04
GluGlu: 3.457 ± 0.061
GluPhe: 2.385 ± 0.036
GluGly: 3.41 ± 0.041
GluHis: 0.971 ± 0.027
GluIle: 4.325 ± 0.055
GluLys: 4.488 ± 0.057
GluLeu: 5.726 ± 0.063
GluMet: 1.4 ± 0.028
GluAsn: 3.319 ± 0.043
GluPro: 1.546 ± 0.029
GluGln: 2.359 ± 0.035
GluArg: 2.446 ± 0.036
GluSer: 2.868 ± 0.035
GluThr: 2.908 ± 0.039
GluVal: 3.822 ± 0.045
GluTrp: 0.641 ± 0.019
GluTyr: 1.917 ± 0.032
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.23 ± 0.041
PheCys: 0.416 ± 0.016
PheAsp: 2.83 ± 0.037
PheGlu: 2.697 ± 0.039
PhePhe: 2.338 ± 0.044
PheGly: 3.281 ± 0.044
PheHis: 0.706 ± 0.016
PheIle: 3.442 ± 0.044
PheLys: 3.564 ± 0.046
PheLeu: 4.267 ± 0.06
PheMet: 1.121 ± 0.024
PheAsn: 3.258 ± 0.042
PhePro: 1.722 ± 0.031
PheGln: 1.323 ± 0.025
PheArg: 1.717 ± 0.032
PheSer: 3.938 ± 0.047
PheThr: 3.036 ± 0.042
PheVal: 2.753 ± 0.038
PheTrp: 0.58 ± 0.017
PheTyr: 2.203 ± 0.038
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.811 ± 0.077
GlyCys: 0.558 ± 0.019
GlyAsp: 3.348 ± 0.046
GlyGlu: 3.174 ± 0.04
GlyPhe: 3.608 ± 0.049
GlyGly: 4.886 ± 0.08
GlyHis: 1.045 ± 0.028
GlyIle: 5.308 ± 0.056
GlyLys: 5.538 ± 0.065
GlyLeu: 6.576 ± 0.07
GlyMet: 1.672 ± 0.032
GlyAsn: 4.062 ± 0.06
GlyPro: 1.584 ± 0.035
GlyGln: 2.286 ± 0.037
GlyArg: 2.496 ± 0.04
GlySer: 4.629 ± 0.063
GlyThr: 4.7 ± 0.101
GlyVal: 4.609 ± 0.058
GlyTrp: 0.917 ± 0.023
GlyTyr: 3.328 ± 0.047
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.068 ± 0.025
HisCys: 0.17 ± 0.008
HisAsp: 0.785 ± 0.019
HisGlu: 0.815 ± 0.022
HisPhe: 1.044 ± 0.024
HisGly: 0.983 ± 0.023
HisHis: 0.426 ± 0.016
HisIle: 1.23 ± 0.028
HisLys: 1.033 ± 0.026
HisLeu: 1.677 ± 0.03
HisMet: 0.337 ± 0.013
HisAsn: 0.894 ± 0.02
HisPro: 0.877 ± 0.021
HisGln: 0.723 ± 0.019
HisArg: 0.657 ± 0.019
HisSer: 0.964 ± 0.022
HisThr: 0.917 ± 0.021
HisVal: 0.883 ± 0.019
HisTrp: 0.213 ± 0.01
HisTyr: 0.743 ± 0.018
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.792 ± 0.055
IleCys: 0.642 ± 0.02
IleAsp: 4.197 ± 0.044
IleGlu: 4.179 ± 0.048
IlePhe: 3.056 ± 0.047
IleGly: 4.892 ± 0.056
IleHis: 1.148 ± 0.024
IleIle: 4.784 ± 0.062
IleLys: 5.206 ± 0.054
IleLeu: 6.286 ± 0.061
IleMet: 1.356 ± 0.028
IleAsn: 4.532 ± 0.05
IlePro: 3.067 ± 0.042
IleGln: 2.272 ± 0.031
IleArg: 2.821 ± 0.036
IleSer: 5.503 ± 0.052
IleThr: 4.566 ± 0.062
IleVal: 4.276 ± 0.046
IleTrp: 0.764 ± 0.024
IleTyr: 2.767 ± 0.038
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.272 ± 0.059
LysCys: 0.314 ± 0.014
LysAsp: 4.468 ± 0.048
LysGlu: 4.628 ± 0.056
LysPhe: 2.691 ± 0.041
LysGly: 4.859 ± 0.054
LysHis: 1.247 ± 0.026
LysIle: 5.307 ± 0.054
LysLys: 5.578 ± 0.076
LysLeu: 6.483 ± 0.065
LysMet: 1.937 ± 0.031
LysAsn: 4.548 ± 0.043
LysPro: 2.795 ± 0.04
LysGln: 2.743 ± 0.043
LysArg: 2.828 ± 0.04
LysSer: 4.325 ± 0.047
LysThr: 4.381 ± 0.049
LysVal: 4.631 ± 0.053
LysTrp: 0.88 ± 0.023
LysTyr: 3.048 ± 0.039
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.736 ± 0.065
LeuCys: 0.773 ± 0.023
LeuAsp: 4.73 ± 0.045
LeuGlu: 4.836 ± 0.066
LeuPhe: 4.605 ± 0.055
LeuGly: 5.689 ± 0.06
LeuHis: 1.553 ± 0.031
LeuIle: 6.675 ± 0.081
LeuLys: 7.854 ± 0.067
LeuLeu: 9.242 ± 0.093
LeuMet: 2.205 ± 0.033
LeuAsn: 6.151 ± 0.06
LeuPro: 4.061 ± 0.046
LeuGln: 3.399 ± 0.045
LeuArg: 3.58 ± 0.044
LeuSer: 7.505 ± 0.07
LeuThr: 5.776 ± 0.06
LeuVal: 5.326 ± 0.056
LeuTrp: 0.962 ± 0.024
LeuTyr: 3.425 ± 0.051
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.804 ± 0.032
MetCys: 0.138 ± 0.008
MetAsp: 1.213 ± 0.026
MetGlu: 1.365 ± 0.025
MetPhe: 0.822 ± 0.02
MetGly: 1.51 ± 0.029
MetHis: 0.375 ± 0.015
MetIle: 1.526 ± 0.03
MetLys: 2.024 ± 0.031
MetLeu: 2.127 ± 0.03
MetMet: 0.625 ± 0.019
MetAsn: 1.319 ± 0.027
MetPro: 1.067 ± 0.025
MetGln: 0.894 ± 0.02
MetArg: 0.967 ± 0.02
MetSer: 1.337 ± 0.026
MetThr: 1.112 ± 0.024
MetVal: 1.45 ± 0.025
MetTrp: 0.199 ± 0.009
MetTyr: 0.715 ± 0.02
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.324 ± 0.057
AsnCys: 0.387 ± 0.015
AsnAsp: 2.907 ± 0.035
AsnGlu: 3.085 ± 0.041
AsnPhe: 2.787 ± 0.041
AsnGly: 4.534 ± 0.057
AsnHis: 0.941 ± 0.022
AsnIle: 4.318 ± 0.049
AsnLys: 4.054 ± 0.043
AsnLeu: 5.466 ± 0.057
AsnMet: 1.252 ± 0.023
AsnAsn: 3.606 ± 0.06
AsnPro: 2.845 ± 0.04
AsnGln: 2.123 ± 0.033
AsnArg: 2.387 ± 0.034
AsnSer: 3.73 ± 0.046
AsnThr: 3.642 ± 0.056
AsnVal: 3.57 ± 0.049
AsnTrp: 0.772 ± 0.022
AsnTyr: 2.872 ± 0.037
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.042 ± 0.047
ProCys: 0.231 ± 0.011
ProAsp: 2.442 ± 0.036
ProGlu: 2.755 ± 0.039
ProPhe: 1.892 ± 0.032
ProGly: 2.914 ± 0.04
ProHis: 0.558 ± 0.015
ProIle: 2.515 ± 0.039
ProLys: 2.257 ± 0.034
ProLeu: 3.401 ± 0.039
ProMet: 0.794 ± 0.021
ProAsn: 2.042 ± 0.033
ProPro: 0.939 ± 0.021
ProGln: 1.29 ± 0.027
ProArg: 1.083 ± 0.025
ProSer: 2.306 ± 0.034
ProThr: 2.094 ± 0.036
ProVal: 3.008 ± 0.04
ProTrp: 0.391 ± 0.015
ProTyr: 1.548 ± 0.029
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.492 ± 0.036
GlnCys: 0.171 ± 0.01
GlnAsp: 1.685 ± 0.029
GlnGlu: 1.967 ± 0.036
GlnPhe: 1.587 ± 0.027
GlnGly: 2.091 ± 0.03
GlnHis: 0.693 ± 0.02
GlnIle: 2.555 ± 0.038
GlnLys: 2.567 ± 0.042
GlnLeu: 3.858 ± 0.048
GlnMet: 0.868 ± 0.02
GlnAsn: 2.155 ± 0.035
GlnPro: 1.296 ± 0.023
GlnGln: 1.875 ± 0.036
GlnArg: 1.455 ± 0.029
GlnSer: 2.285 ± 0.036
GlnThr: 2.108 ± 0.032
GlnVal: 2.313 ± 0.036
GlnTrp: 0.409 ± 0.016
GlnTyr: 1.498 ± 0.03
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.483 ± 0.035
ArgCys: 0.224 ± 0.011
ArgAsp: 1.892 ± 0.032
ArgGlu: 2.018 ± 0.034
ArgPhe: 2.098 ± 0.036
ArgGly: 2.207 ± 0.035
ArgHis: 0.581 ± 0.019
ArgIle: 3.04 ± 0.044
ArgLys: 2.818 ± 0.051
ArgLeu: 3.797 ± 0.045
ArgMet: 1.057 ± 0.02
ArgAsn: 2.297 ± 0.035
ArgPro: 1.35 ± 0.033
ArgGln: 1.245 ± 0.021
ArgArg: 1.469 ± 0.032
ArgSer: 2.295 ± 0.035
ArgThr: 2.096 ± 0.03
ArgVal: 2.285 ± 0.034
ArgTrp: 0.574 ± 0.019
ArgTyr: 1.87 ± 0.031
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.979 ± 0.058
SerCys: 0.566 ± 0.017
SerAsp: 3.285 ± 0.044
SerGlu: 2.998 ± 0.038
SerPhe: 3.756 ± 0.049
SerGly: 5.301 ± 0.076
SerHis: 0.99 ± 0.023
SerIle: 4.967 ± 0.055
SerLys: 4.225 ± 0.056
SerLeu: 6.347 ± 0.059
SerMet: 1.287 ± 0.025
SerAsn: 3.588 ± 0.051
SerPro: 2.481 ± 0.039
SerGln: 1.963 ± 0.03
SerArg: 2.523 ± 0.038
SerSer: 4.521 ± 0.059
SerThr: 3.975 ± 0.05
SerVal: 4.219 ± 0.043
SerTrp: 0.852 ± 0.021
SerTyr: 2.999 ± 0.043
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.953 ± 0.089
ThrCys: 0.316 ± 0.013
ThrAsp: 3.453 ± 0.043
ThrGlu: 3.155 ± 0.039
ThrPhe: 2.779 ± 0.044
ThrGly: 5.024 ± 0.084
ThrHis: 0.936 ± 0.02
ThrIle: 4.388 ± 0.056
ThrLys: 3.504 ± 0.042
ThrLeu: 5.652 ± 0.066
ThrMet: 1.064 ± 0.023
ThrAsn: 3.208 ± 0.054
ThrPro: 2.5 ± 0.032
ThrGln: 1.872 ± 0.031
ThrArg: 1.886 ± 0.03
ThrSer: 3.705 ± 0.053
ThrThr: 3.817 ± 0.068
ThrVal: 4.013 ± 0.08
ThrTrp: 0.697 ± 0.022
ThrTyr: 2.647 ± 0.05
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.617 ± 0.057
ValCys: 0.52 ± 0.018
ValAsp: 3.333 ± 0.039
ValGlu: 3.247 ± 0.045
ValPhe: 3.072 ± 0.036
ValGly: 3.762 ± 0.048
ValHis: 0.933 ± 0.022
ValIle: 4.647 ± 0.054
ValLys: 4.755 ± 0.049
ValLeu: 6.026 ± 0.059
ValMet: 1.469 ± 0.024
ValAsn: 3.931 ± 0.045
ValPro: 2.468 ± 0.035
ValGln: 2.025 ± 0.029
ValArg: 2.248 ± 0.034
ValSer: 4.558 ± 0.052
ValThr: 3.822 ± 0.068
ValVal: 4.239 ± 0.055
ValTrp: 0.683 ± 0.02
ValTyr: 2.579 ± 0.037
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.774 ± 0.022
TrpCys: 0.13 ± 0.008
TrpAsp: 0.636 ± 0.019
TrpGlu: 0.659 ± 0.016
TrpPhe: 0.573 ± 0.018
TrpGly: 0.834 ± 0.021
TrpHis: 0.228 ± 0.011
TrpIle: 0.757 ± 0.021
TrpLys: 0.894 ± 0.024
TrpLeu: 1.162 ± 0.022
TrpMet: 0.345 ± 0.015
TrpAsn: 0.713 ± 0.02
TrpPro: 0.368 ± 0.014
TrpGln: 0.513 ± 0.017
TrpArg: 0.524 ± 0.017
TrpSer: 0.721 ± 0.021
TrpThr: 0.703 ± 0.021
TrpVal: 0.673 ± 0.018
TrpTrp: 0.176 ± 0.01
TrpTyr: 0.51 ± 0.018
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.996 ± 0.044
TyrCys: 0.312 ± 0.012
TyrAsp: 2.248 ± 0.039
TyrGlu: 2.026 ± 0.031
TyrPhe: 2.33 ± 0.039
TyrGly: 2.992 ± 0.048
TyrHis: 0.814 ± 0.021
TyrIle: 2.591 ± 0.037
TyrLys: 3.04 ± 0.04
TyrLeu: 4.054 ± 0.055
TyrMet: 0.81 ± 0.017
TyrAsn: 2.775 ± 0.045
TyrPro: 1.717 ± 0.027
TyrGln: 1.807 ± 0.029
TyrArg: 1.848 ± 0.03
TyrSer: 2.845 ± 0.043
TyrThr: 2.737 ± 0.043
TyrVal: 2.242 ± 0.036
TyrTrp: 0.546 ± 0.017
TyrTyr: 1.959 ± 0.036
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5460 proteins (2081436 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
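A protein-level bootstrap of this kind could look roughly like the Python sketch below. It is an assumption-laden illustration rather than the database's own code: it assumes whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 replicates, and that the reported error is the standard deviation across replicates. The function names, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides (adjacent residue pairs) in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate draws len(sequences) whole proteins with replacement,
    so the resampling unit is the protein, not the individual dipeptide.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (made-up sequences, not taken from the proteome).
toy_proteins = ["MKLLA", "LLK", "AAKLL", "MLLLL"]
ll_mean, ll_error = bootstrap_error(toy_proteins, "LL")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within each protein together, so the spread reflects variation between proteins.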

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski