Amino acid dipeptide frequency for Lachnospiraceae bacterium 1_4_56FAA

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
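As an illustration of what such a calculation might look like, here is a minimal sketch in Python. It assumes that dipeptides are counted as overlapping pairs of adjacent residues within each protein (never across protein boundaries) and that any symbol outside the 20 standard residues is mapped to Xaa. The function name `dipeptide_permille` and the toy sequences are hypothetical. This is not necessarily how Proteome-pI computes its values.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
# 'X' stands for Xaa (assumption: every non-standard symbol is folded into it).
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}


def dipeptide_permille(proteins):
    """Return {dipeptide name: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Fold every symbol outside the 20 standard residues into 'X'.
        seq = "".join(c if c in ONE_TO_THREE else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    freqs = {}
    for a, b in product(ONE_TO_THREE, repeat=2):
        name = ONE_TO_THREE[a] + ONE_TO_THREE[b]
        freqs[name] = 1000.0 * counts[a + b] / total if total else 0.0
    return freqs


# Hypothetical toy input: two short sequences, one with a non-standard residue (U).
toy_freqs = dipeptide_permille(["MAAK", "MKAU"])
```

The result always has 441 entries (21 x 21), and the frequencies sum to 1000 per mille whenever at least one dipeptide was counted.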

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.
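The uniform expectation and the over/under comparison can be sketched as follows. The helper names are hypothetical, and the simple "greater or smaller than expected" rule is an assumption; the exact criterion behind the red and blue marking is not stated on this page.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"


# Values taken from the table on this page (per mille).
labels = {
    "AlaAla": classify(7.546),
    "GluGlu": classify(8.729),
    "CysCys": classify(0.27),
    "TrpTrp": classify(0.149),
}
```

With 20 amino acids the expectation is 1000 / 400 = 2.5‰; with Xaa included it is 1000 / 441, roughly 2.27‰.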

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
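For readers who prefer fractions or percentages, the unit conversion is a one-liner. The function names below are hypothetical; the AlaAla value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (1 percent = 10 per mille)."""
    return value_permille / 10.0


ala_ala_fraction = permille_to_fraction(7.546)
ala_ala_percent = permille_to_percent(7.546)
```

So the AlaAla entry of 7.546‰ corresponds to a fraction of about 0.007546, or about 0.7546%.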

For more information, see the sequence space article on Wikipedia.

Rows: first amino acid; columns: second amino acid.
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.546AlaAla: 7.546 ± 0.123
1.152AlaCys: 1.152 ± 0.039
4.509AlaAsp: 4.509 ± 0.068
6.289AlaGlu: 6.289 ± 0.104
3.062AlaPhe: 3.062 ± 0.065
6.205AlaGly: 6.205 ± 0.095
1.162AlaHis: 1.162 ± 0.036
4.62AlaIle: 4.62 ± 0.079
4.99AlaLys: 4.99 ± 0.083
6.797AlaLeu: 6.797 ± 0.094
2.418AlaMet: 2.418 ± 0.056
2.477AlaAsn: 2.477 ± 0.05
1.98AlaPro: 1.98 ± 0.056
2.351AlaGln: 2.351 ± 0.05
2.942AlaArg: 2.942 ± 0.061
3.748AlaSer: 3.748 ± 0.064
2.941AlaThr: 2.941 ± 0.055
6.626AlaVal: 6.626 ± 0.095
0.733AlaTrp: 0.733 ± 0.032
2.843AlaTyr: 2.843 ± 0.054
0.0AlaXaa: 0.0 ± 0.0
Cys
1.094CysAla: 1.094 ± 0.034
0.27CysCys: 0.27 ± 0.017
0.828CysAsp: 0.828 ± 0.027
1.088CysGlu: 1.088 ± 0.034
0.645CysPhe: 0.645 ± 0.024
1.507CysGly: 1.507 ± 0.037
0.289CysHis: 0.289 ± 0.018
1.081CysIle: 1.081 ± 0.035
0.776CysLys: 0.776 ± 0.033
1.085CysLeu: 1.085 ± 0.036
0.487CysMet: 0.487 ± 0.019
0.518CysAsn: 0.518 ± 0.022
0.626CysPro: 0.626 ± 0.029
0.39CysGln: 0.39 ± 0.022
0.785CysArg: 0.785 ± 0.027
0.874CysSer: 0.874 ± 0.028
0.796CysThr: 0.796 ± 0.03
1.039CysVal: 1.039 ± 0.029
0.123CysTrp: 0.123 ± 0.011
0.583CysTyr: 0.583 ± 0.024
0.0CysXaa: 0.0 ± 0.0
Asp
4.363AspAla: 4.363 ± 0.072
0.771AspCys: 0.771 ± 0.029
2.551AspAsp: 2.551 ± 0.077
4.497AspGlu: 4.497 ± 0.076
2.437AspPhe: 2.437 ± 0.051
4.619AspGly: 4.619 ± 0.086
1.073AspHis: 1.073 ± 0.038
3.706AspIle: 3.706 ± 0.071
2.873AspLys: 2.873 ± 0.058
4.813AspLeu: 4.813 ± 0.077
1.722AspMet: 1.722 ± 0.043
1.719AspAsn: 1.719 ± 0.049
2.147AspPro: 2.147 ± 0.046
1.491AspGln: 1.491 ± 0.04
2.548AspArg: 2.548 ± 0.053
2.678AspSer: 2.678 ± 0.056
2.888AspThr: 2.888 ± 0.056
4.0AspVal: 4.0 ± 0.073
0.611AspTrp: 0.611 ± 0.024
2.54AspTyr: 2.54 ± 0.061
0.0AspXaa: 0.0 ± 0.0
Glu
5.874GluAla: 5.874 ± 0.092
0.932GluCys: 0.932 ± 0.03
4.374GluAsp: 4.374 ± 0.086
8.729GluGlu: 8.729 ± 0.151
2.648GluPhe: 2.648 ± 0.046
4.236GluGly: 4.236 ± 0.072
1.523GluHis: 1.523 ± 0.041
6.226GluIle: 6.226 ± 0.088
7.437GluLys: 7.437 ± 0.097
7.152GluLeu: 7.152 ± 0.116
2.951GluMet: 2.951 ± 0.062
4.235GluAsn: 4.235 ± 0.073
1.988GluPro: 1.988 ± 0.054
3.676GluGln: 3.676 ± 0.076
4.084GluArg: 4.084 ± 0.077
3.593GluSer: 3.593 ± 0.062
4.412GluThr: 4.412 ± 0.071
4.89GluVal: 4.89 ± 0.079
0.784GluTrp: 0.784 ± 0.031
3.203GluTyr: 3.203 ± 0.068
0.0GluXaa: 0.0 ± 0.0
Phe
3.029PheAla: 3.029 ± 0.06
0.741PheCys: 0.741 ± 0.028
2.45PheAsp: 2.45 ± 0.058
2.744PheGlu: 2.744 ± 0.057
1.787PhePhe: 1.787 ± 0.057
3.307PheGly: 3.307 ± 0.071
0.885PheHis: 0.885 ± 0.031
2.417PheIle: 2.417 ± 0.054
1.732PheLys: 1.732 ± 0.045
3.974PheLeu: 3.974 ± 0.084
1.22PheMet: 1.22 ± 0.039
1.257PheAsn: 1.257 ± 0.038
1.443PhePro: 1.443 ± 0.038
1.403PheGln: 1.403 ± 0.038
1.95PheArg: 1.95 ± 0.053
2.701PheSer: 2.701 ± 0.06
2.193PheThr: 2.193 ± 0.051
2.849PheVal: 2.849 ± 0.052
0.45PheTrp: 0.45 ± 0.024
1.685PheTyr: 1.685 ± 0.044
0.0PheXaa: 0.0 ± 0.0
Gly
5.239GlyAla: 5.239 ± 0.078
1.264GlyCys: 1.264 ± 0.04
3.431GlyAsp: 3.431 ± 0.056
5.345GlyGlu: 5.345 ± 0.079
3.186GlyPhe: 3.186 ± 0.07
5.218GlyGly: 5.218 ± 0.103
1.197GlyHis: 1.197 ± 0.038
6.402GlyIle: 6.402 ± 0.089
5.806GlyLys: 5.806 ± 0.094
5.69GlyLeu: 5.69 ± 0.079
2.723GlyMet: 2.723 ± 0.054
3.3GlyAsn: 3.3 ± 0.069
1.232GlyPro: 1.232 ± 0.04
2.142GlyGln: 2.142 ± 0.041
3.073GlyArg: 3.073 ± 0.059
4.121GlySer: 4.121 ± 0.068
4.381GlyThr: 4.381 ± 0.069
5.138GlyVal: 5.138 ± 0.077
0.803GlyTrp: 0.803 ± 0.039
3.287GlyTyr: 3.287 ± 0.065
0.0GlyXaa: 0.0 ± 0.0
His
1.279HisAla: 1.279 ± 0.038
0.318HisCys: 0.318 ± 0.019
0.864HisAsp: 0.864 ± 0.031
1.154HisGlu: 1.154 ± 0.038
0.858HisPhe: 0.858 ± 0.032
1.351HisGly: 1.351 ± 0.037
0.426HisHis: 0.426 ± 0.031
1.407HisIle: 1.407 ± 0.044
0.919HisLys: 0.919 ± 0.035
1.641HisLeu: 1.641 ± 0.04
0.565HisMet: 0.565 ± 0.024
0.639HisAsn: 0.639 ± 0.028
0.993HisPro: 0.993 ± 0.035
0.565HisGln: 0.565 ± 0.025
0.832HisArg: 0.832 ± 0.031
0.953HisSer: 0.953 ± 0.033
1.052HisThr: 1.052 ± 0.033
1.212HisVal: 1.212 ± 0.035
0.192HisTrp: 0.192 ± 0.014
0.793HisTyr: 0.793 ± 0.031
0.0HisXaa: 0.0 ± 0.0
Ile
5.531IleAla: 5.531 ± 0.084
1.356IleCys: 1.356 ± 0.039
3.824IleAsp: 3.824 ± 0.065
5.014IleGlu: 5.014 ± 0.078
2.874IlePhe: 2.874 ± 0.071
5.448IleGly: 5.448 ± 0.086
1.377IleHis: 1.377 ± 0.036
4.114IleIle: 4.114 ± 0.087
3.482IleLys: 3.482 ± 0.063
7.097IleLeu: 7.097 ± 0.113
1.883IleMet: 1.883 ± 0.052
2.425IleAsn: 2.425 ± 0.05
3.22IlePro: 3.22 ± 0.068
2.317IleGln: 2.317 ± 0.053
4.17IleArg: 4.17 ± 0.08
4.481IleSer: 4.481 ± 0.072
3.828IleThr: 3.828 ± 0.069
4.997IleVal: 4.997 ± 0.074
0.688IleTrp: 0.688 ± 0.03
2.586IleTyr: 2.586 ± 0.046
0.0IleXaa: 0.0 ± 0.0
Lys
4.836LysAla: 4.836 ± 0.079
0.731LysCys: 0.731 ± 0.027
3.58LysAsp: 3.58 ± 0.078
7.25LysGlu: 7.25 ± 0.102
1.805LysPhe: 1.805 ± 0.042
4.026LysGly: 4.026 ± 0.068
1.066LysHis: 1.066 ± 0.034
4.786LysIle: 4.786 ± 0.077
6.298LysLys: 6.298 ± 0.095
5.125LysLeu: 5.125 ± 0.069
2.376LysMet: 2.376 ± 0.048
3.486LysAsn: 3.486 ± 0.059
1.981LysPro: 1.981 ± 0.055
2.507LysGln: 2.507 ± 0.052
3.581LysArg: 3.581 ± 0.056
3.248LysSer: 3.248 ± 0.057
3.786LysThr: 3.786 ± 0.066
4.035LysVal: 4.035 ± 0.062
0.652LysTrp: 0.652 ± 0.027
2.766LysTyr: 2.766 ± 0.061
0.0LysXaa: 0.0 ± 0.0
Leu
6.587LeuAla: 6.587 ± 0.092
1.525LeuCys: 1.525 ± 0.039
4.957LeuAsp: 4.957 ± 0.076
6.789LeuGlu: 6.789 ± 0.104
3.796LeuPhe: 3.796 ± 0.085
6.155LeuGly: 6.155 ± 0.094
1.616LeuHis: 1.616 ± 0.045
6.0LeuIle: 6.0 ± 0.099
6.131LeuLys: 6.131 ± 0.08
8.476LeuLeu: 8.476 ± 0.126
2.605LeuMet: 2.605 ± 0.056
3.671LeuAsn: 3.671 ± 0.07
3.62LeuPro: 3.62 ± 0.068
2.803LeuGln: 2.803 ± 0.059
3.841LeuArg: 3.841 ± 0.069
5.761LeuSer: 5.761 ± 0.075
5.001LeuThr: 5.001 ± 0.084
5.365LeuVal: 5.365 ± 0.086
0.764LeuTrp: 0.764 ± 0.027
3.334LeuTyr: 3.334 ± 0.066
0.0LeuXaa: 0.0 ± 0.0
Met
2.43MetAla: 2.43 ± 0.056
0.349MetCys: 0.349 ± 0.019
1.887MetAsp: 1.887 ± 0.044
2.915MetGlu: 2.915 ± 0.067
1.068MetPhe: 1.068 ± 0.035
2.15MetGly: 2.15 ± 0.059
0.503MetHis: 0.503 ± 0.024
2.267MetIle: 2.267 ± 0.055
2.756MetLys: 2.756 ± 0.052
2.938MetLeu: 2.938 ± 0.052
1.024MetMet: 1.024 ± 0.039
1.534MetAsn: 1.534 ± 0.039
1.125MetPro: 1.125 ± 0.033
1.219MetGln: 1.219 ± 0.035
1.495MetArg: 1.495 ± 0.042
1.728MetSer: 1.728 ± 0.04
1.877MetThr: 1.877 ± 0.045
1.852MetVal: 1.852 ± 0.048
0.248MetTrp: 0.248 ± 0.017
1.069MetTyr: 1.069 ± 0.032
0.0MetXaa: 0.0 ± 0.0
Asn
3.054AsnAla: 3.054 ± 0.054
0.611AsnCys: 0.611 ± 0.027
1.886AsnAsp: 1.886 ± 0.049
2.87AsnGlu: 2.87 ± 0.059
1.398AsnPhe: 1.398 ± 0.039
3.648AsnGly: 3.648 ± 0.061
0.796AsnHis: 0.796 ± 0.032
2.898AsnIle: 2.898 ± 0.054
2.111AsnLys: 2.111 ± 0.052
3.673AsnLeu: 3.673 ± 0.07
1.31AsnMet: 1.31 ± 0.039
1.418AsnAsn: 1.418 ± 0.042
1.871AsnPro: 1.871 ± 0.044
1.373AsnGln: 1.373 ± 0.037
2.163AsnArg: 2.163 ± 0.052
2.019AsnSer: 2.019 ± 0.055
2.234AsnThr: 2.234 ± 0.045
2.889AsnVal: 2.889 ± 0.056
0.445AsnTrp: 0.445 ± 0.022
1.695AsnTyr: 1.695 ± 0.051
0.001AsnXaa: 0.001 ± 0.001
Pro
2.428ProAla: 2.428 ± 0.051
0.436ProCys: 0.436 ± 0.022
2.255ProAsp: 2.255 ± 0.054
3.597ProGlu: 3.597 ± 0.073
1.499ProPhe: 1.499 ± 0.041
2.425ProGly: 2.425 ± 0.062
0.66ProHis: 0.66 ± 0.029
2.148ProIle: 2.148 ± 0.047
2.15ProLys: 2.15 ± 0.049
2.713ProLeu: 2.713 ± 0.06
0.946ProMet: 0.946 ± 0.035
1.232ProAsn: 1.232 ± 0.035
0.756ProPro: 0.756 ± 0.031
1.024ProGln: 1.024 ± 0.035
1.018ProArg: 1.018 ± 0.033
1.691ProSer: 1.691 ± 0.042
1.622ProThr: 1.622 ± 0.045
2.817ProVal: 2.817 ± 0.047
0.334ProTrp: 0.334 ± 0.02
1.465ProTyr: 1.465 ± 0.043
0.0ProXaa: 0.0 ± 0.0
Gln
2.48GlnAla: 2.48 ± 0.057
0.383GlnCys: 0.383 ± 0.018
1.594GlnAsp: 1.594 ± 0.043
2.909GlnGlu: 2.909 ± 0.068
1.286GlnPhe: 1.286 ± 0.031
1.931GlnGly: 1.931 ± 0.045
0.48GlnHis: 0.48 ± 0.022
2.959GlnIle: 2.959 ± 0.057
2.823GlnLys: 2.823 ± 0.06
2.774GlnLeu: 2.774 ± 0.066
1.33GlnMet: 1.33 ± 0.035
1.66GlnAsn: 1.66 ± 0.046
0.879GlnPro: 0.879 ± 0.037
1.199GlnGln: 1.199 ± 0.042
1.473GlnArg: 1.473 ± 0.045
1.583GlnSer: 1.583 ± 0.049
1.88GlnThr: 1.88 ± 0.048
2.072GlnVal: 2.072 ± 0.048
0.378GlnTrp: 0.378 ± 0.021
1.405GlnTyr: 1.405 ± 0.033
0.0GlnXaa: 0.0 ± 0.0
Arg
3.059ArgAla: 3.059 ± 0.059
0.61ArgCys: 0.61 ± 0.028
2.226ArgAsp: 2.226 ± 0.049
4.422ArgGlu: 4.422 ± 0.08
1.968ArgPhe: 1.968 ± 0.047
2.743ArgGly: 2.743 ± 0.048
0.806ArgHis: 0.806 ± 0.028
3.586ArgIle: 3.586 ± 0.078
3.839ArgLys: 3.839 ± 0.063
4.011ArgLeu: 4.011 ± 0.071
1.859ArgMet: 1.859 ± 0.048
2.093ArgAsn: 2.093 ± 0.05
1.364ArgPro: 1.364 ± 0.036
1.89ArgGln: 1.89 ± 0.048
2.571ArgArg: 2.571 ± 0.062
2.323ArgSer: 2.323 ± 0.053
2.354ArgThr: 2.354 ± 0.054
2.798ArgVal: 2.798 ± 0.055
0.446ArgTrp: 0.446 ± 0.022
1.997ArgTyr: 1.997 ± 0.047
0.0ArgXaa: 0.0 ± 0.0
Ser
4.254SerAla: 4.254 ± 0.077
0.813SerCys: 0.813 ± 0.031
2.967SerAsp: 2.967 ± 0.062
4.157SerGlu: 4.157 ± 0.068
2.402SerPhe: 2.402 ± 0.054
5.13SerGly: 5.13 ± 0.078
1.006SerHis: 1.006 ± 0.034
3.652SerIle: 3.652 ± 0.064
3.06SerLys: 3.06 ± 0.053
4.546SerLeu: 4.546 ± 0.069
1.833SerMet: 1.833 ± 0.046
1.856SerAsn: 1.856 ± 0.045
1.705SerPro: 1.705 ± 0.047
1.551SerGln: 1.551 ± 0.042
2.686SerArg: 2.686 ± 0.048
2.97SerSer: 2.97 ± 0.067
2.538SerThr: 2.538 ± 0.06
4.199SerVal: 4.199 ± 0.069
0.557SerTrp: 0.557 ± 0.027
2.395SerTyr: 2.395 ± 0.05
0.0SerXaa: 0.0 ± 0.0
Thr
4.452ThrAla: 4.452 ± 0.081
0.642ThrCys: 0.642 ± 0.022
3.187ThrAsp: 3.187 ± 0.06
4.184ThrGlu: 4.184 ± 0.071
2.248ThrPhe: 2.248 ± 0.056
4.574ThrGly: 4.574 ± 0.077
0.91ThrHis: 0.91 ± 0.035
3.761ThrIle: 3.761 ± 0.064
3.193ThrLys: 3.193 ± 0.059
4.814ThrLeu: 4.814 ± 0.079
1.532ThrMet: 1.532 ± 0.043
1.903ThrAsn: 1.903 ± 0.044
2.075ThrPro: 2.075 ± 0.05
1.433ThrGln: 1.433 ± 0.038
1.925ThrArg: 1.925 ± 0.043
2.718ThrSer: 2.718 ± 0.048
2.773ThrThr: 2.773 ± 0.06
4.373ThrVal: 4.373 ± 0.07
0.578ThrTrp: 0.578 ± 0.028
2.114ThrTyr: 2.114 ± 0.048
0.0ThrXaa: 0.0 ± 0.0
Val
4.468ValAla: 4.468 ± 0.078
1.22ValCys: 1.22 ± 0.036
3.673ValAsp: 3.673 ± 0.059
4.928ValGlu: 4.928 ± 0.077
3.064ValPhe: 3.064 ± 0.07
4.536ValGly: 4.536 ± 0.07
1.159ValHis: 1.159 ± 0.035
5.13ValIle: 5.13 ± 0.088
4.527ValLys: 4.527 ± 0.063
6.85ValLeu: 6.85 ± 0.097
2.125ValMet: 2.125 ± 0.055
2.727ValAsn: 2.727 ± 0.055
2.526ValPro: 2.526 ± 0.055
2.233ValGln: 2.233 ± 0.05
3.249ValArg: 3.249 ± 0.058
4.402ValSer: 4.402 ± 0.07
4.079ValThr: 4.079 ± 0.07
4.715ValVal: 4.715 ± 0.082
0.744ValTrp: 0.744 ± 0.031
2.699ValTyr: 2.699 ± 0.06
0.0ValXaa: 0.0 ± 0.0
Trp
0.563TrpAla: 0.563 ± 0.027
0.161TrpCys: 0.161 ± 0.012
0.579TrpAsp: 0.579 ± 0.028
0.769TrpGlu: 0.769 ± 0.028
0.445TrpPhe: 0.445 ± 0.023
0.728TrpGly: 0.728 ± 0.03
0.185TrpHis: 0.185 ± 0.014
0.869TrpIle: 0.869 ± 0.03
0.906TrpLys: 0.906 ± 0.032
0.827TrpLeu: 0.827 ± 0.031
0.394TrpMet: 0.394 ± 0.022
0.581TrpAsn: 0.581 ± 0.026
0.193TrpPro: 0.193 ± 0.016
0.372TrpGln: 0.372 ± 0.021
0.383TrpArg: 0.383 ± 0.02
0.504TrpSer: 0.504 ± 0.023
0.456TrpThr: 0.456 ± 0.021
0.559TrpVal: 0.559 ± 0.025
0.149TrpTrp: 0.149 ± 0.015
0.472TrpTyr: 0.472 ± 0.036
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.822TyrAla: 2.822 ± 0.056
0.599TyrCys: 0.599 ± 0.022
2.426TyrAsp: 2.426 ± 0.059
3.299TyrGlu: 3.299 ± 0.059
1.745TyrPhe: 1.745 ± 0.045
3.043TyrGly: 3.043 ± 0.06
0.895TyrHis: 0.895 ± 0.031
2.681TyrIle: 2.681 ± 0.063
2.112TyrLys: 2.112 ± 0.052
3.803TyrLeu: 3.803 ± 0.078
1.128TyrMet: 1.128 ± 0.035
1.63TyrAsn: 1.63 ± 0.044
1.493TyrPro: 1.493 ± 0.04
1.537TyrGln: 1.537 ± 0.043
2.194TyrArg: 2.194 ± 0.051
2.182TyrSer: 2.182 ± 0.051
2.299TyrThr: 2.299 ± 0.051
2.629TyrVal: 2.629 ± 0.051
0.416TyrTrp: 0.416 ± 0.021
1.895TyrTyr: 1.895 ± 0.05
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.001XaaAsn: 0.001 ± 0.001
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3188 proteins (960594 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
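A protein-level bootstrap of this kind could be sketched as below. Several points are assumptions rather than facts from this page: that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 replicates, and that the reported ± value is the standard deviation of those replicates. The function names and toy sequences are hypothetical.

```python
import random
import statistics


def hits_and_total(seq, dipeptide):
    """Count occurrences of one dipeptide and all overlapping pairs in a sequence."""
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    return pairs.count(dipeptide), len(pairs)


def bootstrap_permille(proteins, dipeptide, n_boot=100, seed=0):
    """Return (per-mille estimate, bootstrap standard deviation) for one dipeptide."""
    rng = random.Random(seed)
    # Precompute per-protein counts so each replicate only needs sums.
    per_protein = [hits_and_total(seq, dipeptide) for seq in proteins]

    def freq(sample):
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        return 1000.0 * hits / total if total else 0.0

    # Resample whole proteins with replacement, keeping the sample size fixed.
    replicates = [
        freq(rng.choices(per_protein, k=len(per_protein))) for _ in range(n_boot)
    ]
    return freq(per_protein), statistics.stdev(replicates)


# Hypothetical toy proteome using one-letter codes.
toy_proteome = ["MAAK", "AAAA", "MKLV", "GAAS"]
estimate, error = bootstrap_permille(toy_proteome, "AA")
```

Resampling proteins rather than individual dipeptides keeps pairs from the same protein together, so the error reflects variation between proteins.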

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski