Amino acid dipeptide frequency for Leifsonia rubra CMS 76R

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
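To illustrate how such frequencies could be computed, the following minimal Python sketch counts overlapping residue pairs within each protein and compares them with the uniform expectation of 2.5‰. The function names, the one-letter alphabet, the mapping of non-standard residues to X (Xaa), and the toy sequences are assumptions made for this example; it is not the Proteome-pI implementation.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; anything else is treated as X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_UNIFORM = 1000.0 / 400  # 2.5


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq, expected=EXPECTED_UNIFORM):
    """Label a frequency relative to the expected value."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


# Hypothetical toy input, not taken from the proteome.
toy = ["MAALAG", "AAV"]
freqs = dipeptide_per_mille(toy)
print(classify(17.137), classify(0.559))  # AlaAla and AlaCys values from the table
```

With the values from the table, AlaAla (17.137‰) falls in the "over" class and AlaCys (0.559‰) in the "under" class.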

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
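As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a fraction and a percentage, and relates it to the uniform expectation of 1/400. The helper name is hypothetical and chosen only for this illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = 17.137                                # AlaAla, in per mille
fraction = per_mille_to_fraction(ala_ala)       # as a fraction of all dipeptides
percent = fraction * 100                        # as a percentage
ratio_to_uniform = fraction / (1 / 400)         # roughly 6.85 times the uniform expectation
print(fraction, percent, ratio_to_uniform)
```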

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
17.137AlaAla: 17.137 ± 0.216
0.559AlaCys: 0.559 ± 0.028
6.96AlaAsp: 6.96 ± 0.092
7.705AlaGlu: 7.705 ± 0.118
3.785AlaPhe: 3.785 ± 0.086
10.305AlaGly: 10.305 ± 0.131
2.42AlaHis: 2.42 ± 0.06
6.908AlaIle: 6.908 ± 0.102
3.174AlaLys: 3.174 ± 0.08
13.1AlaLeu: 13.1 ± 0.166
2.551AlaMet: 2.551 ± 0.066
2.916AlaAsn: 2.916 ± 0.066
5.197AlaPro: 5.197 ± 0.103
3.867AlaGln: 3.867 ± 0.069
7.728AlaArg: 7.728 ± 0.105
7.753AlaSer: 7.753 ± 0.113
7.51AlaThr: 7.51 ± 0.109
10.423AlaVal: 10.423 ± 0.148
1.434AlaTrp: 1.434 ± 0.051
2.133AlaTyr: 2.133 ± 0.057
0.0AlaXaa: 0.0 ± 0.0
Cys
0.578CysAla: 0.578 ± 0.027
0.059CysCys: 0.059 ± 0.009
0.299CysAsp: 0.299 ± 0.018
0.289CysGlu: 0.289 ± 0.022
0.174CysPhe: 0.174 ± 0.016
0.516CysGly: 0.516 ± 0.026
0.106CysHis: 0.106 ± 0.013
0.249CysIle: 0.249 ± 0.018
0.069CysLys: 0.069 ± 0.008
0.424CysLeu: 0.424 ± 0.026
0.088CysMet: 0.088 ± 0.009
0.114CysAsn: 0.114 ± 0.012
0.268CysPro: 0.268 ± 0.021
0.131CysGln: 0.131 ± 0.014
0.298CysArg: 0.298 ± 0.02
0.386CysSer: 0.386 ± 0.026
0.361CysThr: 0.361 ± 0.023
0.46CysVal: 0.46 ± 0.025
0.063CysTrp: 0.063 ± 0.009
0.099CysTyr: 0.099 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
7.485AspAla: 7.485 ± 0.098
0.265AspCys: 0.265 ± 0.02
3.777AspAsp: 3.777 ± 0.098
4.123AspGlu: 4.123 ± 0.078
1.937AspPhe: 1.937 ± 0.054
4.982AspGly: 4.982 ± 0.092
1.185AspHis: 1.185 ± 0.036
2.699AspIle: 2.699 ± 0.055
1.26AspLys: 1.26 ± 0.045
5.679AspLeu: 5.679 ± 0.092
0.829AspMet: 0.829 ± 0.036
1.462AspAsn: 1.462 ± 0.046
3.366AspPro: 3.366 ± 0.069
1.689AspGln: 1.689 ± 0.049
3.917AspArg: 3.917 ± 0.085
3.601AspSer: 3.601 ± 0.069
2.986AspThr: 2.986 ± 0.061
4.942AspVal: 4.942 ± 0.086
0.875AspTrp: 0.875 ± 0.032
1.347AspTyr: 1.347 ± 0.048
0.0AspXaa: 0.0 ± 0.0
Glu
6.538GluAla: 6.538 ± 0.108
0.269GluCys: 0.269 ± 0.018
2.38GluAsp: 2.38 ± 0.066
2.922GluGlu: 2.922 ± 0.064
2.129GluPhe: 2.129 ± 0.058
3.881GluGly: 3.881 ± 0.082
1.37GluHis: 1.37 ± 0.044
3.191GluIle: 3.191 ± 0.072
1.727GluLys: 1.727 ± 0.056
7.355GluLeu: 7.355 ± 0.113
1.13GluMet: 1.13 ± 0.04
1.643GluAsn: 1.643 ± 0.046
2.856GluPro: 2.856 ± 0.064
2.262GluGln: 2.262 ± 0.053
4.55GluArg: 4.55 ± 0.086
3.712GluSer: 3.712 ± 0.073
3.332GluThr: 3.332 ± 0.064
4.497GluVal: 4.497 ± 0.077
0.831GluTrp: 0.831 ± 0.033
1.213GluTyr: 1.213 ± 0.042
0.0GluXaa: 0.0 ± 0.0
Phe
4.466PheAla: 4.466 ± 0.083
0.2PheCys: 0.2 ± 0.018
2.554PheAsp: 2.554 ± 0.06
2.057PheGlu: 2.057 ± 0.053
1.176PhePhe: 1.176 ± 0.041
3.478PheGly: 3.478 ± 0.068
0.548PheHis: 0.548 ± 0.028
1.604PheIle: 1.604 ± 0.051
0.56PheLys: 0.56 ± 0.029
2.897PheLeu: 2.897 ± 0.068
0.57PheMet: 0.57 ± 0.028
0.806PheAsn: 0.806 ± 0.032
1.428PhePro: 1.428 ± 0.047
0.763PheGln: 0.763 ± 0.035
1.7PheArg: 1.7 ± 0.057
2.23PheSer: 2.23 ± 0.05
2.254PheThr: 2.254 ± 0.058
3.018PheVal: 3.018 ± 0.063
0.425PheTrp: 0.425 ± 0.023
0.749PheTyr: 0.749 ± 0.029
0.0PheXaa: 0.0 ± 0.0
Gly
9.511GlyAla: 9.511 ± 0.141
0.545GlyCys: 0.545 ± 0.025
4.437GlyAsp: 4.437 ± 0.078
4.661GlyGlu: 4.661 ± 0.078
3.259GlyPhe: 3.259 ± 0.069
6.73GlyGly: 6.73 ± 0.133
1.733GlyHis: 1.733 ± 0.05
5.097GlyIle: 5.097 ± 0.101
2.616GlyLys: 2.616 ± 0.062
8.413GlyLeu: 8.413 ± 0.107
1.909GlyMet: 1.909 ± 0.053
2.102GlyAsn: 2.102 ± 0.055
3.153GlyPro: 3.153 ± 0.074
2.533GlyGln: 2.533 ± 0.06
5.075GlyArg: 5.075 ± 0.09
5.542GlySer: 5.542 ± 0.109
5.294GlyThr: 5.294 ± 0.098
7.36GlyVal: 7.36 ± 0.105
1.383GlyTrp: 1.383 ± 0.04
2.149GlyTyr: 2.149 ± 0.056
0.0GlyXaa: 0.0 ± 0.0
His
2.113HisAla: 2.113 ± 0.056
0.154HisCys: 0.154 ± 0.013
1.262HisAsp: 1.262 ± 0.038
1.166HisGlu: 1.166 ± 0.041
0.613HisPhe: 0.613 ± 0.029
1.764HisGly: 1.764 ± 0.05
0.571HisHis: 0.571 ± 0.029
0.895HisIle: 0.895 ± 0.034
0.415HisLys: 0.415 ± 0.021
2.008HisLeu: 2.008 ± 0.05
0.371HisMet: 0.371 ± 0.023
0.531HisAsn: 0.531 ± 0.025
1.429HisPro: 1.429 ± 0.05
0.573HisGln: 0.573 ± 0.025
1.407HisArg: 1.407 ± 0.047
1.335HisSer: 1.335 ± 0.049
1.093HisThr: 1.093 ± 0.035
1.532HisVal: 1.532 ± 0.045
0.295HisTrp: 0.295 ± 0.022
0.481HisTyr: 0.481 ± 0.025
0.0HisXaa: 0.0 ± 0.0
Ile
7.618IleAla: 7.618 ± 0.107
0.261IleCys: 0.261 ± 0.018
3.852IleAsp: 3.852 ± 0.081
3.338IleGlu: 3.338 ± 0.066
1.638IlePhe: 1.638 ± 0.055
4.895IleGly: 4.895 ± 0.092
0.816IleHis: 0.816 ± 0.03
2.672IleIle: 2.672 ± 0.068
1.114IleLys: 1.114 ± 0.041
4.391IleLeu: 4.391 ± 0.09
0.809IleMet: 0.809 ± 0.032
1.424IleAsn: 1.424 ± 0.046
2.754IlePro: 2.754 ± 0.053
1.264IleGln: 1.264 ± 0.045
2.933IleArg: 2.933 ± 0.064
3.557IleSer: 3.557 ± 0.062
3.539IleThr: 3.539 ± 0.063
5.362IleVal: 5.362 ± 0.103
0.494IleTrp: 0.494 ± 0.025
0.941IleTyr: 0.941 ± 0.038
0.0IleXaa: 0.0 ± 0.0
Lys
2.788LysAla: 2.788 ± 0.08
0.085LysCys: 0.085 ± 0.009
1.27LysAsp: 1.27 ± 0.039
1.215LysGlu: 1.215 ± 0.05
0.779LysPhe: 0.779 ± 0.033
1.708LysGly: 1.708 ± 0.053
0.589LysHis: 0.589 ± 0.025
1.348LysIle: 1.348 ± 0.04
1.144LysLys: 1.144 ± 0.056
2.637LysLeu: 2.637 ± 0.058
0.553LysMet: 0.553 ± 0.026
0.868LysAsn: 0.868 ± 0.036
1.435LysPro: 1.435 ± 0.043
0.91LysGln: 0.91 ± 0.033
1.952LysArg: 1.952 ± 0.05
1.599LysSer: 1.599 ± 0.047
1.569LysThr: 1.569 ± 0.048
1.972LysVal: 1.972 ± 0.064
0.309LysTrp: 0.309 ± 0.022
0.571LysTyr: 0.571 ± 0.027
0.0LysXaa: 0.0 ± 0.0
Leu
13.584LeuAla: 13.584 ± 0.151
0.506LeuCys: 0.506 ± 0.026
6.403LeuAsp: 6.403 ± 0.102
5.401LeuGlu: 5.401 ± 0.082
2.982LeuPhe: 2.982 ± 0.075
9.032LeuGly: 9.032 ± 0.105
1.842LeuHis: 1.842 ± 0.052
5.337LeuIle: 5.337 ± 0.086
2.35LeuLys: 2.35 ± 0.061
9.981LeuLeu: 9.981 ± 0.162
1.77LeuMet: 1.77 ± 0.049
2.499LeuAsn: 2.499 ± 0.05
5.141LeuPro: 5.141 ± 0.082
2.479LeuGln: 2.479 ± 0.06
6.676LeuArg: 6.676 ± 0.12
7.02LeuSer: 7.02 ± 0.099
6.614LeuThr: 6.614 ± 0.099
9.049LeuVal: 9.049 ± 0.126
1.225LeuTrp: 1.225 ± 0.044
1.749LeuTyr: 1.749 ± 0.05
0.0LeuXaa: 0.0 ± 0.0
Met
2.214MetAla: 2.214 ± 0.05
0.088MetCys: 0.088 ± 0.009
0.828MetAsp: 0.828 ± 0.033
0.739MetGlu: 0.739 ± 0.031
0.596MetPhe: 0.596 ± 0.023
1.353MetGly: 1.353 ± 0.043
0.38MetHis: 0.38 ± 0.021
1.091MetIle: 1.091 ± 0.035
0.571MetLys: 0.571 ± 0.03
2.124MetLeu: 2.124 ± 0.047
0.353MetMet: 0.353 ± 0.022
0.645MetAsn: 0.645 ± 0.03
1.113MetPro: 1.113 ± 0.036
0.558MetGln: 0.558 ± 0.03
1.389MetArg: 1.389 ± 0.044
1.542MetSer: 1.542 ± 0.047
1.642MetThr: 1.642 ± 0.054
1.623MetVal: 1.623 ± 0.047
0.211MetTrp: 0.211 ± 0.017
0.295MetTyr: 0.295 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
2.921AsnAla: 2.921 ± 0.062
0.156AsnCys: 0.156 ± 0.015
1.66AsnAsp: 1.66 ± 0.051
1.453AsnGlu: 1.453 ± 0.043
0.951AsnPhe: 0.951 ± 0.035
2.282AsnGly: 2.282 ± 0.059
0.494AsnHis: 0.494 ± 0.024
1.248AsnIle: 1.248 ± 0.044
0.66AsnLys: 0.66 ± 0.031
2.412AsnLeu: 2.412 ± 0.056
0.486AsnMet: 0.486 ± 0.024
0.775AsnAsn: 0.775 ± 0.036
1.895AsnPro: 1.895 ± 0.055
0.81AsnGln: 0.81 ± 0.03
1.647AsnArg: 1.647 ± 0.049
1.677AsnSer: 1.677 ± 0.054
1.515AsnThr: 1.515 ± 0.042
2.078AsnVal: 2.078 ± 0.052
0.404AsnTrp: 0.404 ± 0.024
0.648AsnTyr: 0.648 ± 0.034
0.0AsnXaa: 0.0 ± 0.0
Pro
5.841ProAla: 5.841 ± 0.087
0.18ProCys: 0.18 ± 0.017
3.064ProAsp: 3.064 ± 0.062
3.639ProGlu: 3.639 ± 0.073
1.602ProPhe: 1.602 ± 0.044
4.162ProGly: 4.162 ± 0.079
1.096ProHis: 1.096 ± 0.04
2.512ProIle: 2.512 ± 0.065
1.228ProLys: 1.228 ± 0.043
4.628ProLeu: 4.628 ± 0.08
0.896ProMet: 0.896 ± 0.034
1.332ProAsn: 1.332 ± 0.038
1.839ProPro: 1.839 ± 0.055
1.604ProGln: 1.604 ± 0.048
2.813ProArg: 2.813 ± 0.063
3.183ProSer: 3.183 ± 0.061
3.254ProThr: 3.254 ± 0.065
4.093ProVal: 4.093 ± 0.073
0.753ProTrp: 0.753 ± 0.026
1.011ProTyr: 1.011 ± 0.036
0.0ProXaa: 0.0 ± 0.0
Gln
3.258GlnAla: 3.258 ± 0.06
0.14GlnCys: 0.14 ± 0.013
1.164GlnAsp: 1.164 ± 0.038
1.377GlnGlu: 1.377 ± 0.05
1.019GlnPhe: 1.019 ± 0.035
1.979GlnGly: 1.979 ± 0.044
0.745GlnHis: 0.745 ± 0.03
1.714GlnIle: 1.714 ± 0.039
0.83GlnLys: 0.83 ± 0.034
3.713GlnLeu: 3.713 ± 0.069
0.643GlnMet: 0.643 ± 0.033
0.83GlnAsn: 0.83 ± 0.029
1.447GlnPro: 1.447 ± 0.042
1.383GlnGln: 1.383 ± 0.053
2.419GlnArg: 2.419 ± 0.07
1.869GlnSer: 1.869 ± 0.046
1.564GlnThr: 1.564 ± 0.044
2.445GlnVal: 2.445 ± 0.063
0.553GlnTrp: 0.553 ± 0.026
0.674GlnTyr: 0.674 ± 0.031
0.0GlnXaa: 0.0 ± 0.0
Arg
7.39ArgAla: 7.39 ± 0.111
0.303ArgCys: 0.303 ± 0.022
3.883ArgAsp: 3.883 ± 0.083
4.07ArgGlu: 4.07 ± 0.075
2.235ArgPhe: 2.235 ± 0.05
4.735ArgGly: 4.735 ± 0.089
1.412ArgHis: 1.412 ± 0.043
3.631ArgIle: 3.631 ± 0.076
1.764ArgLys: 1.764 ± 0.058
6.385ArgLeu: 6.385 ± 0.103
1.662ArgMet: 1.662 ± 0.048
1.658ArgAsn: 1.658 ± 0.053
2.797ArgPro: 2.797 ± 0.068
2.032ArgGln: 2.032 ± 0.053
4.963ArgArg: 4.963 ± 0.101
4.292ArgSer: 4.292 ± 0.081
3.836ArgThr: 3.836 ± 0.072
5.344ArgVal: 5.344 ± 0.082
1.046ArgTrp: 1.046 ± 0.038
1.559ArgTyr: 1.559 ± 0.047
0.0ArgXaa: 0.0 ± 0.0
Ser
8.004SerAla: 8.004 ± 0.138
0.283SerCys: 0.283 ± 0.018
3.688SerAsp: 3.688 ± 0.079
3.631SerGlu: 3.631 ± 0.078
2.253SerPhe: 2.253 ± 0.057
6.244SerGly: 6.244 ± 0.108
1.225SerHis: 1.225 ± 0.039
3.491SerIle: 3.491 ± 0.065
1.595SerLys: 1.595 ± 0.045
6.305SerLeu: 6.305 ± 0.101
1.374SerMet: 1.374 ± 0.041
1.68SerAsn: 1.68 ± 0.045
3.173SerPro: 3.173 ± 0.059
1.868SerGln: 1.868 ± 0.046
4.056SerArg: 4.056 ± 0.076
4.463SerSer: 4.463 ± 0.092
4.29SerThr: 4.29 ± 0.082
5.504SerVal: 5.504 ± 0.093
0.994SerTrp: 0.994 ± 0.038
1.408SerTyr: 1.408 ± 0.041
0.0SerXaa: 0.0 ± 0.0
Thr
7.529ThrAla: 7.529 ± 0.105
0.28ThrCys: 0.28 ± 0.022
3.518ThrAsp: 3.518 ± 0.079
3.447ThrGlu: 3.447 ± 0.066
2.049ThrPhe: 2.049 ± 0.051
5.605ThrGly: 5.605 ± 0.101
1.18ThrHis: 1.18 ± 0.041
3.607ThrIle: 3.607 ± 0.078
1.554ThrLys: 1.554 ± 0.051
6.215ThrLeu: 6.215 ± 0.098
1.111ThrMet: 1.111 ± 0.037
1.633ThrAsn: 1.633 ± 0.049
3.672ThrPro: 3.672 ± 0.077
1.749ThrGln: 1.749 ± 0.045
3.571ThrArg: 3.571 ± 0.071
3.825ThrSer: 3.825 ± 0.071
4.066ThrThr: 4.066 ± 0.089
5.861ThrVal: 5.861 ± 0.108
0.768ThrTrp: 0.768 ± 0.035
1.163ThrTyr: 1.163 ± 0.037
0.0ThrXaa: 0.0 ± 0.0
Val
10.886ValAla: 10.886 ± 0.135
0.456ValCys: 0.456 ± 0.027
5.382ValAsp: 5.382 ± 0.088
4.805ValGlu: 4.805 ± 0.079
2.939ValPhe: 2.939 ± 0.059
7.076ValGly: 7.076 ± 0.086
1.668ValHis: 1.668 ± 0.047
4.86ValIle: 4.86 ± 0.088
1.887ValLys: 1.887 ± 0.05
9.006ValLeu: 9.006 ± 0.126
1.618ValMet: 1.618 ± 0.046
2.182ValAsn: 2.182 ± 0.047
4.096ValPro: 4.096 ± 0.067
2.11ValGln: 2.11 ± 0.052
5.359ValArg: 5.359 ± 0.089
5.699ValSer: 5.699 ± 0.093
5.755ValThr: 5.755 ± 0.113
8.283ValVal: 8.283 ± 0.137
1.048ValTrp: 1.048 ± 0.04
1.45ValTyr: 1.45 ± 0.046
0.0ValXaa: 0.0 ± 0.0
Trp
1.474TrpAla: 1.474 ± 0.047
0.099TrpCys: 0.099 ± 0.01
0.675TrpAsp: 0.675 ± 0.031
0.59TrpGlu: 0.59 ± 0.028
0.525TrpPhe: 0.525 ± 0.021
0.909TrpGly: 0.909 ± 0.033
0.311TrpHis: 0.311 ± 0.022
0.713TrpIle: 0.713 ± 0.029
0.366TrpLys: 0.366 ± 0.021
1.683TrpLeu: 1.683 ± 0.051
0.358TrpMet: 0.358 ± 0.023
0.509TrpAsn: 0.509 ± 0.025
0.653TrpPro: 0.653 ± 0.034
0.574TrpGln: 0.574 ± 0.03
1.021TrpArg: 1.021 ± 0.035
0.86TrpSer: 0.86 ± 0.033
0.74TrpThr: 0.74 ± 0.034
1.088TrpVal: 1.088 ± 0.038
0.305TrpTrp: 0.305 ± 0.022
0.261TrpTyr: 0.261 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.23TyrAla: 2.23 ± 0.054
0.141TyrCys: 0.141 ± 0.013
1.348TyrAsp: 1.348 ± 0.04
1.159TyrGlu: 1.159 ± 0.038
0.843TyrPhe: 0.843 ± 0.031
1.867TyrGly: 1.867 ± 0.05
0.315TyrHis: 0.315 ± 0.02
0.808TyrIle: 0.808 ± 0.032
0.415TyrLys: 0.415 ± 0.023
2.287TyrLeu: 2.287 ± 0.06
0.273TyrMet: 0.273 ± 0.02
0.564TyrAsn: 0.564 ± 0.028
1.021TyrPro: 1.021 ± 0.039
0.608TyrGln: 0.608 ± 0.03
1.523TyrArg: 1.523 ± 0.046
1.384TyrSer: 1.384 ± 0.043
1.189TyrThr: 1.189 ± 0.045
1.629TyrVal: 1.629 ± 0.041
0.299TyrTrp: 0.299 ± 0.023
0.476TyrTyr: 0.476 ± 0.027
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2670 proteins (799840 amino acids)
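If the rows are copied as text in the form shown above (cell value, dipeptide name, then value ± error), they can be read back into a lookup table. The following is a minimal Python sketch under that assumption; the regular expression and the function name are illustrative, and the sample lines are taken from the Ala and Leu rows.

```python
import re

# One table cell as it appears in the text: "<value><First><Second>: <value> ± <error>"
ROW = re.compile(
    r"^(?P<cell>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "
    r"(?P<value>\d+(?:\.\d+)?) ± (?P<error>\d+(?:\.\d+)?)$"
)


def parse_row(line):
    """Return (first, second, value, error), or None for row labels and other text."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    return m["first"], m["second"], float(m["value"]), float(m["error"])


sample = [
    "Ala",
    "13.1AlaLeu: 13.1 ± 0.166",
    "Leu",
    "13.584LeuAla: 13.584 ± 0.151",
]

table = {}
for line in sample:
    parsed = parse_row(line)
    if parsed is not None:
        first, second, value, error = parsed
        table[(first, second)] = (value, error)

print(table)
```

Keeping the key as an ordered pair matters, because the table is not symmetric: AlaLeu (13.1‰) and LeuAla (13.584‰) are separate entries.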

Note: The error was estimated by bootstrapping (×100) at the protein level.
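Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. This is only an illustration; it assumes that the error is the standard deviation over 100 replicates, which the note above does not state explicitly, and the function names, seed, and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from Leifsonia rubra.
toy = ["MAALAG", "AAV", "GGGG", "MKAAL"]
error = bootstrap_error(toy, "AA")
print(error)
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides are never formed across protein boundaries.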

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski