Amino acid dipepetide frequency for Exilibacterium tricleocarpae

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.411AlaAla: 13.411 ± 0.143
1.141AlaCys: 1.141 ± 0.027
6.313AlaAsp: 6.313 ± 0.062
6.161AlaGlu: 6.161 ± 0.071
3.621AlaPhe: 3.621 ± 0.048
9.552AlaGly: 9.552 ± 0.094
1.795AlaHis: 1.795 ± 0.035
5.242AlaIle: 5.242 ± 0.059
3.307AlaLys: 3.307 ± 0.052
11.379AlaLeu: 11.379 ± 0.104
2.234AlaMet: 2.234 ± 0.041
3.152AlaAsn: 3.152 ± 0.053
4.383AlaPro: 4.383 ± 0.071
4.217AlaGln: 4.217 ± 0.064
6.136AlaArg: 6.136 ± 0.055
5.315AlaSer: 5.315 ± 0.055
5.405AlaThr: 5.405 ± 0.057
8.012AlaVal: 8.012 ± 0.073
1.218AlaTrp: 1.218 ± 0.027
2.752AlaTyr: 2.752 ± 0.035
0.0AlaXaa: 0.0 ± 0.0
Cys
1.021CysAla: 1.021 ± 0.023
0.174CysCys: 0.174 ± 0.01
0.62CysAsp: 0.62 ± 0.021
0.583CysGlu: 0.583 ± 0.019
0.409CysPhe: 0.409 ± 0.017
0.973CysGly: 0.973 ± 0.027
0.319CysHis: 0.319 ± 0.015
0.511CysIle: 0.511 ± 0.017
0.309CysLys: 0.309 ± 0.011
1.101CysLeu: 1.101 ± 0.029
0.17CysMet: 0.17 ± 0.01
0.317CysAsn: 0.317 ± 0.013
0.468CysPro: 0.468 ± 0.016
0.353CysGln: 0.353 ± 0.013
0.769CysArg: 0.769 ± 0.019
0.637CysSer: 0.637 ± 0.019
0.469CysThr: 0.469 ± 0.016
0.703CysVal: 0.703 ± 0.023
0.146CysTrp: 0.146 ± 0.009
0.31CysTyr: 0.31 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
5.651AspAla: 5.651 ± 0.092
0.618AspCys: 0.618 ± 0.02
3.413AspAsp: 3.413 ± 0.071
3.13AspGlu: 3.13 ± 0.047
2.557AspPhe: 2.557 ± 0.04
4.994AspGly: 4.994 ± 0.084
1.232AspHis: 1.232 ± 0.024
4.039AspIle: 4.039 ± 0.055
2.247AspLys: 2.247 ± 0.034
5.729AspLeu: 5.729 ± 0.06
1.276AspMet: 1.276 ± 0.029
2.441AspAsn: 2.441 ± 0.042
2.963AspPro: 2.963 ± 0.052
2.067AspGln: 2.067 ± 0.033
3.684AspArg: 3.684 ± 0.047
3.541AspSer: 3.541 ± 0.052
3.596AspThr: 3.596 ± 0.048
3.638AspVal: 3.638 ± 0.047
0.998AspTrp: 0.998 ± 0.025
2.21AspTyr: 2.21 ± 0.041
0.0AspXaa: 0.0 ± 0.0
Glu
5.818GluAla: 5.818 ± 0.067
0.413GluCys: 0.413 ± 0.016
2.785GluAsp: 2.785 ± 0.045
3.183GluGlu: 3.183 ± 0.049
2.093GluPhe: 2.093 ± 0.038
3.595GluGly: 3.595 ± 0.043
1.361GluHis: 1.361 ± 0.031
3.219GluIle: 3.219 ± 0.048
2.577GluLys: 2.577 ± 0.042
6.121GluLeu: 6.121 ± 0.078
1.224GluMet: 1.224 ± 0.026
2.129GluAsn: 2.129 ± 0.04
2.43GluPro: 2.43 ± 0.041
3.195GluGln: 3.195 ± 0.044
4.111GluArg: 4.111 ± 0.056
3.052GluSer: 3.052 ± 0.044
3.237GluThr: 3.237 ± 0.042
4.226GluVal: 4.226 ± 0.05
0.644GluTrp: 0.644 ± 0.017
1.593GluTyr: 1.593 ± 0.035
0.0GluXaa: 0.0 ± 0.0
Phe
3.581PheAla: 3.581 ± 0.044
0.485PheCys: 0.485 ± 0.017
2.932PheAsp: 2.932 ± 0.043
2.305PheGlu: 2.305 ± 0.042
1.675PhePhe: 1.675 ± 0.033
3.216PheGly: 3.216 ± 0.05
0.771PheHis: 0.771 ± 0.023
2.082PheIle: 2.082 ± 0.033
1.349PheLys: 1.349 ± 0.028
3.238PheLeu: 3.238 ± 0.053
0.826PheMet: 0.826 ± 0.021
1.672PheAsn: 1.672 ± 0.03
1.43PhePro: 1.43 ± 0.028
1.209PheGln: 1.209 ± 0.025
1.954PheArg: 1.954 ± 0.031
3.031PheSer: 3.031 ± 0.048
2.399PheThr: 2.399 ± 0.037
2.536PheVal: 2.536 ± 0.037
0.531PheTrp: 0.531 ± 0.017
1.324PheTyr: 1.324 ± 0.03
0.0PheXaa: 0.0 ± 0.0
Gly
7.584GlyAla: 7.584 ± 0.075
1.007GlyCys: 1.007 ± 0.027
4.946GlyAsp: 4.946 ± 0.066
4.757GlyGlu: 4.757 ± 0.064
3.47GlyPhe: 3.47 ± 0.052
6.511GlyGly: 6.511 ± 0.085
1.652GlyHis: 1.652 ± 0.032
4.308GlyIle: 4.308 ± 0.051
3.098GlyLys: 3.098 ± 0.044
7.837GlyLeu: 7.837 ± 0.071
1.798GlyMet: 1.798 ± 0.035
3.012GlyAsn: 3.012 ± 0.092
2.534GlyPro: 2.534 ± 0.04
2.851GlyGln: 2.851 ± 0.047
4.888GlyArg: 4.888 ± 0.051
4.653GlySer: 4.653 ± 0.062
4.137GlyThr: 4.137 ± 0.058
5.826GlyVal: 5.826 ± 0.065
1.135GlyTrp: 1.135 ± 0.029
2.592GlyTyr: 2.592 ± 0.039
0.0GlyXaa: 0.0 ± 0.0
His
1.668HisAla: 1.668 ± 0.028
0.345HisCys: 0.345 ± 0.013
0.988HisAsp: 0.988 ± 0.023
0.958HisGlu: 0.958 ± 0.026
0.952HisPhe: 0.952 ± 0.021
1.565HisGly: 1.565 ± 0.033
0.647HisHis: 0.647 ± 0.022
1.254HisIle: 1.254 ± 0.024
0.733HisLys: 0.733 ± 0.021
2.395HisLeu: 2.395 ± 0.034
0.47HisMet: 0.47 ± 0.017
0.784HisAsn: 0.784 ± 0.022
1.203HisPro: 1.203 ± 0.028
0.94HisGln: 0.94 ± 0.024
1.667HisArg: 1.667 ± 0.03
1.294HisSer: 1.294 ± 0.026
1.169HisThr: 1.169 ± 0.025
1.07HisVal: 1.07 ± 0.02
0.408HisTrp: 0.408 ± 0.015
0.964HisTyr: 0.964 ± 0.026
0.0HisXaa: 0.0 ± 0.0
Ile
5.821IleAla: 5.821 ± 0.062
0.594IleCys: 0.594 ± 0.019
4.202IleAsp: 4.202 ± 0.048
3.808IleGlu: 3.808 ± 0.05
1.742IlePhe: 1.742 ± 0.034
4.159IleGly: 4.159 ± 0.049
1.056IleHis: 1.056 ± 0.022
2.31IleIle: 2.31 ± 0.043
2.046IleLys: 2.046 ± 0.031
4.138IleLeu: 4.138 ± 0.057
0.853IleMet: 0.853 ± 0.022
2.236IleAsn: 2.236 ± 0.039
2.348IlePro: 2.348 ± 0.033
1.658IleGln: 1.658 ± 0.03
2.922IleArg: 2.922 ± 0.041
3.4IleSer: 3.4 ± 0.048
3.238IleThr: 3.238 ± 0.046
3.596IleVal: 3.596 ± 0.055
0.532IleTrp: 0.532 ± 0.016
1.495IleTyr: 1.495 ± 0.028
0.0IleXaa: 0.0 ± 0.0
Lys
3.454LysAla: 3.454 ± 0.051
0.244LysCys: 0.244 ± 0.012
1.755LysAsp: 1.755 ± 0.033
1.792LysGlu: 1.792 ± 0.037
1.088LysPhe: 1.088 ± 0.023
2.317LysGly: 2.317 ± 0.032
0.839LysHis: 0.839 ± 0.022
1.926LysIle: 1.926 ± 0.036
1.823LysLys: 1.823 ± 0.039
3.721LysLeu: 3.721 ± 0.047
0.754LysMet: 0.754 ± 0.02
1.414LysAsn: 1.414 ± 0.028
1.878LysPro: 1.878 ± 0.039
1.634LysGln: 1.634 ± 0.034
2.357LysArg: 2.357 ± 0.038
2.185LysSer: 2.185 ± 0.038
2.24LysThr: 2.24 ± 0.033
2.643LysVal: 2.643 ± 0.037
0.382LysTrp: 0.382 ± 0.013
0.99LysTyr: 0.99 ± 0.024
0.0LysXaa: 0.0 ± 0.0
Leu
11.971LeuAla: 11.971 ± 0.1
1.13LeuCys: 1.13 ± 0.027
6.036LeuAsp: 6.036 ± 0.061
5.952LeuGlu: 5.952 ± 0.068
3.793LeuPhe: 3.793 ± 0.058
7.671LeuGly: 7.671 ± 0.071
2.138LeuHis: 2.138 ± 0.037
4.916LeuIle: 4.916 ± 0.058
3.73LeuLys: 3.73 ± 0.048
11.1LeuLeu: 11.1 ± 0.121
2.043LeuMet: 2.043 ± 0.036
3.635LeuAsn: 3.635 ± 0.048
5.278LeuPro: 5.278 ± 0.065
4.746LeuGln: 4.746 ± 0.056
6.376LeuArg: 6.376 ± 0.068
7.051LeuSer: 7.051 ± 0.076
5.649LeuThr: 5.649 ± 0.062
7.289LeuVal: 7.289 ± 0.072
1.269LeuTrp: 1.269 ± 0.031
2.652LeuTyr: 2.652 ± 0.037
0.0LeuXaa: 0.0 ± 0.0
Met
2.315MetAla: 2.315 ± 0.038
0.147MetCys: 0.147 ± 0.008
1.1MetAsp: 1.1 ± 0.025
1.09MetGlu: 1.09 ± 0.024
0.619MetPhe: 0.619 ± 0.02
1.509MetGly: 1.509 ± 0.031
0.409MetHis: 0.409 ± 0.014
1.0MetIle: 1.0 ± 0.023
0.889MetLys: 0.889 ± 0.023
2.119MetLeu: 2.119 ± 0.038
0.451MetMet: 0.451 ± 0.016
0.759MetAsn: 0.759 ± 0.023
1.093MetPro: 1.093 ± 0.022
0.888MetGln: 0.888 ± 0.022
1.203MetArg: 1.203 ± 0.026
1.383MetSer: 1.383 ± 0.03
1.284MetThr: 1.284 ± 0.026
1.452MetVal: 1.452 ± 0.031
0.164MetTrp: 0.164 ± 0.009
0.381MetTyr: 0.381 ± 0.014
0.0MetXaa: 0.0 ± 0.0
Asn
3.265AsnAla: 3.265 ± 0.052
0.399AsnCys: 0.399 ± 0.016
1.953AsnAsp: 1.953 ± 0.035
1.574AsnGlu: 1.574 ± 0.031
1.35AsnPhe: 1.35 ± 0.029
2.96AsnGly: 2.96 ± 0.063
0.782AsnHis: 0.782 ± 0.022
2.062AsnIle: 2.062 ± 0.032
1.168AsnLys: 1.168 ± 0.025
3.571AsnLeu: 3.571 ± 0.052
0.713AsnMet: 0.713 ± 0.018
1.515AsnAsn: 1.515 ± 0.045
2.144AsnPro: 2.144 ± 0.042
1.427AsnGln: 1.427 ± 0.035
2.609AsnArg: 2.609 ± 0.061
1.976AsnSer: 1.976 ± 0.038
2.2AsnThr: 2.2 ± 0.039
2.07AsnVal: 2.07 ± 0.04
0.542AsnTrp: 0.542 ± 0.016
1.227AsnTyr: 1.227 ± 0.029
0.0AsnXaa: 0.0 ± 0.0
Pro
5.592ProAla: 5.592 ± 0.075
0.334ProCys: 0.334 ± 0.015
3.144ProAsp: 3.144 ± 0.041
2.97ProGlu: 2.97 ± 0.042
1.695ProPhe: 1.695 ± 0.029
4.017ProGly: 4.017 ± 0.047
0.935ProHis: 0.935 ± 0.021
2.068ProIle: 2.068 ± 0.038
1.386ProLys: 1.386 ± 0.028
4.758ProLeu: 4.758 ± 0.063
0.887ProMet: 0.887 ± 0.023
1.432ProAsn: 1.432 ± 0.029
2.233ProPro: 2.233 ± 0.047
2.021ProGln: 2.021 ± 0.038
2.29ProArg: 2.29 ± 0.041
2.319ProSer: 2.319 ± 0.04
2.196ProThr: 2.196 ± 0.036
3.814ProVal: 3.814 ± 0.047
0.59ProTrp: 0.59 ± 0.02
1.215ProTyr: 1.215 ± 0.023
0.0ProXaa: 0.0 ± 0.0
Gln
4.678GlnAla: 4.678 ± 0.061
0.364GlnCys: 0.364 ± 0.015
1.707GlnAsp: 1.707 ± 0.032
1.944GlnGlu: 1.944 ± 0.033
1.453GlnPhe: 1.453 ± 0.025
2.782GlnGly: 2.782 ± 0.044
1.025GlnHis: 1.025 ± 0.024
1.958GlnIle: 1.958 ± 0.031
1.341GlnLys: 1.341 ± 0.025
5.032GlnLeu: 5.032 ± 0.06
0.851GlnMet: 0.851 ± 0.022
1.156GlnAsn: 1.156 ± 0.027
2.131GlnPro: 2.131 ± 0.036
2.626GlnGln: 2.626 ± 0.043
3.222GlnArg: 3.222 ± 0.042
2.319GlnSer: 2.319 ± 0.033
2.287GlnThr: 2.287 ± 0.039
3.273GlnVal: 3.273 ± 0.045
0.707GlnTrp: 0.707 ± 0.017
1.124GlnTyr: 1.124 ± 0.025
0.0GlnXaa: 0.0 ± 0.0
Arg
5.715ArgAla: 5.715 ± 0.063
0.637ArgCys: 0.637 ± 0.019
3.722ArgAsp: 3.722 ± 0.047
4.069ArgGlu: 4.069 ± 0.052
2.74ArgPhe: 2.74 ± 0.04
4.219ArgGly: 4.219 ± 0.047
1.711ArgHis: 1.711 ± 0.029
3.359ArgIle: 3.359 ± 0.049
2.111ArgLys: 2.111 ± 0.037
7.303ArgLeu: 7.303 ± 0.07
1.229ArgMet: 1.229 ± 0.026
2.025ArgAsn: 2.025 ± 0.039
2.58ArgPro: 2.58 ± 0.045
3.404ArgGln: 3.404 ± 0.04
4.835ArgArg: 4.835 ± 0.061
3.323ArgSer: 3.323 ± 0.041
2.829ArgThr: 2.829 ± 0.046
4.332ArgVal: 4.332 ± 0.052
1.005ArgTrp: 1.005 ± 0.023
2.344ArgTyr: 2.344 ± 0.035
0.0ArgXaa: 0.0 ± 0.0
Ser
5.96SerAla: 5.96 ± 0.062
0.565SerCys: 0.565 ± 0.018
3.523SerAsp: 3.523 ± 0.047
3.078SerGlu: 3.078 ± 0.045
2.454SerPhe: 2.454 ± 0.044
5.45SerGly: 5.45 ± 0.065
1.313SerHis: 1.313 ± 0.03
3.206SerIle: 3.206 ± 0.043
1.876SerLys: 1.876 ± 0.036
6.401SerLeu: 6.401 ± 0.072
1.181SerMet: 1.181 ± 0.025
1.978SerAsn: 1.978 ± 0.033
2.71SerPro: 2.71 ± 0.04
2.202SerGln: 2.202 ± 0.032
3.716SerArg: 3.716 ± 0.046
3.529SerSer: 3.529 ± 0.06
3.137SerThr: 3.137 ± 0.043
4.367SerVal: 4.367 ± 0.053
0.786SerTrp: 0.786 ± 0.022
1.864SerTyr: 1.864 ± 0.041
0.0SerXaa: 0.0 ± 0.0
Thr
6.076ThrAla: 6.076 ± 0.066
0.479ThrCys: 0.479 ± 0.015
3.479ThrAsp: 3.479 ± 0.057
3.032ThrGlu: 3.032 ± 0.043
2.026ThrPhe: 2.026 ± 0.038
4.945ThrGly: 4.945 ± 0.064
1.126ThrHis: 1.126 ± 0.024
2.684ThrIle: 2.684 ± 0.046
1.428ThrLys: 1.428 ± 0.03
6.461ThrLeu: 6.461 ± 0.061
0.921ThrMet: 0.921 ± 0.024
1.613ThrAsn: 1.613 ± 0.037
3.068ThrPro: 3.068 ± 0.043
1.923ThrGln: 1.923 ± 0.032
3.274ThrArg: 3.274 ± 0.058
2.952ThrSer: 2.952 ± 0.043
3.257ThrThr: 3.257 ± 0.066
4.336ThrVal: 4.336 ± 0.054
0.723ThrTrp: 0.723 ± 0.021
1.764ThrTyr: 1.764 ± 0.046
0.0ThrXaa: 0.0 ± 0.0
Val
7.67ValAla: 7.67 ± 0.087
0.755ValCys: 0.755 ± 0.023
4.678ValAsp: 4.678 ± 0.063
4.433ValGlu: 4.433 ± 0.051
2.933ValPhe: 2.933 ± 0.048
4.927ValGly: 4.927 ± 0.052
1.273ValHis: 1.273 ± 0.023
3.939ValIle: 3.939 ± 0.049
2.657ValLys: 2.657 ± 0.04
7.0ValLeu: 7.0 ± 0.066
1.534ValMet: 1.534 ± 0.034
2.725ValAsn: 2.725 ± 0.04
3.106ValPro: 3.106 ± 0.045
2.318ValGln: 2.318 ± 0.038
3.913ValArg: 3.913 ± 0.049
4.654ValSer: 4.654 ± 0.052
4.465ValThr: 4.465 ± 0.059
5.834ValVal: 5.834 ± 0.081
0.817ValTrp: 0.817 ± 0.02
2.064ValTyr: 2.064 ± 0.032
0.0ValXaa: 0.0 ± 0.0
Trp
0.968TrpAla: 0.968 ± 0.025
0.157TrpCys: 0.157 ± 0.01
0.691TrpAsp: 0.691 ± 0.02
0.692TrpGlu: 0.692 ± 0.02
0.571TrpPhe: 0.571 ± 0.017
0.902TrpGly: 0.902 ± 0.025
0.366TrpHis: 0.366 ± 0.014
0.627TrpIle: 0.627 ± 0.019
0.383TrpLys: 0.383 ± 0.014
1.713TrpLeu: 1.713 ± 0.035
0.295TrpMet: 0.295 ± 0.012
0.449TrpAsn: 0.449 ± 0.015
0.566TrpPro: 0.566 ± 0.017
0.876TrpGln: 0.876 ± 0.026
1.057TrpArg: 1.057 ± 0.026
0.855TrpSer: 0.855 ± 0.025
0.611TrpThr: 0.611 ± 0.018
0.926TrpVal: 0.926 ± 0.021
0.217TrpTrp: 0.217 ± 0.011
0.37TrpTyr: 0.37 ± 0.014
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.527TyrAla: 2.527 ± 0.035
0.374TyrCys: 0.374 ± 0.014
2.035TyrAsp: 2.035 ± 0.107
1.495TyrGlu: 1.495 ± 0.031
1.306TyrPhe: 1.306 ± 0.031
2.331TyrGly: 2.331 ± 0.047
0.726TyrHis: 0.726 ± 0.02
1.448TyrIle: 1.448 ± 0.029
0.979TyrLys: 0.979 ± 0.026
3.308TyrLeu: 3.308 ± 0.043
0.541TyrMet: 0.541 ± 0.018
1.061TyrAsn: 1.061 ± 0.03
1.349TyrPro: 1.349 ± 0.027
1.425TyrGln: 1.425 ± 0.027
2.508TyrArg: 2.508 ± 0.038
1.827TyrSer: 1.827 ± 0.033
1.762TyrThr: 1.762 ± 0.033
1.75TyrVal: 1.75 ± 0.032
0.48TyrTrp: 0.48 ± 0.017
1.015TyrTyr: 1.015 ± 0.025
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5556 proteins (2009629 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski