Amino acid dipeptide frequency for Frankia sp. (strain EAN1pec)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
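As an illustration of the calculation described above, the following minimal sketch counts dipeptides in a set of protein sequences and compares each frequency with the uniform expectation of 1/400. It is not the code used to build this table. It assumes one-letter amino acid codes (the table uses three-letter codes), overlapping windows of two residues, and no pairs spanning two proteins; the names `dipeptide_per_mille`, `classify` and `toy_proteome` are hypothetical.

```python
from collections import Counter

# Uniform expectation: 1 / 400 of all dipeptides, i.e. 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 (assumption); pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Hypothetical toy sequences, not taken from the Frankia proteome.
toy_proteome = ["MAALAG", "MAAGR"]
freqs = dipeptide_per_mille(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

With values from the table, `classify(23.133)` (AlaAla) falls in the overrepresented group and `classify(1.118)` (AlaCys) in the underrepresented group.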

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
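The unit conversion can be sketched as follows. The two example values (AlaAla and CysLys) are taken from the table; the function names are hypothetical, and the comparison against 2.5‰ assumes the uniform expectation of 1/400 stated earlier.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_vs_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_PER_MILLE


ala_ala = 23.133  # AlaAla, per mille, from the table
cys_lys = 0.068   # CysLys, per mille, from the table

print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(fold_vs_uniform(ala_ala))        # greater than 1: overrepresented
print(fold_vs_uniform(cys_lys))        # less than 1: underrepresented
```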

For more information, see the sequence space article on Wikipedia.

Order of amino acids: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by the first amino acid of the dipeptide; each value is the frequency ± error in ‰.

Ala
AlaAla: 23.133 ± 0.17
AlaCys: 1.118 ± 0.022
AlaAsp: 8.976 ± 0.077
AlaGlu: 7.906 ± 0.057
AlaPhe: 3.4 ± 0.04
AlaGly: 15.554 ± 0.112
AlaHis: 2.718 ± 0.031
AlaIle: 3.793 ± 0.042
AlaLys: 1.816 ± 0.029
AlaLeu: 13.515 ± 0.095
AlaMet: 2.44 ± 0.032
AlaAsn: 2.007 ± 0.032
AlaPro: 7.686 ± 0.077
AlaGln: 3.438 ± 0.037
AlaArg: 11.669 ± 0.091
AlaSer: 6.534 ± 0.062
AlaThr: 7.785 ± 0.065
AlaVal: 12.61 ± 0.093
AlaTrp: 1.744 ± 0.029
AlaTyr: 2.255 ± 0.033
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.067 ± 0.022
CysCys: 0.098 ± 0.007
CysAsp: 0.5 ± 0.013
CysGlu: 0.387 ± 0.012
CysPhe: 0.223 ± 0.009
CysGly: 0.937 ± 0.02
CysHis: 0.208 ± 0.01
CysIle: 0.172 ± 0.009
CysLys: 0.068 ± 0.005
CysLeu: 0.781 ± 0.019
CysMet: 0.117 ± 0.007
CysAsn: 0.118 ± 0.006
CysPro: 0.536 ± 0.017
CysGln: 0.211 ± 0.009
CysArg: 0.701 ± 0.019
CysSer: 0.489 ± 0.015
CysThr: 0.399 ± 0.013
CysVal: 0.653 ± 0.018
CysTrp: 0.145 ± 0.008
CysTyr: 0.157 ± 0.008
CysXaa: 0.0 ± 0.0

Asp
AspAla: 7.575 ± 0.065
AspCys: 0.409 ± 0.015
AspAsp: 3.906 ± 0.046
AspGlu: 3.687 ± 0.038
AspPhe: 1.434 ± 0.026
AspGly: 6.509 ± 0.063
AspHis: 1.517 ± 0.023
AspIle: 1.959 ± 0.03
AspLys: 0.699 ± 0.019
AspLeu: 6.992 ± 0.069
AspMet: 0.742 ± 0.018
AspAsn: 0.917 ± 0.019
AspPro: 5.069 ± 0.051
AspGln: 1.754 ± 0.031
AspArg: 5.357 ± 0.055
AspSer: 2.531 ± 0.032
AspThr: 3.068 ± 0.043
AspVal: 5.204 ± 0.051
AspTrp: 0.88 ± 0.019
AspTyr: 1.08 ± 0.023
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.306 ± 0.055
GluCys: 0.332 ± 0.012
GluAsp: 2.267 ± 0.035
GluGlu: 2.421 ± 0.035
GluPhe: 1.4 ± 0.029
GluGly: 3.267 ± 0.044
GluHis: 1.45 ± 0.026
GluIle: 2.532 ± 0.034
GluLys: 1.026 ± 0.024
GluLeu: 6.253 ± 0.063
GluMet: 0.815 ± 0.017
GluAsn: 0.948 ± 0.018
GluPro: 3.45 ± 0.045
GluGln: 2.012 ± 0.031
GluArg: 5.17 ± 0.05
GluSer: 2.354 ± 0.033
GluThr: 2.772 ± 0.032
GluVal: 4.202 ± 0.05
GluTrp: 0.722 ± 0.017
GluTyr: 0.941 ± 0.02
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.574 ± 0.041
PheCys: 0.278 ± 0.01
PheAsp: 2.058 ± 0.034
PheGlu: 1.338 ± 0.027
PhePhe: 0.811 ± 0.021
PheGly: 3.056 ± 0.044
PheHis: 0.587 ± 0.016
PheIle: 0.7 ± 0.019
PheLys: 0.305 ± 0.012
PheLeu: 2.434 ± 0.033
PheMet: 0.336 ± 0.012
PheAsn: 0.484 ± 0.015
PhePro: 1.434 ± 0.024
PheGln: 0.602 ± 0.016
PheArg: 1.834 ± 0.028
PheSer: 1.365 ± 0.027
PheThr: 1.866 ± 0.03
PheVal: 2.345 ± 0.033
PheTrp: 0.361 ± 0.013
PheTyr: 0.542 ± 0.016
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 11.741 ± 0.093
GlyCys: 0.829 ± 0.021
GlyAsp: 5.254 ± 0.047
GlyGlu: 4.533 ± 0.049
GlyPhe: 2.759 ± 0.038
GlyGly: 9.738 ± 0.115
GlyHis: 2.324 ± 0.033
GlyIle: 3.372 ± 0.042
GlyLys: 1.603 ± 0.028
GlyLeu: 9.56 ± 0.08
GlyMet: 1.852 ± 0.033
GlyAsn: 1.505 ± 0.028
GlyPro: 6.471 ± 0.079
GlyGln: 2.881 ± 0.04
GlyArg: 8.993 ± 0.07
GlySer: 5.826 ± 0.057
GlyThr: 6.194 ± 0.061
GlyVal: 7.611 ± 0.062
GlyTrp: 1.796 ± 0.03
GlyTyr: 2.141 ± 0.037
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.625 ± 0.032
HisCys: 0.195 ± 0.01
HisAsp: 1.349 ± 0.026
HisGlu: 1.046 ± 0.02
HisPhe: 0.563 ± 0.016
HisGly: 2.276 ± 0.035
HisHis: 0.707 ± 0.022
HisIle: 0.632 ± 0.017
HisLys: 0.205 ± 0.01
HisLeu: 2.412 ± 0.032
HisMet: 0.288 ± 0.011
HisAsn: 0.357 ± 0.012
HisPro: 1.939 ± 0.033
HisGln: 0.67 ± 0.018
HisArg: 2.279 ± 0.033
HisSer: 1.046 ± 0.019
HisThr: 1.235 ± 0.028
HisVal: 1.667 ± 0.028
HisTrp: 0.32 ± 0.012
HisTyr: 0.447 ± 0.014
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.866 ± 0.048
IleCys: 0.304 ± 0.011
IleAsp: 2.465 ± 0.035
IleGlu: 2.018 ± 0.032
IlePhe: 0.766 ± 0.02
IleGly: 3.474 ± 0.044
IleHis: 0.676 ± 0.017
IleIle: 1.092 ± 0.023
IleLys: 0.506 ± 0.018
IleLeu: 2.7 ± 0.038
IleMet: 0.482 ± 0.016
IleAsn: 0.695 ± 0.018
IlePro: 1.978 ± 0.029
IleGln: 0.742 ± 0.018
IleArg: 2.666 ± 0.034
IleSer: 1.927 ± 0.03
IleThr: 2.207 ± 0.037
IleVal: 3.031 ± 0.041
IleTrp: 0.4 ± 0.013
IleTyr: 0.592 ± 0.015
IleXaa: 0.0 ± 0.0

Lys
LysAla: 1.841 ± 0.031
LysCys: 0.079 ± 0.006
LysAsp: 0.739 ± 0.023
LysGlu: 0.694 ± 0.018
LysPhe: 0.313 ± 0.01
LysGly: 1.062 ± 0.024
LysHis: 0.293 ± 0.012
LysIle: 0.798 ± 0.02
LysLys: 0.393 ± 0.013
LysLeu: 1.325 ± 0.026
LysMet: 0.266 ± 0.011
LysAsn: 0.316 ± 0.012
LysPro: 0.924 ± 0.024
LysGln: 0.405 ± 0.012
LysArg: 1.05 ± 0.025
LysSer: 0.764 ± 0.018
LysThr: 0.853 ± 0.021
LysVal: 1.403 ± 0.029
LysTrp: 0.155 ± 0.008
LysTyr: 0.29 ± 0.011
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 16.056 ± 0.119
LeuCys: 0.801 ± 0.018
LeuAsp: 6.929 ± 0.065
LeuGlu: 4.38 ± 0.045
LeuPhe: 2.513 ± 0.035
LeuGly: 9.038 ± 0.076
LeuHis: 2.104 ± 0.031
LeuIle: 3.149 ± 0.044
LeuLys: 1.188 ± 0.028
LeuLeu: 10.647 ± 0.096
LeuMet: 1.332 ± 0.026
LeuAsn: 1.528 ± 0.028
LeuPro: 6.345 ± 0.055
LeuGln: 1.812 ± 0.029
LeuArg: 9.154 ± 0.082
LeuSer: 5.087 ± 0.047
LeuThr: 6.59 ± 0.058
LeuVal: 9.06 ± 0.072
LeuTrp: 1.195 ± 0.023
LeuTyr: 1.574 ± 0.026
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.108 ± 0.029
MetCys: 0.15 ± 0.007
MetAsp: 0.815 ± 0.019
MetGlu: 0.599 ± 0.017
MetPhe: 0.462 ± 0.014
MetGly: 1.127 ± 0.024
MetHis: 0.315 ± 0.012
MetIle: 0.816 ± 0.018
MetLys: 0.289 ± 0.012
MetLeu: 1.657 ± 0.027
MetMet: 0.291 ± 0.011
MetAsn: 0.384 ± 0.013
MetPro: 1.133 ± 0.023
MetGln: 0.347 ± 0.012
MetArg: 1.414 ± 0.026
MetSer: 1.229 ± 0.023
MetThr: 1.474 ± 0.023
MetVal: 1.283 ± 0.023
MetTrp: 0.193 ± 0.008
MetTyr: 0.25 ± 0.009
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.1 ± 0.033
AsnCys: 0.136 ± 0.008
AsnAsp: 0.863 ± 0.021
AsnGlu: 0.765 ± 0.017
AsnPhe: 0.426 ± 0.013
AsnGly: 1.771 ± 0.033
AsnHis: 0.391 ± 0.013
AsnIle: 0.683 ± 0.017
AsnLys: 0.244 ± 0.01
AsnLeu: 1.724 ± 0.031
AsnMet: 0.255 ± 0.01
AsnAsn: 0.378 ± 0.014
AsnPro: 1.387 ± 0.024
AsnGln: 0.481 ± 0.014
AsnArg: 1.374 ± 0.025
AsnSer: 0.833 ± 0.016
AsnThr: 1.034 ± 0.023
AsnVal: 1.248 ± 0.026
AsnTrp: 0.264 ± 0.011
AsnTyr: 0.342 ± 0.013
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 10.785 ± 0.113
ProCys: 0.365 ± 0.013
ProAsp: 4.959 ± 0.05
ProGlu: 3.712 ± 0.041
ProPhe: 1.629 ± 0.025
ProGly: 7.792 ± 0.086
ProHis: 1.337 ± 0.027
ProIle: 1.642 ± 0.029
ProLys: 0.86 ± 0.022
ProLeu: 5.083 ± 0.053
ProMet: 1.032 ± 0.021
ProAsn: 1.005 ± 0.022
ProPro: 4.997 ± 0.08
ProGln: 1.401 ± 0.026
ProArg: 4.925 ± 0.057
ProSer: 3.759 ± 0.046
ProThr: 4.073 ± 0.042
ProVal: 5.717 ± 0.053
ProTrp: 0.891 ± 0.02
ProTyr: 1.081 ± 0.024
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.76 ± 0.041
GlnCys: 0.168 ± 0.009
GlnAsp: 1.163 ± 0.025
GlnGlu: 1.159 ± 0.023
GlnPhe: 0.694 ± 0.018
GlnGly: 1.827 ± 0.033
GlnHis: 0.578 ± 0.015
GlnIle: 1.19 ± 0.022
GlnLys: 0.385 ± 0.013
GlnLeu: 2.803 ± 0.04
GlnMet: 0.451 ± 0.014
GlnAsn: 0.467 ± 0.014
GlnPro: 1.896 ± 0.036
GlnGln: 0.978 ± 0.028
GlnArg: 2.451 ± 0.03
GlnSer: 1.125 ± 0.02
GlnThr: 1.363 ± 0.026
GlnVal: 2.496 ± 0.033
GlnTrp: 0.399 ± 0.012
GlnTyr: 0.493 ± 0.015
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 11.222 ± 0.096
ArgCys: 0.715 ± 0.018
ArgAsp: 4.785 ± 0.048
ArgGlu: 4.38 ± 0.047
ArgPhe: 2.456 ± 0.031
ArgGly: 6.651 ± 0.055
ArgHis: 2.282 ± 0.028
ArgIle: 3.303 ± 0.038
ArgLys: 1.169 ± 0.024
ArgLeu: 9.336 ± 0.076
ArgMet: 1.789 ± 0.028
ArgAsn: 1.4 ± 0.026
ArgPro: 6.344 ± 0.066
ArgGln: 2.574 ± 0.039
ArgArg: 9.745 ± 0.094
ArgSer: 4.82 ± 0.048
ArgThr: 5.06 ± 0.048
ArgVal: 6.59 ± 0.059
ArgTrp: 1.562 ± 0.03
ArgTyr: 1.835 ± 0.035
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 7.099 ± 0.059
SerCys: 0.432 ± 0.015
SerAsp: 2.89 ± 0.037
SerGlu: 2.344 ± 0.03
SerPhe: 1.547 ± 0.024
SerGly: 6.112 ± 0.062
SerHis: 1.01 ± 0.021
SerIle: 1.621 ± 0.025
SerLys: 0.732 ± 0.019
SerLeu: 4.676 ± 0.051
SerMet: 1.064 ± 0.019
SerAsn: 0.821 ± 0.018
SerPro: 3.809 ± 0.038
SerGln: 1.298 ± 0.025
SerArg: 4.361 ± 0.04
SerSer: 3.189 ± 0.048
SerThr: 3.466 ± 0.039
SerVal: 4.31 ± 0.044
SerTrp: 0.905 ± 0.019
SerTyr: 1.142 ± 0.021
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 8.771 ± 0.07
ThrCys: 0.477 ± 0.014
ThrAsp: 3.643 ± 0.041
ThrGlu: 2.907 ± 0.035
ThrPhe: 1.603 ± 0.024
ThrGly: 6.813 ± 0.069
ThrHis: 1.149 ± 0.022
ThrIle: 2.037 ± 0.036
ThrLys: 0.852 ± 0.021
ThrLeu: 5.519 ± 0.053
ThrMet: 1.023 ± 0.025
ThrAsn: 1.012 ± 0.02
ThrPro: 4.243 ± 0.054
ThrGln: 1.28 ± 0.023
ThrArg: 4.481 ± 0.049
ThrSer: 3.553 ± 0.039
ThrThr: 3.96 ± 0.049
ThrVal: 5.678 ± 0.05
ThrTrp: 0.881 ± 0.019
ThrTyr: 1.098 ± 0.023
ThrXaa: 0.0 ± 0.0

Val
ValAla: 11.655 ± 0.082
ValCys: 0.743 ± 0.018
ValAsp: 5.74 ± 0.054
ValGlu: 4.707 ± 0.055
ValPhe: 2.295 ± 0.031
ValGly: 7.247 ± 0.059
ValHis: 1.822 ± 0.027
ValIle: 3.062 ± 0.043
ValLys: 1.113 ± 0.024
ValLeu: 9.279 ± 0.067
ValMet: 1.28 ± 0.025
ValAsn: 1.663 ± 0.029
ValPro: 5.392 ± 0.055
ValGln: 1.858 ± 0.026
ValArg: 7.102 ± 0.058
ValSer: 4.504 ± 0.051
ValThr: 5.616 ± 0.049
ValVal: 8.323 ± 0.071
ValTrp: 1.063 ± 0.02
ValTyr: 1.378 ± 0.023
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.63 ± 0.025
TrpCys: 0.163 ± 0.008
TrpAsp: 0.825 ± 0.022
TrpGlu: 0.691 ± 0.017
TrpPhe: 0.428 ± 0.014
TrpGly: 0.923 ± 0.019
TrpHis: 0.391 ± 0.013
TrpIle: 0.54 ± 0.015
TrpLys: 0.258 ± 0.011
TrpLeu: 1.659 ± 0.028
TrpMet: 0.28 ± 0.011
TrpAsn: 0.33 ± 0.012
TrpPro: 0.884 ± 0.02
TrpGln: 0.46 ± 0.011
TrpArg: 1.498 ± 0.027
TrpSer: 0.953 ± 0.02
TrpThr: 0.976 ± 0.023
TrpVal: 0.997 ± 0.019
TrpTrp: 0.333 ± 0.013
TrpTyr: 0.29 ± 0.011
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.184 ± 0.029
TyrCys: 0.176 ± 0.008
TyrAsp: 1.167 ± 0.025
TyrGlu: 0.966 ± 0.021
TyrPhe: 0.591 ± 0.016
TyrGly: 1.807 ± 0.031
TyrHis: 0.402 ± 0.013
TyrIle: 0.491 ± 0.017
TyrLys: 0.254 ± 0.011
TyrLeu: 2.124 ± 0.029
TyrMet: 0.2 ± 0.009
TyrAsn: 0.362 ± 0.014
TyrPro: 1.131 ± 0.024
TyrGln: 0.641 ± 0.019
TyrArg: 1.791 ± 0.026
TyrSer: 0.938 ± 0.022
TyrThr: 0.986 ± 0.021
TyrVal: 1.415 ± 0.026
TyrTrp: 0.302 ± 0.011
TyrTyr: 0.397 ± 0.014
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 7133 proteins (2476290 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
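A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate estimates is reported. This is only an illustration under stated assumptions. Reading "x100" as 100 replicates, reporting the sample standard deviation as the ± value, using one-letter sequences, and the names `pair_per_mille` and `bootstrap_error` are all assumptions rather than details confirmed by the source.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping windows."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the Frankia proteome.
toy_proteome = ["MAALAG", "MAAGR", "MKLAAV", "MGGRP"]
print(pair_per_mille(toy_proteome, "AA"), bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual dipeptides keeps the residues of each protein together, so the error reflects variation between proteins.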

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski