Amino acid dipepetide frequency for Asticcacaulis benevestitus DSM 16100 = ATCC BAA-896

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.894AlaAla: 13.894 ± 0.141
1.051AlaCys: 1.051 ± 0.031
6.598AlaAsp: 6.598 ± 0.07
6.186AlaGlu: 6.186 ± 0.072
4.499AlaPhe: 4.499 ± 0.058
9.099AlaGly: 9.099 ± 0.103
2.413AlaHis: 2.413 ± 0.044
5.787AlaIle: 5.787 ± 0.072
4.837AlaLys: 4.837 ± 0.06
12.581AlaLeu: 12.581 ± 0.131
3.276AlaMet: 3.276 ± 0.054
3.278AlaAsn: 3.278 ± 0.06
5.335AlaPro: 5.335 ± 0.071
4.47AlaGln: 4.47 ± 0.07
6.998AlaArg: 6.998 ± 0.089
6.533AlaSer: 6.533 ± 0.082
6.072AlaThr: 6.072 ± 0.071
7.459AlaVal: 7.459 ± 0.084
1.449AlaTrp: 1.449 ± 0.035
3.128AlaTyr: 3.128 ± 0.05
0.0AlaXaa: 0.0 ± 0.0
Cys
0.972CysAla: 0.972 ± 0.03
0.093CysCys: 0.093 ± 0.007
0.533CysAsp: 0.533 ± 0.019
0.415CysGlu: 0.415 ± 0.018
0.27CysPhe: 0.27 ± 0.013
0.786CysGly: 0.786 ± 0.025
0.179CysHis: 0.179 ± 0.011
0.357CysIle: 0.357 ± 0.016
0.219CysLys: 0.219 ± 0.014
0.847CysLeu: 0.847 ± 0.027
0.157CysMet: 0.157 ± 0.01
0.23CysAsn: 0.23 ± 0.012
0.369CysPro: 0.369 ± 0.018
0.244CysGln: 0.244 ± 0.014
0.463CysArg: 0.463 ± 0.019
0.418CysSer: 0.418 ± 0.017
0.419CysThr: 0.419 ± 0.021
0.648CysVal: 0.648 ± 0.021
0.091CysTrp: 0.091 ± 0.008
0.214CysTyr: 0.214 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
6.65AspAla: 6.65 ± 0.07
0.456AspCys: 0.456 ± 0.019
3.48AspAsp: 3.48 ± 0.059
3.157AspGlu: 3.157 ± 0.056
2.46AspPhe: 2.46 ± 0.048
5.075AspGly: 5.075 ± 0.073
1.449AspHis: 1.449 ± 0.032
3.742AspIle: 3.742 ± 0.059
2.468AspLys: 2.468 ± 0.05
6.269AspLeu: 6.269 ± 0.072
1.631AspMet: 1.631 ± 0.036
1.737AspAsn: 1.737 ± 0.038
3.096AspPro: 3.096 ± 0.048
2.184AspGln: 2.184 ± 0.039
3.548AspArg: 3.548 ± 0.055
2.797AspSer: 2.797 ± 0.049
3.189AspThr: 3.189 ± 0.051
4.15AspVal: 4.15 ± 0.056
1.02AspTrp: 1.02 ± 0.027
1.998AspTyr: 1.998 ± 0.043
0.0AspXaa: 0.0 ± 0.0
Glu
7.193GluAla: 7.193 ± 0.084
0.268GluCys: 0.268 ± 0.013
2.868GluAsp: 2.868 ± 0.051
2.39GluGlu: 2.39 ± 0.052
1.613GluPhe: 1.613 ± 0.03
4.154GluGly: 4.154 ± 0.066
1.066GluHis: 1.066 ± 0.027
3.035GluIle: 3.035 ± 0.053
2.372GluLys: 2.372 ± 0.05
4.503GluLeu: 4.503 ± 0.078
1.341GluMet: 1.341 ± 0.03
1.63GluAsn: 1.63 ± 0.039
2.201GluPro: 2.201 ± 0.041
1.879GluGln: 1.879 ± 0.039
3.792GluArg: 3.792 ± 0.076
2.473GluSer: 2.473 ± 0.046
3.485GluThr: 3.485 ± 0.056
3.535GluVal: 3.535 ± 0.06
0.665GluTrp: 0.665 ± 0.022
1.048GluTyr: 1.048 ± 0.027
0.0GluXaa: 0.0 ± 0.0
Phe
4.175PheAla: 4.175 ± 0.061
0.367PheCys: 0.367 ± 0.016
2.914PheAsp: 2.914 ± 0.048
2.168PheGlu: 2.168 ± 0.043
1.408PhePhe: 1.408 ± 0.035
3.532PheGly: 3.532 ± 0.055
0.778PheHis: 0.778 ± 0.025
1.962PheIle: 1.962 ± 0.038
1.56PheLys: 1.56 ± 0.034
3.124PheLeu: 3.124 ± 0.059
0.928PheMet: 0.928 ± 0.024
1.468PheAsn: 1.468 ± 0.034
1.465PhePro: 1.465 ± 0.034
1.073PheGln: 1.073 ± 0.028
1.832PheArg: 1.832 ± 0.038
2.493PheSer: 2.493 ± 0.05
2.281PheThr: 2.281 ± 0.04
2.637PheVal: 2.637 ± 0.042
0.546PheTrp: 0.546 ± 0.022
1.069PheTyr: 1.069 ± 0.029
0.0PheXaa: 0.0 ± 0.0
Gly
8.384GlyAla: 8.384 ± 0.093
0.699GlyCys: 0.699 ± 0.026
4.638GlyAsp: 4.638 ± 0.062
4.12GlyGlu: 4.12 ± 0.053
3.693GlyPhe: 3.693 ± 0.059
6.734GlyGly: 6.734 ± 0.111
2.0GlyHis: 2.0 ± 0.044
4.152GlyIle: 4.152 ± 0.066
3.821GlyLys: 3.821 ± 0.064
8.787GlyLeu: 8.787 ± 0.106
2.105GlyMet: 2.105 ± 0.043
2.442GlyAsn: 2.442 ± 0.064
2.951GlyPro: 2.951 ± 0.044
3.127GlyGln: 3.127 ± 0.046
4.693GlyArg: 4.693 ± 0.068
4.355GlySer: 4.355 ± 0.077
4.439GlyThr: 4.439 ± 0.104
6.062GlyVal: 6.062 ± 0.07
1.338GlyTrp: 1.338 ± 0.034
2.718GlyTyr: 2.718 ± 0.046
0.0GlyXaa: 0.0 ± 0.0
His
2.189HisAla: 2.189 ± 0.04
0.192HisCys: 0.192 ± 0.01
1.42HisAsp: 1.42 ± 0.036
1.114HisGlu: 1.114 ± 0.03
0.883HisPhe: 0.883 ± 0.028
1.774HisGly: 1.774 ± 0.035
0.541HisHis: 0.541 ± 0.023
1.228HisIle: 1.228 ± 0.03
0.804HisLys: 0.804 ± 0.023
2.258HisLeu: 2.258 ± 0.043
0.619HisMet: 0.619 ± 0.022
0.665HisAsn: 0.665 ± 0.024
1.224HisPro: 1.224 ± 0.027
0.686HisGln: 0.686 ± 0.021
1.211HisArg: 1.211 ± 0.035
1.033HisSer: 1.033 ± 0.023
1.002HisThr: 1.002 ± 0.028
1.456HisVal: 1.456 ± 0.032
0.338HisTrp: 0.338 ± 0.014
0.656HisTyr: 0.656 ± 0.02
0.0HisXaa: 0.0 ± 0.0
Ile
6.295IleAla: 6.295 ± 0.086
0.509IleCys: 0.509 ± 0.019
3.889IleAsp: 3.889 ± 0.055
3.505IleGlu: 3.505 ± 0.058
1.764IlePhe: 1.764 ± 0.04
4.687IleGly: 4.687 ± 0.076
0.999IleHis: 0.999 ± 0.028
2.604IleIle: 2.604 ± 0.05
2.175IleLys: 2.175 ± 0.038
4.585IleLeu: 4.585 ± 0.07
1.217IleMet: 1.217 ± 0.033
1.842IleAsn: 1.842 ± 0.04
2.196IlePro: 2.196 ± 0.042
1.352IleGln: 1.352 ± 0.032
3.032IleArg: 3.032 ± 0.051
3.525IleSer: 3.525 ± 0.059
3.173IleThr: 3.173 ± 0.052
3.871IleVal: 3.871 ± 0.062
0.714IleTrp: 0.714 ± 0.023
1.416IleTyr: 1.416 ± 0.031
0.0IleXaa: 0.0 ± 0.0
Lys
5.943LysAla: 5.943 ± 0.065
0.214LysCys: 0.214 ± 0.012
2.435LysAsp: 2.435 ± 0.048
1.609LysGlu: 1.609 ± 0.04
1.168LysPhe: 1.168 ± 0.029
3.451LysGly: 3.451 ± 0.055
0.739LysHis: 0.739 ± 0.024
2.183LysIle: 2.183 ± 0.042
1.854LysLys: 1.854 ± 0.044
4.029LysLeu: 4.029 ± 0.054
1.002LysMet: 1.002 ± 0.028
1.318LysAsn: 1.318 ± 0.031
2.712LysPro: 2.712 ± 0.054
1.251LysGln: 1.251 ± 0.029
2.646LysArg: 2.646 ± 0.044
2.631LysSer: 2.631 ± 0.039
3.007LysThr: 3.007 ± 0.05
2.982LysVal: 2.982 ± 0.051
0.531LysTrp: 0.531 ± 0.019
0.921LysTyr: 0.921 ± 0.03
0.0LysXaa: 0.0 ± 0.0
Leu
11.114LeuAla: 11.114 ± 0.107
0.873LeuCys: 0.873 ± 0.026
5.894LeuAsp: 5.894 ± 0.067
4.917LeuGlu: 4.917 ± 0.079
3.539LeuPhe: 3.539 ± 0.061
7.618LeuGly: 7.618 ± 0.078
1.944LeuHis: 1.944 ± 0.041
5.673LeuIle: 5.673 ± 0.082
5.264LeuLys: 5.264 ± 0.072
8.455LeuLeu: 8.455 ± 0.097
2.534LeuMet: 2.534 ± 0.047
3.507LeuAsn: 3.507 ± 0.058
5.073LeuPro: 5.073 ± 0.065
2.974LeuGln: 2.974 ± 0.046
5.791LeuArg: 5.791 ± 0.078
7.407LeuSer: 7.407 ± 0.104
6.485LeuThr: 6.485 ± 0.129
6.255LeuVal: 6.255 ± 0.078
1.297LeuTrp: 1.297 ± 0.035
2.45LeuTyr: 2.45 ± 0.048
0.0LeuXaa: 0.0 ± 0.0
Met
3.233MetAla: 3.233 ± 0.056
0.163MetCys: 0.163 ± 0.009
1.253MetAsp: 1.253 ± 0.031
1.055MetGlu: 1.055 ± 0.026
0.72MetPhe: 0.72 ± 0.025
1.914MetGly: 1.914 ± 0.034
0.44MetHis: 0.44 ± 0.018
1.37MetIle: 1.37 ± 0.036
1.22MetLys: 1.22 ± 0.031
2.137MetLeu: 2.137 ± 0.046
0.677MetMet: 0.677 ± 0.02
0.867MetAsn: 0.867 ± 0.023
1.492MetPro: 1.492 ± 0.033
0.859MetGln: 0.859 ± 0.025
1.553MetArg: 1.553 ± 0.037
1.937MetSer: 1.937 ± 0.038
2.051MetThr: 2.051 ± 0.036
1.602MetVal: 1.602 ± 0.037
0.246MetTrp: 0.246 ± 0.013
0.392MetTyr: 0.392 ± 0.015
0.0MetXaa: 0.0 ± 0.0
Asn
3.638AsnAla: 3.638 ± 0.061
0.231AsnCys: 0.231 ± 0.014
1.879AsnAsp: 1.879 ± 0.044
1.452AsnGlu: 1.452 ± 0.036
1.181AsnPhe: 1.181 ± 0.031
2.91AsnGly: 2.91 ± 0.061
0.653AsnHis: 0.653 ± 0.022
1.759AsnIle: 1.759 ± 0.039
1.029AsnLys: 1.029 ± 0.032
3.377AsnLeu: 3.377 ± 0.065
0.746AsnMet: 0.746 ± 0.023
1.017AsnAsn: 1.017 ± 0.032
1.989AsnPro: 1.989 ± 0.042
0.94AsnGln: 0.94 ± 0.028
1.885AsnArg: 1.885 ± 0.038
1.725AsnSer: 1.725 ± 0.041
1.747AsnThr: 1.747 ± 0.036
2.151AsnVal: 2.151 ± 0.042
0.535AsnTrp: 0.535 ± 0.021
0.977AsnTyr: 0.977 ± 0.035
0.0AsnXaa: 0.0 ± 0.0
Pro
5.459ProAla: 5.459 ± 0.074
0.315ProCys: 0.315 ± 0.016
3.652ProAsp: 3.652 ± 0.053
3.294ProGlu: 3.294 ± 0.057
1.921ProPhe: 1.921 ± 0.042
3.548ProGly: 3.548 ± 0.053
1.069ProHis: 1.069 ± 0.028
2.304ProIle: 2.304 ± 0.041
2.144ProLys: 2.144 ± 0.045
4.487ProLeu: 4.487 ± 0.057
1.07ProMet: 1.07 ± 0.027
1.555ProAsn: 1.555 ± 0.027
2.097ProPro: 2.097 ± 0.049
1.777ProGln: 1.777 ± 0.035
2.233ProArg: 2.233 ± 0.047
2.857ProSer: 2.857 ± 0.049
2.509ProThr: 2.509 ± 0.049
4.031ProVal: 4.031 ± 0.064
0.664ProTrp: 0.664 ± 0.023
1.399ProTyr: 1.399 ± 0.032
0.0ProXaa: 0.0 ± 0.0
Gln
4.443GlnAla: 4.443 ± 0.063
0.207GlnCys: 0.207 ± 0.011
1.693GlnAsp: 1.693 ± 0.033
1.293GlnGlu: 1.293 ± 0.032
1.168GlnPhe: 1.168 ± 0.03
2.653GlnGly: 2.653 ± 0.044
0.697GlnHis: 0.697 ± 0.023
2.136GlnIle: 2.136 ± 0.062
1.524GlnLys: 1.524 ± 0.039
3.027GlnLeu: 3.027 ± 0.067
0.961GlnMet: 0.961 ± 0.032
1.187GlnAsn: 1.187 ± 0.033
1.705GlnPro: 1.705 ± 0.041
1.248GlnGln: 1.248 ± 0.035
2.094GlnArg: 2.094 ± 0.038
2.229GlnSer: 2.229 ± 0.041
2.442GlnThr: 2.442 ± 0.047
2.58GlnVal: 2.58 ± 0.047
0.506GlnTrp: 0.506 ± 0.019
0.82GlnTyr: 0.82 ± 0.028
0.0GlnXaa: 0.0 ± 0.0
Arg
6.097ArgAla: 6.097 ± 0.085
0.4ArgCys: 0.4 ± 0.017
3.749ArgAsp: 3.749 ± 0.065
3.164ArgGlu: 3.164 ± 0.064
2.638ArgPhe: 2.638 ± 0.045
3.845ArgGly: 3.845 ± 0.056
1.504ArgHis: 1.504 ± 0.033
3.36ArgIle: 3.36 ± 0.052
2.531ArgLys: 2.531 ± 0.051
6.989ArgLeu: 6.989 ± 0.088
1.58ArgMet: 1.58 ± 0.038
1.794ArgAsn: 1.794 ± 0.032
2.887ArgPro: 2.887 ± 0.058
2.315ArgGln: 2.315 ± 0.044
3.897ArgArg: 3.897 ± 0.07
3.193ArgSer: 3.193 ± 0.054
3.066ArgThr: 3.066 ± 0.052
3.929ArgVal: 3.929 ± 0.055
0.811ArgTrp: 0.811 ± 0.025
1.769ArgTyr: 1.769 ± 0.04
0.0ArgXaa: 0.0 ± 0.0
Ser
6.836SerAla: 6.836 ± 0.077
0.4SerCys: 0.4 ± 0.019
3.762SerAsp: 3.762 ± 0.058
3.153SerGlu: 3.153 ± 0.046
2.376SerPhe: 2.376 ± 0.04
5.719SerGly: 5.719 ± 0.093
1.354SerHis: 1.354 ± 0.032
2.817SerIle: 2.817 ± 0.052
2.204SerLys: 2.204 ± 0.046
6.347SerLeu: 6.347 ± 0.071
1.356SerMet: 1.356 ± 0.028
1.818SerAsn: 1.818 ± 0.046
2.944SerPro: 2.944 ± 0.047
2.233SerGln: 2.233 ± 0.044
3.38SerArg: 3.38 ± 0.057
3.429SerSer: 3.429 ± 0.063
3.086SerThr: 3.086 ± 0.092
4.187SerVal: 4.187 ± 0.055
0.793SerTrp: 0.793 ± 0.023
1.757SerTyr: 1.757 ± 0.047
0.0SerXaa: 0.0 ± 0.0
Thr
6.493ThrAla: 6.493 ± 0.081
0.461ThrCys: 0.461 ± 0.02
3.218ThrAsp: 3.218 ± 0.055
2.694ThrGlu: 2.694 ± 0.044
2.19ThrPhe: 2.19 ± 0.043
5.396ThrGly: 5.396 ± 0.091
1.219ThrHis: 1.219 ± 0.033
2.893ThrIle: 2.893 ± 0.052
2.04ThrLys: 2.04 ± 0.038
6.434ThrLeu: 6.434 ± 0.08
1.106ThrMet: 1.106 ± 0.027
1.679ThrAsn: 1.679 ± 0.05
3.691ThrPro: 3.691 ± 0.061
2.253ThrGln: 2.253 ± 0.107
3.246ThrArg: 3.246 ± 0.048
3.554ThrSer: 3.554 ± 0.083
3.614ThrThr: 3.614 ± 0.136
4.224ThrVal: 4.224 ± 0.063
0.775ThrTrp: 0.775 ± 0.023
1.821ThrTyr: 1.821 ± 0.045
0.0ThrXaa: 0.0 ± 0.0
Val
7.77ValAla: 7.77 ± 0.08
0.652ValCys: 0.652 ± 0.019
3.977ValAsp: 3.977 ± 0.054
3.794ValGlu: 3.794 ± 0.062
2.691ValPhe: 2.691 ± 0.048
4.902ValGly: 4.902 ± 0.068
1.333ValHis: 1.333 ± 0.03
4.028ValIle: 4.028 ± 0.057
2.971ValLys: 2.971 ± 0.051
6.664ValLeu: 6.664 ± 0.078
1.879ValMet: 1.879 ± 0.043
2.235ValAsn: 2.235 ± 0.045
3.096ValPro: 3.096 ± 0.047
2.121ValGln: 2.121 ± 0.037
4.326ValArg: 4.326 ± 0.068
4.729ValSer: 4.729 ± 0.065
4.692ValThr: 4.692 ± 0.064
5.232ValVal: 5.232 ± 0.072
0.971ValTrp: 0.971 ± 0.029
1.714ValTyr: 1.714 ± 0.03
0.0ValXaa: 0.0 ± 0.0
Trp
1.209TrpAla: 1.209 ± 0.032
0.14TrpCys: 0.14 ± 0.01
0.701TrpAsp: 0.701 ± 0.025
0.546TrpGlu: 0.546 ± 0.021
0.576TrpPhe: 0.576 ± 0.022
1.02TrpGly: 1.02 ± 0.03
0.315TrpHis: 0.315 ± 0.015
0.698TrpIle: 0.698 ± 0.022
0.579TrpLys: 0.579 ± 0.022
1.695TrpLeu: 1.695 ± 0.038
0.377TrpMet: 0.377 ± 0.018
0.506TrpAsn: 0.506 ± 0.019
0.698TrpPro: 0.698 ± 0.024
0.679TrpGln: 0.679 ± 0.021
1.116TrpArg: 1.116 ± 0.031
0.921TrpSer: 0.921 ± 0.027
0.771TrpThr: 0.771 ± 0.023
0.821TrpVal: 0.821 ± 0.024
0.236TrpTrp: 0.236 ± 0.014
0.351TrpTyr: 0.351 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.955TyrAla: 2.955 ± 0.05
0.235TyrCys: 0.235 ± 0.013
2.004TyrAsp: 2.004 ± 0.046
1.484TyrGlu: 1.484 ± 0.035
1.024TyrPhe: 1.024 ± 0.03
2.439TyrGly: 2.439 ± 0.051
0.602TyrHis: 0.602 ± 0.019
1.264TyrIle: 1.264 ± 0.033
1.011TyrLys: 1.011 ± 0.029
2.469TyrLeu: 2.469 ± 0.041
0.618TyrMet: 0.618 ± 0.023
1.042TyrAsn: 1.042 ± 0.038
1.165TyrPro: 1.165 ± 0.031
0.947TyrGln: 0.947 ± 0.029
1.784TyrArg: 1.784 ± 0.039
1.708TyrSer: 1.708 ± 0.039
1.47TyrThr: 1.47 ± 0.039
1.966TyrVal: 1.966 ± 0.035
0.43TyrTrp: 0.43 ± 0.018
0.878TyrTyr: 0.878 ± 0.036
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4617 proteins (1445798 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski