Amino acid dipepetide frequency for Paenibacillus larvae subsp. larvae DSM 25430

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.885AlaAla: 6.885 ± 0.126
0.767AlaCys: 0.767 ± 0.028
3.786AlaAsp: 3.786 ± 0.062
5.244AlaGlu: 5.244 ± 0.092
3.073AlaPhe: 3.073 ± 0.05
6.142AlaGly: 6.142 ± 0.097
1.302AlaHis: 1.302 ± 0.037
5.119AlaIle: 5.119 ± 0.071
4.58AlaLys: 4.58 ± 0.077
7.381AlaLeu: 7.381 ± 0.1
2.194AlaMet: 2.194 ± 0.053
2.425AlaAsn: 2.425 ± 0.054
2.187AlaPro: 2.187 ± 0.05
2.283AlaGln: 2.283 ± 0.045
3.296AlaArg: 3.296 ± 0.063
4.425AlaSer: 4.425 ± 0.071
2.924AlaThr: 2.924 ± 0.059
5.864AlaVal: 5.864 ± 0.092
0.787AlaTrp: 0.787 ± 0.03
2.457AlaTyr: 2.457 ± 0.05
0.0AlaXaa: 0.0 ± 0.0
Cys
0.502CysAla: 0.502 ± 0.022
0.152CysCys: 0.152 ± 0.011
0.396CysAsp: 0.396 ± 0.021
0.503CysGlu: 0.503 ± 0.019
0.423CysPhe: 0.423 ± 0.021
0.815CysGly: 0.815 ± 0.032
0.198CysHis: 0.198 ± 0.015
0.662CysIle: 0.662 ± 0.029
0.54CysLys: 0.54 ± 0.022
0.975CysLeu: 0.975 ± 0.036
0.265CysMet: 0.265 ± 0.017
0.286CysAsn: 0.286 ± 0.018
0.487CysPro: 0.487 ± 0.024
0.317CysGln: 0.317 ± 0.016
0.513CysArg: 0.513 ± 0.026
0.676CysSer: 0.676 ± 0.024
0.552CysThr: 0.552 ± 0.025
0.514CysVal: 0.514 ± 0.021
0.099CysTrp: 0.099 ± 0.009
0.312CysTyr: 0.312 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
3.365AspAla: 3.365 ± 0.062
0.401AspCys: 0.401 ± 0.021
2.063AspAsp: 2.063 ± 0.048
3.753AspGlu: 3.753 ± 0.071
2.137AspPhe: 2.137 ± 0.052
3.377AspGly: 3.377 ± 0.064
1.248AspHis: 1.248 ± 0.037
3.728AspIle: 3.728 ± 0.062
3.211AspLys: 3.211 ± 0.069
4.971AspLeu: 4.971 ± 0.075
1.397AspMet: 1.397 ± 0.038
1.62AspAsn: 1.62 ± 0.043
2.269AspPro: 2.269 ± 0.049
2.004AspGln: 2.004 ± 0.055
2.702AspArg: 2.702 ± 0.05
2.607AspSer: 2.607 ± 0.05
2.327AspThr: 2.327 ± 0.054
3.372AspVal: 3.372 ± 0.062
0.697AspTrp: 0.697 ± 0.031
1.902AspTyr: 1.902 ± 0.047
0.0AspXaa: 0.0 ± 0.0
Glu
5.667GluAla: 5.667 ± 0.079
0.485GluCys: 0.485 ± 0.022
3.451GluAsp: 3.451 ± 0.061
6.413GluGlu: 6.413 ± 0.101
2.214GluPhe: 2.214 ± 0.047
4.454GluGly: 4.454 ± 0.08
1.719GluHis: 1.719 ± 0.069
4.784GluIle: 4.784 ± 0.078
5.247GluLys: 5.247 ± 0.086
7.051GluLeu: 7.051 ± 0.098
2.217GluMet: 2.217 ± 0.048
2.777GluAsn: 2.777 ± 0.05
2.248GluPro: 2.248 ± 0.052
3.937GluGln: 3.937 ± 0.086
4.017GluArg: 4.017 ± 0.072
3.483GluSer: 3.483 ± 0.064
3.652GluThr: 3.652 ± 0.061
4.93GluVal: 4.93 ± 0.078
0.93GluTrp: 0.93 ± 0.034
2.206GluTyr: 2.206 ± 0.052
0.0GluXaa: 0.0 ± 0.0
Phe
3.002PheAla: 3.002 ± 0.05
0.437PheCys: 0.437 ± 0.019
2.036PheAsp: 2.036 ± 0.052
2.398PheGlu: 2.398 ± 0.056
2.034PhePhe: 2.034 ± 0.058
3.055PheGly: 3.055 ± 0.055
1.106PheHis: 1.106 ± 0.033
3.223PheIle: 3.223 ± 0.076
2.053PheLys: 2.053 ± 0.049
4.304PheLeu: 4.304 ± 0.094
1.185PheMet: 1.185 ± 0.034
1.479PheAsn: 1.479 ± 0.041
1.651PhePro: 1.651 ± 0.042
1.426PheGln: 1.426 ± 0.037
1.931PheArg: 1.931 ± 0.047
2.825PheSer: 2.825 ± 0.063
2.468PheThr: 2.468 ± 0.051
2.962PheVal: 2.962 ± 0.06
0.494PheTrp: 0.494 ± 0.028
1.498PheTyr: 1.498 ± 0.038
0.0PheXaa: 0.0 ± 0.0
Gly
4.83GlyAla: 4.83 ± 0.082
0.841GlyCys: 0.841 ± 0.035
3.168GlyAsp: 3.168 ± 0.064
4.593GlyGlu: 4.593 ± 0.075
3.299GlyPhe: 3.299 ± 0.065
5.173GlyGly: 5.173 ± 0.088
1.56GlyHis: 1.56 ± 0.041
6.155GlyIle: 6.155 ± 0.087
5.247GlyLys: 5.247 ± 0.08
7.088GlyLeu: 7.088 ± 0.1
2.341GlyMet: 2.341 ± 0.05
2.774GlyAsn: 2.774 ± 0.052
1.945GlyPro: 1.945 ± 0.063
2.543GlyGln: 2.543 ± 0.06
3.521GlyArg: 3.521 ± 0.07
4.211GlySer: 4.211 ± 0.072
4.29GlyThr: 4.29 ± 0.063
5.101GlyVal: 5.101 ± 0.083
0.895GlyTrp: 0.895 ± 0.033
2.722GlyTyr: 2.722 ± 0.058
0.0GlyXaa: 0.0 ± 0.0
His
1.569HisAla: 1.569 ± 0.045
0.235HisCys: 0.235 ± 0.015
1.048HisAsp: 1.048 ± 0.031
1.413HisGlu: 1.413 ± 0.037
1.054HisPhe: 1.054 ± 0.035
1.51HisGly: 1.51 ± 0.048
0.691HisHis: 0.691 ± 0.032
1.597HisIle: 1.597 ± 0.051
1.148HisLys: 1.148 ± 0.036
2.42HisLeu: 2.42 ± 0.105
0.602HisMet: 0.602 ± 0.025
0.762HisAsn: 0.762 ± 0.03
1.27HisPro: 1.27 ± 0.037
0.892HisGln: 0.892 ± 0.033
1.077HisArg: 1.077 ± 0.031
1.349HisSer: 1.349 ± 0.036
1.204HisThr: 1.204 ± 0.035
1.596HisVal: 1.596 ± 0.038
0.24HisTrp: 0.24 ± 0.013
0.827HisTyr: 0.827 ± 0.028
0.0HisXaa: 0.0 ± 0.0
Ile
5.348IleAla: 5.348 ± 0.075
0.806IleCys: 0.806 ± 0.03
3.434IleAsp: 3.434 ± 0.062
4.696IleGlu: 4.696 ± 0.075
2.676IlePhe: 2.676 ± 0.053
5.705IleGly: 5.705 ± 0.099
1.79IleHis: 1.79 ± 0.041
4.741IleIle: 4.741 ± 0.084
3.731IleLys: 3.731 ± 0.054
6.733IleLeu: 6.733 ± 0.094
1.865IleMet: 1.865 ± 0.045
2.434IleAsn: 2.434 ± 0.05
3.456IlePro: 3.456 ± 0.058
2.976IleGln: 2.976 ± 0.059
4.314IleArg: 4.314 ± 0.063
4.683IleSer: 4.683 ± 0.078
3.901IleThr: 3.901 ± 0.06
4.954IleVal: 4.954 ± 0.083
0.691IleTrp: 0.691 ± 0.023
2.223IleTyr: 2.223 ± 0.054
0.0IleXaa: 0.0 ± 0.0
Lys
4.856LysAla: 4.856 ± 0.076
0.336LysCys: 0.336 ± 0.02
3.404LysAsp: 3.404 ± 0.063
5.943LysGlu: 5.943 ± 0.107
1.751LysPhe: 1.751 ± 0.041
4.35LysGly: 4.35 ± 0.067
1.348LysHis: 1.348 ± 0.035
3.885LysIle: 3.885 ± 0.066
4.817LysLys: 4.817 ± 0.078
5.941LysLeu: 5.941 ± 0.079
1.922LysMet: 1.922 ± 0.052
2.544LysAsn: 2.544 ± 0.051
2.586LysPro: 2.586 ± 0.058
3.354LysGln: 3.354 ± 0.069
3.492LysArg: 3.492 ± 0.064
3.293LysSer: 3.293 ± 0.06
3.415LysThr: 3.415 ± 0.065
4.165LysVal: 4.165 ± 0.063
0.813LysTrp: 0.813 ± 0.03
1.994LysTyr: 1.994 ± 0.049
0.0LysXaa: 0.0 ± 0.0
Leu
7.73LeuAla: 7.73 ± 0.11
0.948LeuCys: 0.948 ± 0.031
5.249LeuAsp: 5.249 ± 0.074
6.717LeuGlu: 6.717 ± 0.093
4.76LeuPhe: 4.76 ± 0.086
6.836LeuGly: 6.836 ± 0.082
2.331LeuHis: 2.331 ± 0.045
6.823LeuIle: 6.823 ± 0.112
6.076LeuLys: 6.076 ± 0.077
10.464LeuLeu: 10.464 ± 0.137
2.693LeuMet: 2.693 ± 0.054
3.946LeuAsn: 3.946 ± 0.062
4.337LeuPro: 4.337 ± 0.078
3.819LeuGln: 3.819 ± 0.064
4.583LeuArg: 4.583 ± 0.077
6.753LeuSer: 6.753 ± 0.086
5.382LeuThr: 5.382 ± 0.08
6.461LeuVal: 6.461 ± 0.092
0.918LeuTrp: 0.918 ± 0.029
3.122LeuTyr: 3.122 ± 0.067
0.0LeuXaa: 0.0 ± 0.0
Met
2.172MetAla: 2.172 ± 0.053
0.188MetCys: 0.188 ± 0.013
1.527MetAsp: 1.527 ± 0.045
2.3MetGlu: 2.3 ± 0.045
1.087MetPhe: 1.087 ± 0.031
1.956MetGly: 1.956 ± 0.051
0.51MetHis: 0.51 ± 0.02
2.178MetIle: 2.178 ± 0.045
2.477MetLys: 2.477 ± 0.051
2.792MetLeu: 2.792 ± 0.057
1.008MetMet: 1.008 ± 0.035
1.618MetAsn: 1.618 ± 0.041
1.063MetPro: 1.063 ± 0.034
1.018MetGln: 1.018 ± 0.033
1.299MetArg: 1.299 ± 0.038
1.683MetSer: 1.683 ± 0.041
1.619MetThr: 1.619 ± 0.042
1.818MetVal: 1.818 ± 0.047
0.204MetTrp: 0.204 ± 0.015
0.837MetTyr: 0.837 ± 0.025
0.0MetXaa: 0.0 ± 0.0
Asn
2.36AsnAla: 2.36 ± 0.05
0.343AsnCys: 0.343 ± 0.019
1.603AsnAsp: 1.603 ± 0.038
2.721AsnGlu: 2.721 ± 0.056
1.248AsnPhe: 1.248 ± 0.033
2.95AsnGly: 2.95 ± 0.058
0.891AsnHis: 0.891 ± 0.028
2.597AsnIle: 2.597 ± 0.047
2.614AsnLys: 2.614 ± 0.057
3.535AsnLeu: 3.535 ± 0.064
1.165AsnMet: 1.165 ± 0.032
1.507AsnAsn: 1.507 ± 0.047
2.056AsnPro: 2.056 ± 0.049
1.777AsnGln: 1.777 ± 0.045
2.355AsnArg: 2.355 ± 0.047
2.054AsnSer: 2.054 ± 0.051
1.885AsnThr: 1.885 ± 0.043
2.559AsnVal: 2.559 ± 0.057
0.513AsnTrp: 0.513 ± 0.024
1.2AsnTyr: 1.2 ± 0.036
0.0AsnXaa: 0.0 ± 0.0
Pro
2.876ProAla: 2.876 ± 0.06
0.294ProCys: 0.294 ± 0.022
2.709ProAsp: 2.709 ± 0.056
3.484ProGlu: 3.484 ± 0.063
1.951ProPhe: 1.951 ± 0.045
2.907ProGly: 2.907 ± 0.061
0.882ProHis: 0.882 ± 0.029
2.415ProIle: 2.415 ± 0.052
2.281ProLys: 2.281 ± 0.045
3.727ProLeu: 3.727 ± 0.065
0.909ProMet: 0.909 ± 0.03
1.587ProAsn: 1.587 ± 0.042
1.155ProPro: 1.155 ± 0.034
1.272ProGln: 1.272 ± 0.054
1.415ProArg: 1.415 ± 0.041
2.526ProSer: 2.526 ± 0.049
1.814ProThr: 1.814 ± 0.044
3.457ProVal: 3.457 ± 0.061
0.421ProTrp: 0.421 ± 0.021
1.573ProTyr: 1.573 ± 0.04
0.0ProXaa: 0.0 ± 0.0
Gln
3.255GlnAla: 3.255 ± 0.063
0.256GlnCys: 0.256 ± 0.017
1.939GlnAsp: 1.939 ± 0.045
2.935GlnGlu: 2.935 ± 0.069
1.471GlnPhe: 1.471 ± 0.041
2.625GlnGly: 2.625 ± 0.088
0.85GlnHis: 0.85 ± 0.033
2.872GlnIle: 2.872 ± 0.059
2.658GlnLys: 2.658 ± 0.06
4.058GlnLeu: 4.058 ± 0.066
1.253GlnMet: 1.253 ± 0.039
1.469GlnAsn: 1.469 ± 0.034
1.407GlnPro: 1.407 ± 0.041
1.799GlnGln: 1.799 ± 0.053
1.81GlnArg: 1.81 ± 0.047
2.193GlnSer: 2.193 ± 0.085
2.212GlnThr: 2.212 ± 0.043
2.808GlnVal: 2.808 ± 0.06
0.386GlnTrp: 0.386 ± 0.021
1.321GlnTyr: 1.321 ± 0.039
0.0GlnXaa: 0.0 ± 0.0
Arg
2.814ArgAla: 2.814 ± 0.058
0.466ArgCys: 0.466 ± 0.021
2.269ArgAsp: 2.269 ± 0.048
3.924ArgGlu: 3.924 ± 0.066
2.138ArgPhe: 2.138 ± 0.051
3.098ArgGly: 3.098 ± 0.067
1.153ArgHis: 1.153 ± 0.04
4.035ArgIle: 4.035 ± 0.065
3.881ArgLys: 3.881 ± 0.068
5.06ArgLeu: 5.06 ± 0.073
1.681ArgMet: 1.681 ± 0.045
2.181ArgAsn: 2.181 ± 0.043
1.726ArgPro: 1.726 ± 0.041
2.137ArgGln: 2.137 ± 0.051
2.781ArgArg: 2.781 ± 0.067
2.848ArgSer: 2.848 ± 0.052
2.617ArgThr: 2.617 ± 0.05
3.224ArgVal: 3.224 ± 0.06
0.596ArgTrp: 0.596 ± 0.023
1.842ArgTyr: 1.842 ± 0.047
0.0ArgXaa: 0.0 ± 0.0
Ser
4.034SerAla: 4.034 ± 0.078
0.581SerCys: 0.581 ± 0.027
2.737SerAsp: 2.737 ± 0.053
3.646SerGlu: 3.646 ± 0.066
3.015SerPhe: 3.015 ± 0.057
4.97SerGly: 4.97 ± 0.078
1.331SerHis: 1.331 ± 0.036
4.425SerIle: 4.425 ± 0.081
3.56SerLys: 3.56 ± 0.061
6.264SerLeu: 6.264 ± 0.09
1.851SerMet: 1.851 ± 0.043
2.013SerAsn: 2.013 ± 0.049
2.447SerPro: 2.447 ± 0.051
2.023SerGln: 2.023 ± 0.046
3.031SerArg: 3.031 ± 0.059
4.161SerSer: 4.161 ± 0.083
2.924SerThr: 2.924 ± 0.056
4.356SerVal: 4.356 ± 0.079
0.762SerTrp: 0.762 ± 0.027
2.191SerTyr: 2.191 ± 0.046
0.0SerXaa: 0.0 ± 0.0
Thr
4.199ThrAla: 4.199 ± 0.072
0.454ThrCys: 0.454 ± 0.022
2.625ThrAsp: 2.625 ± 0.05
3.545ThrGlu: 3.545 ± 0.067
2.255ThrPhe: 2.255 ± 0.047
4.625ThrGly: 4.625 ± 0.081
1.054ThrHis: 1.054 ± 0.035
3.705ThrIle: 3.705 ± 0.067
2.89ThrLys: 2.89 ± 0.059
5.045ThrLeu: 5.045 ± 0.076
1.363ThrMet: 1.363 ± 0.04
1.882ThrAsn: 1.882 ± 0.044
2.407ThrPro: 2.407 ± 0.052
1.495ThrGln: 1.495 ± 0.036
2.396ThrArg: 2.396 ± 0.059
3.163ThrSer: 3.163 ± 0.053
2.552ThrThr: 2.552 ± 0.065
4.16ThrVal: 4.16 ± 0.062
0.574ThrTrp: 0.574 ± 0.023
1.929ThrTyr: 1.929 ± 0.041
0.0ThrXaa: 0.0 ± 0.0
Val
4.731ValAla: 4.731 ± 0.082
0.731ValCys: 0.731 ± 0.028
3.44ValAsp: 3.44 ± 0.066
4.549ValGlu: 4.549 ± 0.071
2.909ValPhe: 2.909 ± 0.061
4.522ValGly: 4.522 ± 0.083
1.559ValHis: 1.559 ± 0.042
5.116ValIle: 5.116 ± 0.076
4.475ValLys: 4.475 ± 0.069
7.429ValLeu: 7.429 ± 0.084
2.014ValMet: 2.014 ± 0.044
2.786ValAsn: 2.786 ± 0.057
3.036ValPro: 3.036 ± 0.056
2.673ValGln: 2.673 ± 0.057
3.455ValArg: 3.455 ± 0.06
4.692ValSer: 4.692 ± 0.075
4.153ValThr: 4.153 ± 0.071
4.662ValVal: 4.662 ± 0.075
0.742ValTrp: 0.742 ± 0.027
2.286ValTyr: 2.286 ± 0.051
0.0ValXaa: 0.0 ± 0.0
Trp
0.672TrpAla: 0.672 ± 0.028
0.097TrpCys: 0.097 ± 0.01
0.58TrpAsp: 0.58 ± 0.027
0.728TrpGlu: 0.728 ± 0.024
0.553TrpPhe: 0.553 ± 0.022
0.703TrpGly: 0.703 ± 0.026
0.225TrpHis: 0.225 ± 0.018
0.903TrpIle: 0.903 ± 0.032
0.843TrpLys: 0.843 ± 0.03
1.32TrpLeu: 1.32 ± 0.044
0.437TrpMet: 0.437 ± 0.021
0.562TrpAsn: 0.562 ± 0.023
0.289TrpPro: 0.289 ± 0.017
0.412TrpGln: 0.412 ± 0.02
0.511TrpArg: 0.511 ± 0.025
0.681TrpSer: 0.681 ± 0.026
0.569TrpThr: 0.569 ± 0.022
0.764TrpVal: 0.764 ± 0.029
0.174TrpTrp: 0.174 ± 0.014
0.33TrpTyr: 0.33 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.258TyrAla: 2.258 ± 0.051
0.369TyrCys: 0.369 ± 0.022
1.69TyrAsp: 1.69 ± 0.044
2.378TyrGlu: 2.378 ± 0.049
1.516TyrPhe: 1.516 ± 0.042
2.526TyrGly: 2.526 ± 0.056
0.758TyrHis: 0.758 ± 0.031
2.192TyrIle: 2.192 ± 0.047
1.88TyrLys: 1.88 ± 0.043
3.39TyrLeu: 3.39 ± 0.053
0.991TyrMet: 0.991 ± 0.033
1.278TyrAsn: 1.278 ± 0.041
1.633TyrPro: 1.633 ± 0.048
1.419TyrGln: 1.419 ± 0.036
1.972TyrArg: 1.972 ± 0.048
2.016TyrSer: 2.016 ± 0.047
1.857TyrThr: 1.857 ± 0.041
2.233TyrVal: 2.233 ± 0.042
0.417TyrTrp: 0.417 ± 0.021
1.258TyrTyr: 1.258 ± 0.037
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3693 proteins (1008502 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski