Amino acid dipepetide frequency for Paenibacillus oryzae

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.191AlaAla: 10.191 ± 0.121
0.77AlaCys: 0.77 ± 0.023
4.359AlaAsp: 4.359 ± 0.05
6.185AlaGlu: 6.185 ± 0.074
3.562AlaPhe: 3.562 ± 0.048
7.642AlaGly: 7.642 ± 0.104
1.354AlaHis: 1.354 ± 0.026
5.833AlaIle: 5.833 ± 0.068
4.286AlaLys: 4.286 ± 0.049
9.077AlaLeu: 9.077 ± 0.095
2.42AlaMet: 2.42 ± 0.042
2.744AlaAsn: 2.744 ± 0.047
2.895AlaPro: 2.895 ± 0.055
2.669AlaGln: 2.669 ± 0.042
3.825AlaArg: 3.825 ± 0.055
5.818AlaSer: 5.818 ± 0.073
3.764AlaThr: 3.764 ± 0.058
6.616AlaVal: 6.616 ± 0.066
0.997AlaTrp: 0.997 ± 0.027
2.724AlaTyr: 2.724 ± 0.042
0.0AlaXaa: 0.0 ± 0.0
Cys
0.508CysAla: 0.508 ± 0.017
0.114CysCys: 0.114 ± 0.009
0.398CysAsp: 0.398 ± 0.016
0.444CysGlu: 0.444 ± 0.016
0.314CysPhe: 0.314 ± 0.016
0.771CysGly: 0.771 ± 0.024
0.183CysHis: 0.183 ± 0.011
0.484CysIle: 0.484 ± 0.017
0.314CysLys: 0.314 ± 0.014
0.783CysLeu: 0.783 ± 0.023
0.207CysMet: 0.207 ± 0.011
0.255CysAsn: 0.255 ± 0.011
0.348CysPro: 0.348 ± 0.016
0.222CysGln: 0.222 ± 0.013
0.458CysArg: 0.458 ± 0.016
0.599CysSer: 0.599 ± 0.019
0.37CysThr: 0.37 ± 0.015
0.432CysVal: 0.432 ± 0.017
0.103CysTrp: 0.103 ± 0.008
0.264CysTyr: 0.264 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
4.097AspAla: 4.097 ± 0.055
0.365AspCys: 0.365 ± 0.015
2.421AspAsp: 2.421 ± 0.052
3.989AspGlu: 3.989 ± 0.052
2.073AspPhe: 2.073 ± 0.038
4.343AspGly: 4.343 ± 0.064
0.987AspHis: 0.987 ± 0.025
3.611AspIle: 3.611 ± 0.055
2.81AspLys: 2.81 ± 0.046
4.474AspLeu: 4.474 ± 0.057
1.48AspMet: 1.48 ± 0.027
1.911AspAsn: 1.911 ± 0.039
2.085AspPro: 2.085 ± 0.041
1.64AspGln: 1.64 ± 0.037
2.661AspArg: 2.661 ± 0.044
2.986AspSer: 2.986 ± 0.048
2.303AspThr: 2.303 ± 0.04
3.374AspVal: 3.374 ± 0.047
0.82AspTrp: 0.82 ± 0.024
2.172AspTyr: 2.172 ± 0.036
0.0AspXaa: 0.0 ± 0.0
Glu
6.902GluAla: 6.902 ± 0.08
0.371GluCys: 0.371 ± 0.015
3.083GluAsp: 3.083 ± 0.045
5.469GluGlu: 5.469 ± 0.074
1.918GluPhe: 1.918 ± 0.033
4.976GluGly: 4.976 ± 0.05
1.466GluHis: 1.466 ± 0.028
4.069GluIle: 4.069 ± 0.053
3.897GluLys: 3.897 ± 0.057
8.179GluLeu: 8.179 ± 0.076
2.049GluMet: 2.049 ± 0.038
2.542GluAsn: 2.542 ± 0.042
2.387GluPro: 2.387 ± 0.037
3.624GluGln: 3.624 ± 0.052
4.095GluArg: 4.095 ± 0.06
3.688GluSer: 3.688 ± 0.052
3.278GluThr: 3.278 ± 0.048
4.243GluVal: 4.243 ± 0.057
1.023GluTrp: 1.023 ± 0.024
1.8GluTyr: 1.8 ± 0.033
0.0GluXaa: 0.0 ± 0.0
Phe
3.322PheAla: 3.322 ± 0.045
0.361PheCys: 0.361 ± 0.018
2.17PheAsp: 2.17 ± 0.038
2.163PheGlu: 2.163 ± 0.033
1.763PhePhe: 1.763 ± 0.04
3.29PheGly: 3.29 ± 0.045
0.873PheHis: 0.873 ± 0.023
2.705PheIle: 2.705 ± 0.042
1.959PheLys: 1.959 ± 0.03
3.823PheLeu: 3.823 ± 0.054
1.14PheMet: 1.14 ± 0.03
1.598PheAsn: 1.598 ± 0.033
1.537PhePro: 1.537 ± 0.029
1.559PheGln: 1.559 ± 0.029
2.094PheArg: 2.094 ± 0.034
2.978PheSer: 2.978 ± 0.042
2.228PheThr: 2.228 ± 0.037
3.023PheVal: 3.023 ± 0.104
0.53PheTrp: 0.53 ± 0.018
1.518PheTyr: 1.518 ± 0.034
0.0PheXaa: 0.0 ± 0.0
Gly
6.409GlyAla: 6.409 ± 0.081
0.691GlyCys: 0.691 ± 0.024
3.771GlyAsp: 3.771 ± 0.047
5.252GlyGlu: 5.252 ± 0.06
3.377GlyPhe: 3.377 ± 0.045
6.294GlyGly: 6.294 ± 0.105
1.503GlyHis: 1.503 ± 0.032
5.839GlyIle: 5.839 ± 0.055
4.807GlyLys: 4.807 ± 0.051
7.735GlyLeu: 7.735 ± 0.083
2.507GlyMet: 2.507 ± 0.039
2.97GlyAsn: 2.97 ± 0.055
1.922GlyPro: 1.922 ± 0.05
2.77GlyGln: 2.77 ± 0.054
3.626GlyArg: 3.626 ± 0.06
5.147GlySer: 5.147 ± 0.068
4.158GlyThr: 4.158 ± 0.06
5.276GlyVal: 5.276 ± 0.062
1.124GlyTrp: 1.124 ± 0.027
3.062GlyTyr: 3.062 ± 0.038
0.0GlyXaa: 0.0 ± 0.0
His
1.597HisAla: 1.597 ± 0.032
0.213HisCys: 0.213 ± 0.011
1.0HisAsp: 1.0 ± 0.023
1.288HisGlu: 1.288 ± 0.029
1.026HisPhe: 1.026 ± 0.024
1.542HisGly: 1.542 ± 0.036
0.558HisHis: 0.558 ± 0.021
1.341HisIle: 1.341 ± 0.028
0.846HisLys: 0.846 ± 0.02
1.939HisLeu: 1.939 ± 0.037
0.551HisMet: 0.551 ± 0.016
0.677HisAsn: 0.677 ± 0.018
1.1HisPro: 1.1 ± 0.025
0.675HisGln: 0.675 ± 0.022
1.024HisArg: 1.024 ± 0.024
1.195HisSer: 1.195 ± 0.027
0.976HisThr: 0.976 ± 0.025
1.356HisVal: 1.356 ± 0.027
0.305HisTrp: 0.305 ± 0.015
0.976HisTyr: 0.976 ± 0.023
0.0HisXaa: 0.0 ± 0.0
Ile
6.287IleAla: 6.287 ± 0.068
0.564IleCys: 0.564 ± 0.021
3.42IleAsp: 3.42 ± 0.046
3.939IleGlu: 3.939 ± 0.052
2.225IlePhe: 2.225 ± 0.038
5.511IleGly: 5.511 ± 0.064
1.363IleHis: 1.363 ± 0.03
4.129IleIle: 4.129 ± 0.056
3.199IleLys: 3.199 ± 0.052
5.363IleLeu: 5.363 ± 0.06
1.685IleMet: 1.685 ± 0.034
2.504IleAsn: 2.504 ± 0.041
2.861IlePro: 2.861 ± 0.044
2.487IleGln: 2.487 ± 0.044
3.3IleArg: 3.3 ± 0.05
4.573IleSer: 4.573 ± 0.049
3.586IleThr: 3.586 ± 0.054
5.082IleVal: 5.082 ± 0.055
0.74IleTrp: 0.74 ± 0.022
2.031IleTyr: 2.031 ± 0.036
0.0IleXaa: 0.0 ± 0.0
Lys
4.772LysAla: 4.772 ± 0.056
0.258LysCys: 0.258 ± 0.012
2.73LysAsp: 2.73 ± 0.042
4.358LysGlu: 4.358 ± 0.058
1.254LysPhe: 1.254 ± 0.026
3.773LysGly: 3.773 ± 0.047
1.029LysHis: 1.029 ± 0.025
2.777LysIle: 2.777 ± 0.039
2.976LysLys: 2.976 ± 0.052
5.84LysLeu: 5.84 ± 0.061
1.503LysMet: 1.503 ± 0.034
1.874LysAsn: 1.874 ± 0.04
2.37LysPro: 2.37 ± 0.035
2.371LysGln: 2.371 ± 0.045
3.028LysArg: 3.028 ± 0.049
3.023LysSer: 3.023 ± 0.047
2.765LysThr: 2.765 ± 0.04
3.277LysVal: 3.277 ± 0.047
0.731LysTrp: 0.731 ± 0.026
1.511LysTyr: 1.511 ± 0.03
0.0LysXaa: 0.0 ± 0.0
Leu
9.085LeuAla: 9.085 ± 0.086
0.822LeuCys: 0.822 ± 0.022
5.338LeuAsp: 5.338 ± 0.058
6.881LeuGlu: 6.881 ± 0.068
4.728LeuPhe: 4.728 ± 0.071
6.916LeuGly: 6.916 ± 0.075
2.092LeuHis: 2.092 ± 0.037
6.263LeuIle: 6.263 ± 0.073
5.407LeuLys: 5.407 ± 0.058
12.326LeuLeu: 12.326 ± 0.14
2.734LeuMet: 2.734 ± 0.045
3.871LeuAsn: 3.871 ± 0.06
4.725LeuPro: 4.725 ± 0.069
4.227LeuGln: 4.227 ± 0.053
5.118LeuArg: 5.118 ± 0.067
7.237LeuSer: 7.237 ± 0.08
5.506LeuThr: 5.506 ± 0.06
6.254LeuVal: 6.254 ± 0.069
1.097LeuTrp: 1.097 ± 0.029
3.426LeuTyr: 3.426 ± 0.046
0.0LeuXaa: 0.0 ± 0.0
Met
2.429MetAla: 2.429 ± 0.046
0.142MetCys: 0.142 ± 0.008
1.49MetAsp: 1.49 ± 0.031
2.044MetGlu: 2.044 ± 0.035
0.962MetPhe: 0.962 ± 0.026
1.781MetGly: 1.781 ± 0.035
0.455MetHis: 0.455 ± 0.016
1.787MetIle: 1.787 ± 0.032
1.946MetLys: 1.946 ± 0.036
3.291MetLeu: 3.291 ± 0.046
0.885MetMet: 0.885 ± 0.025
1.357MetAsn: 1.357 ± 0.028
1.193MetPro: 1.193 ± 0.029
0.906MetGln: 0.906 ± 0.022
1.239MetArg: 1.239 ± 0.026
1.833MetSer: 1.833 ± 0.031
1.703MetThr: 1.703 ± 0.031
1.689MetVal: 1.689 ± 0.037
0.225MetTrp: 0.225 ± 0.013
0.75MetTyr: 0.75 ± 0.022
0.0MetXaa: 0.0 ± 0.0
Asn
3.122AsnAla: 3.122 ± 0.049
0.265AsnCys: 0.265 ± 0.011
1.982AsnAsp: 1.982 ± 0.035
2.78AsnGlu: 2.78 ± 0.047
1.185AsnPhe: 1.185 ± 0.027
3.703AsnGly: 3.703 ± 0.065
0.808AsnHis: 0.808 ± 0.024
2.203AsnIle: 2.203 ± 0.037
2.053AsnLys: 2.053 ± 0.038
3.06AsnLeu: 3.06 ± 0.049
0.987AsnMet: 0.987 ± 0.024
1.596AsnAsn: 1.596 ± 0.036
2.004AsnPro: 2.004 ± 0.038
1.372AsnGln: 1.372 ± 0.03
2.064AsnArg: 2.064 ± 0.036
2.194AsnSer: 2.194 ± 0.046
1.847AsnThr: 1.847 ± 0.037
2.548AsnVal: 2.548 ± 0.04
0.551AsnTrp: 0.551 ± 0.019
1.296AsnTyr: 1.296 ± 0.026
0.0AsnXaa: 0.0 ± 0.0
Pro
3.5ProAla: 3.5 ± 0.044
0.259ProCys: 0.259 ± 0.013
2.594ProAsp: 2.594 ± 0.039
3.251ProGlu: 3.251 ± 0.05
1.927ProPhe: 1.927 ± 0.035
3.209ProGly: 3.209 ± 0.048
0.855ProHis: 0.855 ± 0.021
2.317ProIle: 2.317 ± 0.037
1.633ProLys: 1.633 ± 0.029
4.092ProLeu: 4.092 ± 0.061
0.91ProMet: 0.91 ± 0.024
1.299ProAsn: 1.299 ± 0.028
1.343ProPro: 1.343 ± 0.032
1.407ProGln: 1.407 ± 0.026
1.455ProArg: 1.455 ± 0.033
2.599ProSer: 2.599 ± 0.048
1.779ProThr: 1.779 ± 0.046
3.167ProVal: 3.167 ± 0.043
0.515ProTrp: 0.515 ± 0.018
1.548ProTyr: 1.548 ± 0.032
0.0ProXaa: 0.0 ± 0.0
Gln
3.396GlnAla: 3.396 ± 0.044
0.23GlnCys: 0.23 ± 0.011
1.619GlnAsp: 1.619 ± 0.039
2.705GlnGlu: 2.705 ± 0.05
1.558GlnPhe: 1.558 ± 0.031
2.66GlnGly: 2.66 ± 0.042
0.907GlnHis: 0.907 ± 0.023
2.201GlnIle: 2.201 ± 0.033
1.874GlnLys: 1.874 ± 0.033
4.216GlnLeu: 4.216 ± 0.055
1.035GlnMet: 1.035 ± 0.029
1.444GlnAsn: 1.444 ± 0.029
1.527GlnPro: 1.527 ± 0.029
1.803GlnGln: 1.803 ± 0.042
1.982GlnArg: 1.982 ± 0.04
2.454GlnSer: 2.454 ± 0.043
1.853GlnThr: 1.853 ± 0.031
2.124GlnVal: 2.124 ± 0.037
0.528GlnTrp: 0.528 ± 0.019
1.457GlnTyr: 1.457 ± 0.032
0.0GlnXaa: 0.0 ± 0.0
Arg
3.277ArgAla: 3.277 ± 0.049
0.366ArgCys: 0.366 ± 0.015
2.61ArgAsp: 2.61 ± 0.048
3.873ArgGlu: 3.873 ± 0.054
2.269ArgPhe: 2.269 ± 0.04
3.193ArgGly: 3.193 ± 0.052
1.189ArgHis: 1.189 ± 0.03
3.463ArgIle: 3.463 ± 0.053
3.012ArgLys: 3.012 ± 0.049
5.635ArgLeu: 5.635 ± 0.063
1.594ArgMet: 1.594 ± 0.035
2.105ArgAsn: 2.105 ± 0.035
1.698ArgPro: 1.698 ± 0.032
2.101ArgGln: 2.101 ± 0.037
2.841ArgArg: 2.841 ± 0.052
3.362ArgSer: 3.362 ± 0.046
2.349ArgThr: 2.349 ± 0.038
2.876ArgVal: 2.876 ± 0.042
0.63ArgTrp: 0.63 ± 0.02
2.022ArgTyr: 2.022 ± 0.033
0.001ArgXaa: 0.001 ± 0.001
Ser
5.255SerAla: 5.255 ± 0.065
0.473SerCys: 0.473 ± 0.015
3.291SerAsp: 3.291 ± 0.049
3.938SerGlu: 3.938 ± 0.049
3.143SerPhe: 3.143 ± 0.046
6.165SerGly: 6.165 ± 0.077
1.323SerHis: 1.323 ± 0.026
4.575SerIle: 4.575 ± 0.065
3.058SerLys: 3.058 ± 0.04
6.717SerLeu: 6.717 ± 0.074
1.81SerMet: 1.81 ± 0.034
2.375SerAsn: 2.375 ± 0.044
2.621SerPro: 2.621 ± 0.045
2.219SerGln: 2.219 ± 0.039
3.381SerArg: 3.381 ± 0.047
4.714SerSer: 4.714 ± 0.068
2.953SerThr: 2.953 ± 0.05
4.378SerVal: 4.378 ± 0.048
0.841SerTrp: 0.841 ± 0.025
2.391SerTyr: 2.391 ± 0.04
0.0SerXaa: 0.0 ± 0.0
Thr
4.804ThrAla: 4.804 ± 0.063
0.323ThrCys: 0.323 ± 0.015
2.482ThrAsp: 2.482 ± 0.044
3.103ThrGlu: 3.103 ± 0.051
2.216ThrPhe: 2.216 ± 0.038
4.56ThrGly: 4.56 ± 0.066
0.895ThrHis: 0.895 ± 0.024
3.556ThrIle: 3.556 ± 0.048
2.174ThrLys: 2.174 ± 0.036
5.269ThrLeu: 5.269 ± 0.059
1.28ThrMet: 1.28 ± 0.028
1.77ThrAsn: 1.77 ± 0.037
2.428ThrPro: 2.428 ± 0.05
1.335ThrGln: 1.335 ± 0.031
2.178ThrArg: 2.178 ± 0.034
3.126ThrSer: 3.126 ± 0.05
2.651ThrThr: 2.651 ± 0.042
4.087ThrVal: 4.087 ± 0.059
0.546ThrTrp: 0.546 ± 0.022
1.713ThrTyr: 1.713 ± 0.034
0.0ThrXaa: 0.0 ± 0.0
Val
5.311ValAla: 5.311 ± 0.057
0.586ValCys: 0.586 ± 0.018
3.398ValAsp: 3.398 ± 0.051
4.346ValGlu: 4.346 ± 0.053
2.963ValPhe: 2.963 ± 0.106
4.253ValGly: 4.253 ± 0.063
1.347ValHis: 1.347 ± 0.029
4.716ValIle: 4.716 ± 0.056
3.756ValLys: 3.756 ± 0.053
7.261ValLeu: 7.261 ± 0.071
1.956ValMet: 1.956 ± 0.036
2.679ValAsn: 2.679 ± 0.04
2.756ValPro: 2.756 ± 0.04
2.526ValGln: 2.526 ± 0.041
3.168ValArg: 3.168 ± 0.05
4.685ValSer: 4.685 ± 0.052
3.958ValThr: 3.958 ± 0.062
4.799ValVal: 4.799 ± 0.06
0.818ValTrp: 0.818 ± 0.025
2.373ValTyr: 2.373 ± 0.038
0.0ValXaa: 0.0 ± 0.0
Trp
0.784TrpAla: 0.784 ± 0.025
0.097TrpCys: 0.097 ± 0.008
0.641TrpAsp: 0.641 ± 0.02
0.762TrpGlu: 0.762 ± 0.023
0.593TrpPhe: 0.593 ± 0.022
0.851TrpGly: 0.851 ± 0.023
0.283TrpHis: 0.283 ± 0.015
0.873TrpIle: 0.873 ± 0.021
0.652TrpLys: 0.652 ± 0.019
1.617TrpLeu: 1.617 ± 0.035
0.414TrpMet: 0.414 ± 0.016
0.642TrpAsn: 0.642 ± 0.023
0.382TrpPro: 0.382 ± 0.015
0.502TrpGln: 0.502 ± 0.02
0.73TrpArg: 0.73 ± 0.022
0.963TrpSer: 0.963 ± 0.027
0.69TrpThr: 0.69 ± 0.023
0.703TrpVal: 0.703 ± 0.021
0.162TrpTrp: 0.162 ± 0.009
0.415TrpTyr: 0.415 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.681TyrAla: 2.681 ± 0.045
0.302TyrCys: 0.302 ± 0.014
1.802TyrAsp: 1.802 ± 0.035
2.289TyrGlu: 2.289 ± 0.036
1.577TyrPhe: 1.577 ± 0.031
2.807TyrGly: 2.807 ± 0.047
0.714TyrHis: 0.714 ± 0.023
2.108TyrIle: 2.108 ± 0.036
1.564TyrLys: 1.564 ± 0.033
3.358TyrLeu: 3.358 ± 0.046
0.959TyrMet: 0.959 ± 0.025
1.406TyrAsn: 1.406 ± 0.027
1.477TyrPro: 1.477 ± 0.033
1.157TyrGln: 1.157 ± 0.03
2.203TyrArg: 2.203 ± 0.039
2.446TyrSer: 2.446 ± 0.041
1.778TyrThr: 1.778 ± 0.035
2.349TyrVal: 2.349 ± 0.036
0.47TyrTrp: 0.47 ± 0.018
1.445TyrTyr: 1.445 ± 0.033
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.001XaaAsn: 0.001 ± 0.001
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5020 proteins (1677518 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski