Amino acid dipepetide frequency for Paenibacillus lentus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.733AlaAla: 7.733 ± 0.11
0.665AlaCys: 0.665 ± 0.022
3.905AlaAsp: 3.905 ± 0.059
5.766AlaGlu: 5.766 ± 0.076
2.994AlaPhe: 2.994 ± 0.049
6.136AlaGly: 6.136 ± 0.082
1.367AlaHis: 1.367 ± 0.037
5.283AlaIle: 5.283 ± 0.074
4.196AlaLys: 4.196 ± 0.048
7.847AlaLeu: 7.847 ± 0.088
2.31AlaMet: 2.31 ± 0.038
2.663AlaAsn: 2.663 ± 0.045
2.527AlaPro: 2.527 ± 0.046
2.517AlaGln: 2.517 ± 0.049
3.437AlaArg: 3.437 ± 0.055
4.787AlaSer: 4.787 ± 0.062
3.509AlaThr: 3.509 ± 0.055
5.964AlaVal: 5.964 ± 0.073
0.873AlaTrp: 0.873 ± 0.032
2.631AlaTyr: 2.631 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
0.471CysAla: 0.471 ± 0.019
0.116CysCys: 0.116 ± 0.008
0.398CysAsp: 0.398 ± 0.02
0.429CysGlu: 0.429 ± 0.018
0.294CysPhe: 0.294 ± 0.016
0.753CysGly: 0.753 ± 0.023
0.181CysHis: 0.181 ± 0.011
0.506CysIle: 0.506 ± 0.022
0.331CysLys: 0.331 ± 0.016
0.75CysLeu: 0.75 ± 0.024
0.198CysMet: 0.198 ± 0.012
0.322CysAsn: 0.322 ± 0.015
0.336CysPro: 0.336 ± 0.017
0.23CysGln: 0.23 ± 0.013
0.444CysArg: 0.444 ± 0.02
0.569CysSer: 0.569 ± 0.02
0.373CysThr: 0.373 ± 0.018
0.454CysVal: 0.454 ± 0.019
0.087CysTrp: 0.087 ± 0.008
0.268CysTyr: 0.268 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
3.543AspAla: 3.543 ± 0.063
0.389AspCys: 0.389 ± 0.017
2.494AspAsp: 2.494 ± 0.05
4.026AspGlu: 4.026 ± 0.059
2.247AspPhe: 2.247 ± 0.042
3.817AspGly: 3.817 ± 0.063
1.206AspHis: 1.206 ± 0.029
3.91AspIle: 3.91 ± 0.053
2.759AspLys: 2.759 ± 0.041
5.022AspLeu: 5.022 ± 0.063
1.428AspMet: 1.428 ± 0.032
1.922AspAsn: 1.922 ± 0.039
2.266AspPro: 2.266 ± 0.043
1.998AspGln: 1.998 ± 0.038
2.653AspArg: 2.653 ± 0.044
2.921AspSer: 2.921 ± 0.05
2.496AspThr: 2.496 ± 0.04
3.447AspVal: 3.447 ± 0.052
0.794AspTrp: 0.794 ± 0.024
2.215AspTyr: 2.215 ± 0.037
0.0AspXaa: 0.0 ± 0.0
Glu
6.107GluAla: 6.107 ± 0.078
0.382GluCys: 0.382 ± 0.019
3.392GluAsp: 3.392 ± 0.055
6.034GluGlu: 6.034 ± 0.08
2.344GluPhe: 2.344 ± 0.041
4.702GluGly: 4.702 ± 0.067
1.671GluHis: 1.671 ± 0.043
4.957GluIle: 4.957 ± 0.061
4.17GluLys: 4.17 ± 0.067
7.689GluLeu: 7.689 ± 0.089
2.162GluMet: 2.162 ± 0.044
2.612GluAsn: 2.612 ± 0.045
2.339GluPro: 2.339 ± 0.041
4.056GluGln: 4.056 ± 0.067
4.116GluArg: 4.116 ± 0.076
3.662GluSer: 3.662 ± 0.057
3.145GluThr: 3.145 ± 0.05
4.859GluVal: 4.859 ± 0.057
0.971GluTrp: 0.971 ± 0.024
2.332GluTyr: 2.332 ± 0.053
0.0GluXaa: 0.0 ± 0.0
Phe
3.038PheAla: 3.038 ± 0.048
0.34PheCys: 0.34 ± 0.017
2.363PheAsp: 2.363 ± 0.047
2.594PheGlu: 2.594 ± 0.052
1.792PhePhe: 1.792 ± 0.041
3.178PheGly: 3.178 ± 0.054
0.92PheHis: 0.92 ± 0.025
3.131PheIle: 3.131 ± 0.056
2.122PheLys: 2.122 ± 0.041
3.699PheLeu: 3.699 ± 0.064
1.2PheMet: 1.2 ± 0.035
1.678PheAsn: 1.678 ± 0.035
1.482PhePro: 1.482 ± 0.033
1.457PheGln: 1.457 ± 0.033
1.965PheArg: 1.965 ± 0.04
2.791PheSer: 2.791 ± 0.054
2.396PheThr: 2.396 ± 0.041
2.903PheVal: 2.903 ± 0.055
0.515PheTrp: 0.515 ± 0.02
1.481PheTyr: 1.481 ± 0.034
0.0PheXaa: 0.0 ± 0.0
Gly
5.363GlyAla: 5.363 ± 0.079
0.662GlyCys: 0.662 ± 0.023
3.54GlyAsp: 3.54 ± 0.056
4.826GlyGlu: 4.826 ± 0.063
3.062GlyPhe: 3.062 ± 0.05
5.394GlyGly: 5.394 ± 0.082
1.499GlyHis: 1.499 ± 0.038
5.813GlyIle: 5.813 ± 0.066
4.417GlyLys: 4.417 ± 0.06
7.012GlyLeu: 7.012 ± 0.08
2.253GlyMet: 2.253 ± 0.046
2.857GlyAsn: 2.857 ± 0.058
1.688GlyPro: 1.688 ± 0.032
2.873GlyGln: 2.873 ± 0.051
3.505GlyArg: 3.505 ± 0.055
4.617GlySer: 4.617 ± 0.064
4.159GlyThr: 4.159 ± 0.065
5.181GlyVal: 5.181 ± 0.061
0.937GlyTrp: 0.937 ± 0.03
3.007GlyTyr: 3.007 ± 0.044
0.0GlyXaa: 0.0 ± 0.0
His
1.441HisAla: 1.441 ± 0.035
0.214HisCys: 0.214 ± 0.013
1.026HisAsp: 1.026 ± 0.027
1.362HisGlu: 1.362 ± 0.034
1.041HisPhe: 1.041 ± 0.027
1.566HisGly: 1.566 ± 0.036
0.634HisHis: 0.634 ± 0.024
1.517HisIle: 1.517 ± 0.037
0.936HisLys: 0.936 ± 0.027
2.127HisLeu: 2.127 ± 0.057
0.535HisMet: 0.535 ± 0.022
0.759HisAsn: 0.759 ± 0.022
1.225HisPro: 1.225 ± 0.033
0.852HisGln: 0.852 ± 0.026
1.164HisArg: 1.164 ± 0.031
1.218HisSer: 1.218 ± 0.034
1.025HisThr: 1.025 ± 0.026
1.384HisVal: 1.384 ± 0.033
0.303HisTrp: 0.303 ± 0.016
0.903HisTyr: 0.903 ± 0.026
0.0HisXaa: 0.0 ± 0.0
Ile
5.824IleAla: 5.824 ± 0.075
0.634IleCys: 0.634 ± 0.022
3.802IleAsp: 3.802 ± 0.052
4.743IleGlu: 4.743 ± 0.062
2.615IlePhe: 2.615 ± 0.047
5.32IleGly: 5.32 ± 0.076
1.58IleHis: 1.58 ± 0.037
4.751IleIle: 4.751 ± 0.077
3.331IleLys: 3.331 ± 0.06
6.187IleLeu: 6.187 ± 0.085
1.777IleMet: 1.777 ± 0.033
2.636IleAsn: 2.636 ± 0.05
3.275IlePro: 3.275 ± 0.049
2.77IleGln: 2.77 ± 0.048
3.688IleArg: 3.688 ± 0.061
4.887IleSer: 4.887 ± 0.06
4.051IleThr: 4.051 ± 0.056
5.314IleVal: 5.314 ± 0.066
0.765IleTrp: 0.765 ± 0.025
2.416IleTyr: 2.416 ± 0.039
0.0IleXaa: 0.0 ± 0.0
Lys
4.082LysAla: 4.082 ± 0.054
0.276LysCys: 0.276 ± 0.015
3.023LysAsp: 3.023 ± 0.048
4.579LysGlu: 4.579 ± 0.067
1.717LysPhe: 1.717 ± 0.035
3.631LysGly: 3.631 ± 0.062
1.132LysHis: 1.132 ± 0.03
3.466LysIle: 3.466 ± 0.059
3.366LysLys: 3.366 ± 0.062
5.666LysLeu: 5.666 ± 0.064
1.648LysMet: 1.648 ± 0.037
2.197LysAsn: 2.197 ± 0.047
2.059LysPro: 2.059 ± 0.037
2.685LysGln: 2.685 ± 0.05
2.779LysArg: 2.779 ± 0.047
3.061LysSer: 3.061 ± 0.055
2.758LysThr: 2.758 ± 0.05
3.808LysVal: 3.808 ± 0.069
0.741LysTrp: 0.741 ± 0.025
2.045LysTyr: 2.045 ± 0.045
0.0LysXaa: 0.0 ± 0.0
Leu
7.945LeuAla: 7.945 ± 0.087
0.835LeuCys: 0.835 ± 0.029
5.187LeuAsp: 5.187 ± 0.068
6.789LeuGlu: 6.789 ± 0.081
4.451LeuPhe: 4.451 ± 0.076
6.769LeuGly: 6.769 ± 0.084
2.152LeuHis: 2.152 ± 0.048
6.666LeuIle: 6.666 ± 0.07
5.367LeuLys: 5.367 ± 0.072
10.775LeuLeu: 10.775 ± 0.126
2.582LeuMet: 2.582 ± 0.038
4.122LeuAsn: 4.122 ± 0.055
4.377LeuPro: 4.377 ± 0.062
4.299LeuGln: 4.299 ± 0.069
5.057LeuArg: 5.057 ± 0.063
7.131LeuSer: 7.131 ± 0.077
5.559LeuThr: 5.559 ± 0.072
6.266LeuVal: 6.266 ± 0.077
0.989LeuTrp: 0.989 ± 0.03
3.222LeuTyr: 3.222 ± 0.053
0.0LeuXaa: 0.0 ± 0.0
Met
2.156MetAla: 2.156 ± 0.038
0.151MetCys: 0.151 ± 0.008
1.571MetAsp: 1.571 ± 0.034
2.008MetGlu: 2.008 ± 0.039
1.062MetPhe: 1.062 ± 0.031
1.708MetGly: 1.708 ± 0.04
0.46MetHis: 0.46 ± 0.018
2.103MetIle: 2.103 ± 0.042
2.029MetLys: 2.029 ± 0.041
2.93MetLeu: 2.93 ± 0.046
0.871MetMet: 0.871 ± 0.029
1.566MetAsn: 1.566 ± 0.032
1.102MetPro: 1.102 ± 0.033
1.073MetGln: 1.073 ± 0.03
1.287MetArg: 1.287 ± 0.031
1.806MetSer: 1.806 ± 0.039
1.689MetThr: 1.689 ± 0.035
1.746MetVal: 1.746 ± 0.038
0.236MetTrp: 0.236 ± 0.014
0.783MetTyr: 0.783 ± 0.026
0.0MetXaa: 0.0 ± 0.0
Asn
2.785AsnAla: 2.785 ± 0.048
0.258AsnCys: 0.258 ± 0.014
2.054AsnAsp: 2.054 ± 0.042
2.936AsnGlu: 2.936 ± 0.047
1.394AsnPhe: 1.394 ± 0.033
3.187AsnGly: 3.187 ± 0.06
0.85AsnHis: 0.85 ± 0.026
2.653AsnIle: 2.653 ± 0.05
2.256AsnLys: 2.256 ± 0.053
3.547AsnLeu: 3.547 ± 0.056
1.155AsnMet: 1.155 ± 0.029
1.848AsnAsn: 1.848 ± 0.049
2.034AsnPro: 2.034 ± 0.033
1.658AsnGln: 1.658 ± 0.032
2.066AsnArg: 2.066 ± 0.044
2.434AsnSer: 2.434 ± 0.047
2.046AsnThr: 2.046 ± 0.04
2.571AsnVal: 2.571 ± 0.044
0.586AsnTrp: 0.586 ± 0.021
1.668AsnTyr: 1.668 ± 0.043
0.0AsnXaa: 0.0 ± 0.0
Pro
2.893ProAla: 2.893 ± 0.049
0.26ProCys: 0.26 ± 0.015
2.395ProAsp: 2.395 ± 0.043
3.427ProGlu: 3.427 ± 0.054
1.807ProPhe: 1.807 ± 0.04
2.944ProGly: 2.944 ± 0.052
0.884ProHis: 0.884 ± 0.027
2.501ProIle: 2.501 ± 0.045
1.825ProLys: 1.825 ± 0.035
3.856ProLeu: 3.856 ± 0.061
0.912ProMet: 0.912 ± 0.026
1.596ProAsn: 1.596 ± 0.034
1.211ProPro: 1.211 ± 0.035
1.404ProGln: 1.404 ± 0.028
1.406ProArg: 1.406 ± 0.033
2.522ProSer: 2.522 ± 0.045
1.978ProThr: 1.978 ± 0.107
2.947ProVal: 2.947 ± 0.054
0.471ProTrp: 0.471 ± 0.019
1.459ProTyr: 1.459 ± 0.032
0.0ProXaa: 0.0 ± 0.0
Gln
3.264GlnAla: 3.264 ± 0.051
0.218GlnCys: 0.218 ± 0.012
1.889GlnAsp: 1.889 ± 0.04
2.939GlnGlu: 2.939 ± 0.049
1.615GlnPhe: 1.615 ± 0.035
2.847GlnGly: 2.847 ± 0.052
0.866GlnHis: 0.866 ± 0.025
2.758GlnIle: 2.758 ± 0.046
2.061GlnLys: 2.061 ± 0.043
4.388GlnLeu: 4.388 ± 0.059
1.221GlnMet: 1.221 ± 0.032
1.499GlnAsn: 1.499 ± 0.035
1.508GlnPro: 1.508 ± 0.032
2.117GlnGln: 2.117 ± 0.057
2.181GlnArg: 2.181 ± 0.043
2.44GlnSer: 2.44 ± 0.049
1.896GlnThr: 1.896 ± 0.035
2.67GlnVal: 2.67 ± 0.043
0.469GlnTrp: 0.469 ± 0.02
1.491GlnTyr: 1.491 ± 0.032
0.0GlnXaa: 0.0 ± 0.0
Arg
3.25ArgAla: 3.25 ± 0.061
0.357ArgCys: 0.357 ± 0.018
2.437ArgAsp: 2.437 ± 0.046
3.852ArgGlu: 3.852 ± 0.063
2.11ArgPhe: 2.11 ± 0.041
3.058ArgGly: 3.058 ± 0.057
1.076ArgHis: 1.076 ± 0.033
3.736ArgIle: 3.736 ± 0.053
3.146ArgLys: 3.146 ± 0.052
5.238ArgLeu: 5.238 ± 0.07
1.533ArgMet: 1.533 ± 0.032
2.113ArgAsn: 2.113 ± 0.037
1.654ArgPro: 1.654 ± 0.037
2.18ArgGln: 2.18 ± 0.045
2.839ArgArg: 2.839 ± 0.051
3.193ArgSer: 3.193 ± 0.051
2.587ArgThr: 2.587 ± 0.043
3.144ArgVal: 3.144 ± 0.053
0.642ArgTrp: 0.642 ± 0.023
1.896ArgTyr: 1.896 ± 0.035
0.0ArgXaa: 0.0 ± 0.0
Ser
4.624SerAla: 4.624 ± 0.066
0.431SerCys: 0.431 ± 0.019
3.117SerAsp: 3.117 ± 0.046
4.047SerGlu: 4.047 ± 0.057
3.107SerPhe: 3.107 ± 0.053
5.365SerGly: 5.365 ± 0.067
1.257SerHis: 1.257 ± 0.033
4.573SerIle: 4.573 ± 0.064
3.399SerLys: 3.399 ± 0.052
6.607SerLeu: 6.607 ± 0.078
1.811SerMet: 1.811 ± 0.035
2.438SerAsn: 2.438 ± 0.05
2.445SerPro: 2.445 ± 0.043
2.102SerGln: 2.102 ± 0.046
3.215SerArg: 3.215 ± 0.056
4.607SerSer: 4.607 ± 0.079
3.221SerThr: 3.221 ± 0.051
4.292SerVal: 4.292 ± 0.065
0.834SerTrp: 0.834 ± 0.027
2.397SerTyr: 2.397 ± 0.043
0.0SerXaa: 0.0 ± 0.0
Thr
4.293ThrAla: 4.293 ± 0.052
0.336ThrCys: 0.336 ± 0.016
2.624ThrAsp: 2.624 ± 0.041
3.422ThrGlu: 3.422 ± 0.057
2.25ThrPhe: 2.25 ± 0.042
4.408ThrGly: 4.408 ± 0.061
1.013ThrHis: 1.013 ± 0.027
3.71ThrIle: 3.71 ± 0.056
2.541ThrLys: 2.541 ± 0.041
5.329ThrLeu: 5.329 ± 0.066
1.392ThrMet: 1.392 ± 0.03
1.934ThrAsn: 1.934 ± 0.041
2.436ThrPro: 2.436 ± 0.101
1.592ThrGln: 1.592 ± 0.035
2.203ThrArg: 2.203 ± 0.039
3.396ThrSer: 3.396 ± 0.048
2.77ThrThr: 2.77 ± 0.055
4.223ThrVal: 4.223 ± 0.059
0.624ThrTrp: 0.624 ± 0.021
1.892ThrTyr: 1.892 ± 0.038
0.0ThrXaa: 0.0 ± 0.0
Val
4.97ValAla: 4.97 ± 0.067
0.58ValCys: 0.58 ± 0.022
3.706ValAsp: 3.706 ± 0.058
4.578ValGlu: 4.578 ± 0.062
2.883ValPhe: 2.883 ± 0.049
4.402ValGly: 4.402 ± 0.062
1.453ValHis: 1.453 ± 0.034
5.117ValIle: 5.117 ± 0.066
3.962ValLys: 3.962 ± 0.061
6.99ValLeu: 6.99 ± 0.057
1.968ValMet: 1.968 ± 0.037
3.013ValAsn: 3.013 ± 0.047
2.822ValPro: 2.822 ± 0.048
2.634ValGln: 2.634 ± 0.036
3.259ValArg: 3.259 ± 0.049
4.652ValSer: 4.652 ± 0.064
4.117ValThr: 4.117 ± 0.051
4.91ValVal: 4.91 ± 0.062
0.799ValTrp: 0.799 ± 0.023
2.382ValTyr: 2.382 ± 0.046
0.0ValXaa: 0.0 ± 0.0
Trp
0.75TrpAla: 0.75 ± 0.025
0.103TrpCys: 0.103 ± 0.009
0.648TrpAsp: 0.648 ± 0.022
0.822TrpGlu: 0.822 ± 0.024
0.582TrpPhe: 0.582 ± 0.021
0.841TrpGly: 0.841 ± 0.028
0.239TrpHis: 0.239 ± 0.016
0.899TrpIle: 0.899 ± 0.026
0.698TrpLys: 0.698 ± 0.026
1.39TrpLeu: 1.39 ± 0.037
0.428TrpMet: 0.428 ± 0.017
0.662TrpAsn: 0.662 ± 0.022
0.334TrpPro: 0.334 ± 0.015
0.51TrpGln: 0.51 ± 0.021
0.622TrpArg: 0.622 ± 0.021
0.863TrpSer: 0.863 ± 0.027
0.605TrpThr: 0.605 ± 0.021
0.725TrpVal: 0.725 ± 0.024
0.166TrpTrp: 0.166 ± 0.011
0.419TrpTyr: 0.419 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.582TyrAla: 2.582 ± 0.044
0.306TyrCys: 0.306 ± 0.015
1.981TyrAsp: 1.981 ± 0.04
2.524TyrGlu: 2.524 ± 0.046
1.678TyrPhe: 1.678 ± 0.035
2.639TyrGly: 2.639 ± 0.045
0.791TyrHis: 0.791 ± 0.028
2.315TyrIle: 2.315 ± 0.04
1.806TyrLys: 1.806 ± 0.036
3.488TyrLeu: 3.488 ± 0.054
0.954TyrMet: 0.954 ± 0.027
1.55TyrAsn: 1.55 ± 0.039
1.579TyrPro: 1.579 ± 0.039
1.329TyrGln: 1.329 ± 0.033
2.155TyrArg: 2.155 ± 0.047
2.334TyrSer: 2.334 ± 0.045
2.008TyrThr: 2.008 ± 0.046
2.389TyrVal: 2.389 ± 0.041
0.502TyrTrp: 0.502 ± 0.021
1.482TyrTyr: 1.482 ± 0.037
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4362 proteins (1430031 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski