Amino acid dipeptide frequency for Paenisporosarcina sp. HGH0030

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs in the proteome.
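As an illustration, dipeptide frequencies could be counted roughly as follows. This is a minimal sketch, not the code used by Proteome-pI: it assumes one-letter sequences, an overlapping window of two residues that never spans two proteins, and a mapping of every non-standard letter to X (Xaa). The function name and the toy sequences are made up for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is treated as X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to X (assumption of this sketch).
        cleaned = "".join(ch if ch in STANDARD else "X" for ch in seq.upper())
        # Overlapping window of width 2; pairs never cross protein boundaries.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    # Report all 441 ordered pairs, including those that were never observed.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}

toy_proteome = ["MKLLA", "MALLK"]  # made-up sequences, not from this proteome
freqs = dipeptide_per_mille(toy_proteome)
print(len(freqs))    # 441
print(freqs["LL"])   # 250.0
```

In the toy input, LL accounts for 2 of the 8 adjacent pairs, hence 250‰.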

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly and with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
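The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names are hypothetical, the four observed values are copied from the table on this page, and the simple greater-than or less-than rule is an assumption; the exact threshold behind the red and blue marking is not stated here.

```python
# Uniform expectation: every dipeptide is equally likely.
N_STANDARD = 20
EXPECTED_20 = 1000.0 / N_STANDARD ** 2        # 2.5 per mille, i.e. 0.25 %
EXPECTED_21 = 1000.0 / (N_STANDARD + 1) ** 2  # about 2.27 per mille if Xaa is counted

def to_fraction(per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return per_mille * 1e-3

def classify(per_mille, expected=EXPECTED_20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "as expected"

# Values in per mille, copied from the table on this page.
observed = {"LeuLeu": 9.917, "AlaAla": 5.439, "ArgArg": 1.643, "CysCys": 0.091}

for name, value in observed.items():
    print(f"{name}: {value} per mille = {to_fraction(value):.6f} -> {classify(value)}")
```

With this rule LeuLeu and AlaAla fall above the 2.5‰ baseline, while ArgArg and CysCys fall below it.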

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.439 ± 0.089
AlaCys: 0.509 ± 0.025
AlaAsp: 3.592 ± 0.066
AlaGlu: 4.648 ± 0.072
AlaPhe: 3.425 ± 0.065
AlaGly: 5.209 ± 0.091
AlaHis: 1.381 ± 0.043
AlaIle: 5.925 ± 0.09
AlaLys: 4.898 ± 0.076
AlaLeu: 7.038 ± 0.087
AlaMet: 2.173 ± 0.046
AlaAsn: 3.04 ± 0.06
AlaPro: 2.214 ± 0.049
AlaGln: 2.388 ± 0.056
AlaArg: 2.661 ± 0.058
AlaSer: 4.339 ± 0.064
AlaThr: 4.006 ± 0.072
AlaVal: 5.318 ± 0.083
AlaTrp: 0.657 ± 0.028
AlaTyr: 2.389 ± 0.052
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.394 ± 0.021
CysCys: 0.091 ± 0.01
CysAsp: 0.329 ± 0.018
CysGlu: 0.433 ± 0.022
CysPhe: 0.277 ± 0.016
CysGly: 0.574 ± 0.027
CysHis: 0.167 ± 0.013
CysIle: 0.464 ± 0.021
CysLys: 0.292 ± 0.019
CysLeu: 0.582 ± 0.026
CysMet: 0.165 ± 0.013
CysAsn: 0.229 ± 0.014
CysPro: 0.316 ± 0.022
CysGln: 0.198 ± 0.014
CysArg: 0.235 ± 0.017
CysSer: 0.451 ± 0.022
CysThr: 0.351 ± 0.02
CysVal: 0.356 ± 0.018
CysTrp: 0.058 ± 0.008
CysTyr: 0.228 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.803 ± 0.07
AspCys: 0.326 ± 0.018
AspAsp: 2.623 ± 0.056
AspGlu: 4.531 ± 0.075
AspPhe: 2.601 ± 0.051
AspGly: 3.644 ± 0.077
AspHis: 1.149 ± 0.039
AspIle: 3.897 ± 0.06
AspLys: 2.964 ± 0.06
AspLeu: 5.078 ± 0.071
AspMet: 1.499 ± 0.04
AspAsn: 1.898 ± 0.05
AspPro: 1.877 ± 0.048
AspGln: 1.983 ± 0.046
AspArg: 2.157 ± 0.049
AspSer: 2.904 ± 0.055
AspThr: 2.549 ± 0.047
AspVal: 4.227 ± 0.068
AspTrp: 0.629 ± 0.025
AspTyr: 2.038 ± 0.044
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.22 ± 0.09
GluCys: 0.324 ± 0.02
GluAsp: 3.852 ± 0.067
GluGlu: 6.318 ± 0.114
GluPhe: 2.752 ± 0.051
GluGly: 4.318 ± 0.066
GluHis: 1.498 ± 0.033
GluIle: 5.273 ± 0.085
GluLys: 5.812 ± 0.094
GluLeu: 6.947 ± 0.084
GluMet: 2.339 ± 0.049
GluAsn: 3.498 ± 0.059
GluPro: 2.035 ± 0.045
GluGln: 3.453 ± 0.071
GluArg: 3.44 ± 0.079
GluSer: 3.966 ± 0.072
GluThr: 4.004 ± 0.072
GluVal: 5.266 ± 0.081
GluTrp: 0.955 ± 0.035
GluTyr: 2.094 ± 0.048
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.227 ± 0.065
PheCys: 0.261 ± 0.016
PheAsp: 2.569 ± 0.05
PheGlu: 3.054 ± 0.059
PhePhe: 2.261 ± 0.055
PheGly: 3.497 ± 0.065
PheHis: 0.99 ± 0.032
PheIle: 3.739 ± 0.081
PheLys: 2.371 ± 0.049
PheLeu: 4.38 ± 0.087
PheMet: 1.272 ± 0.036
PheAsn: 2.016 ± 0.046
PhePro: 1.753 ± 0.045
PheGln: 1.568 ± 0.044
PheArg: 1.498 ± 0.043
PheSer: 3.29 ± 0.054
PheThr: 2.822 ± 0.048
PheVal: 3.438 ± 0.067
PheTrp: 0.495 ± 0.023
PheTyr: 1.742 ± 0.042
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.771 ± 0.084
GlyCys: 0.532 ± 0.027
GlyAsp: 3.378 ± 0.07
GlyGlu: 4.447 ± 0.074
GlyPhe: 3.428 ± 0.067
GlyGly: 4.542 ± 0.092
GlyHis: 1.443 ± 0.039
GlyIle: 5.704 ± 0.087
GlyLys: 4.915 ± 0.085
GlyLeu: 6.399 ± 0.082
GlyMet: 2.083 ± 0.043
GlyAsn: 2.811 ± 0.057
GlyPro: 1.828 ± 0.056
GlyGln: 2.44 ± 0.067
GlyArg: 2.508 ± 0.056
GlySer: 4.083 ± 0.066
GlyThr: 4.264 ± 0.083
GlyVal: 4.872 ± 0.085
GlyTrp: 0.748 ± 0.031
GlyTyr: 2.764 ± 0.054
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.48 ± 0.034
HisCys: 0.17 ± 0.013
HisAsp: 1.119 ± 0.038
HisGlu: 1.485 ± 0.045
HisPhe: 1.02 ± 0.038
HisGly: 1.411 ± 0.038
HisHis: 0.639 ± 0.026
HisIle: 1.552 ± 0.044
HisLys: 1.029 ± 0.038
HisLeu: 2.064 ± 0.048
HisMet: 0.595 ± 0.021
HisAsn: 0.728 ± 0.028
HisPro: 1.181 ± 0.041
HisGln: 0.871 ± 0.03
HisArg: 0.84 ± 0.031
HisSer: 1.205 ± 0.034
HisThr: 1.153 ± 0.038
HisVal: 1.627 ± 0.04
HisTrp: 0.203 ± 0.014
HisTyr: 0.795 ± 0.028
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.928 ± 0.093
IleCys: 0.548 ± 0.023
IleAsp: 4.357 ± 0.073
IleGlu: 5.752 ± 0.087
IlePhe: 3.194 ± 0.073
IleGly: 5.774 ± 0.093
IleHis: 1.736 ± 0.043
IleIle: 5.354 ± 0.095
IleLys: 4.095 ± 0.086
IleLeu: 6.933 ± 0.105
IleMet: 1.708 ± 0.045
IleAsn: 3.064 ± 0.069
IlePro: 3.199 ± 0.058
IleGln: 3.006 ± 0.052
IleArg: 3.176 ± 0.057
IleSer: 4.967 ± 0.069
IleThr: 4.228 ± 0.073
IleVal: 5.723 ± 0.074
IleTrp: 0.667 ± 0.023
IleTyr: 2.259 ± 0.053
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.527 ± 0.074
LysCys: 0.296 ± 0.021
LysAsp: 3.811 ± 0.066
LysGlu: 6.105 ± 0.096
LysPhe: 2.067 ± 0.051
LysGly: 4.222 ± 0.063
LysHis: 1.259 ± 0.036
LysIle: 4.237 ± 0.067
LysLys: 5.242 ± 0.077
LysLeu: 5.563 ± 0.081
LysMet: 2.269 ± 0.045
LysAsn: 3.156 ± 0.064
LysPro: 2.249 ± 0.054
LysGln: 2.971 ± 0.058
LysArg: 3.041 ± 0.056
LysSer: 3.698 ± 0.07
LysThr: 3.83 ± 0.067
LysVal: 4.497 ± 0.07
LysTrp: 0.887 ± 0.033
LysTyr: 2.011 ± 0.047
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.253 ± 0.098
LeuCys: 0.587 ± 0.028
LeuAsp: 4.698 ± 0.071
LeuGlu: 6.202 ± 0.084
LeuPhe: 4.806 ± 0.089
LeuGly: 6.005 ± 0.087
LeuHis: 1.968 ± 0.05
LeuIle: 7.136 ± 0.101
LeuLys: 6.262 ± 0.093
LeuLeu: 9.917 ± 0.14
LeuMet: 2.639 ± 0.063
LeuAsn: 4.188 ± 0.071
LeuPro: 3.883 ± 0.071
LeuGln: 3.724 ± 0.07
LeuArg: 3.626 ± 0.058
LeuSer: 6.684 ± 0.079
LeuThr: 5.962 ± 0.078
LeuVal: 6.481 ± 0.095
LeuTrp: 0.825 ± 0.032
LeuTyr: 2.924 ± 0.061
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.069 ± 0.047
MetCys: 0.142 ± 0.01
MetAsp: 1.625 ± 0.039
MetGlu: 1.968 ± 0.048
MetPhe: 1.069 ± 0.04
MetGly: 1.75 ± 0.044
MetHis: 0.507 ± 0.023
MetIle: 2.144 ± 0.049
MetLys: 2.692 ± 0.057
MetLeu: 2.593 ± 0.049
MetMet: 0.958 ± 0.033
MetAsn: 1.698 ± 0.039
MetPro: 1.063 ± 0.037
MetGln: 0.96 ± 0.034
MetArg: 1.15 ± 0.034
MetSer: 1.798 ± 0.049
MetThr: 2.042 ± 0.043
MetVal: 1.804 ± 0.043
MetTrp: 0.228 ± 0.015
MetTyr: 0.833 ± 0.028
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.879 ± 0.064
AsnCys: 0.278 ± 0.018
AsnAsp: 2.267 ± 0.053
AsnGlu: 3.668 ± 0.06
AsnPhe: 1.663 ± 0.046
AsnGly: 3.547 ± 0.076
AsnHis: 1.043 ± 0.037
AsnIle: 2.987 ± 0.061
AsnLys: 2.673 ± 0.064
AsnLeu: 3.62 ± 0.058
AsnMet: 1.272 ± 0.038
AsnAsn: 1.948 ± 0.061
AsnPro: 2.197 ± 0.046
AsnGln: 1.881 ± 0.045
AsnArg: 2.029 ± 0.049
AsnSer: 2.345 ± 0.054
AsnThr: 2.187 ± 0.048
AsnVal: 3.172 ± 0.066
AsnTrp: 0.528 ± 0.022
AsnTyr: 1.436 ± 0.044
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.413 ± 0.049
ProCys: 0.218 ± 0.017
ProAsp: 1.928 ± 0.043
ProGlu: 2.971 ± 0.061
ProPhe: 2.056 ± 0.055
ProGly: 2.29 ± 0.063
ProHis: 0.831 ± 0.031
ProIle: 2.863 ± 0.052
ProLys: 2.142 ± 0.05
ProLeu: 3.444 ± 0.069
ProMet: 0.876 ± 0.034
ProAsn: 1.707 ± 0.046
ProPro: 0.994 ± 0.035
ProGln: 1.234 ± 0.039
ProArg: 1.035 ± 0.031
ProSer: 2.457 ± 0.054
ProThr: 2.3 ± 0.048
ProVal: 2.981 ± 0.051
ProTrp: 0.377 ± 0.019
ProTyr: 1.428 ± 0.042
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.841 ± 0.055
GlnCys: 0.179 ± 0.015
GlnAsp: 1.885 ± 0.045
GlnGlu: 2.666 ± 0.067
GlnPhe: 1.719 ± 0.042
GlnGly: 2.231 ± 0.065
GlnHis: 0.825 ± 0.032
GlnIle: 2.664 ± 0.05
GlnLys: 2.653 ± 0.055
GlnLeu: 4.129 ± 0.07
GlnMet: 1.324 ± 0.042
GlnAsn: 1.631 ± 0.042
GlnPro: 1.326 ± 0.043
GlnGln: 1.989 ± 0.063
GlnArg: 1.39 ± 0.04
GlnSer: 2.39 ± 0.056
GlnThr: 2.128 ± 0.051
GlnVal: 2.595 ± 0.049
GlnTrp: 0.416 ± 0.021
GlnTyr: 1.291 ± 0.04
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.559 ± 0.059
ArgCys: 0.222 ± 0.018
ArgAsp: 2.034 ± 0.049
ArgGlu: 2.981 ± 0.058
ArgPhe: 1.903 ± 0.05
ArgGly: 2.351 ± 0.052
ArgHis: 0.847 ± 0.031
ArgIle: 3.101 ± 0.069
ArgLys: 2.947 ± 0.063
ArgLeu: 3.982 ± 0.073
ArgMet: 1.339 ± 0.037
ArgAsn: 1.789 ± 0.046
ArgPro: 1.343 ± 0.038
ArgGln: 1.541 ± 0.04
ArgArg: 1.643 ± 0.049
ArgSer: 2.167 ± 0.054
ArgThr: 2.058 ± 0.046
ArgVal: 2.761 ± 0.051
ArgTrp: 0.379 ± 0.021
ArgTyr: 1.462 ± 0.043
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.032 ± 0.068
SerCys: 0.342 ± 0.02
SerAsp: 2.993 ± 0.061
SerGlu: 4.106 ± 0.07
SerPhe: 3.49 ± 0.071
SerGly: 4.545 ± 0.077
SerHis: 1.275 ± 0.036
SerIle: 5.182 ± 0.076
SerLys: 3.892 ± 0.069
SerLeu: 6.027 ± 0.088
SerMet: 1.801 ± 0.045
SerAsn: 2.69 ± 0.05
SerPro: 2.324 ± 0.048
SerGln: 2.07 ± 0.048
SerArg: 2.301 ± 0.043
SerSer: 4.162 ± 0.078
SerThr: 3.543 ± 0.068
SerVal: 4.488 ± 0.062
SerTrp: 0.641 ± 0.026
SerTyr: 2.337 ± 0.054
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.201 ± 0.07
ThrCys: 0.33 ± 0.017
ThrAsp: 3.033 ± 0.06
ThrGlu: 3.675 ± 0.067
ThrPhe: 2.953 ± 0.052
ThrGly: 4.307 ± 0.07
ThrHis: 1.227 ± 0.033
ThrIle: 4.722 ± 0.071
ThrLys: 3.54 ± 0.06
ThrLeu: 5.599 ± 0.077
ThrMet: 1.485 ± 0.036
ThrAsn: 2.477 ± 0.054
ThrPro: 2.334 ± 0.054
ThrGln: 1.696 ± 0.036
ThrArg: 1.907 ± 0.05
ThrSer: 3.687 ± 0.06
ThrThr: 3.348 ± 0.075
ThrVal: 4.547 ± 0.081
ThrTrp: 0.585 ± 0.023
ThrTyr: 2.124 ± 0.047
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.291 ± 0.081
ValCys: 0.499 ± 0.022
ValAsp: 3.765 ± 0.072
ValGlu: 5.125 ± 0.078
ValPhe: 3.244 ± 0.062
ValGly: 4.711 ± 0.076
ValHis: 1.416 ± 0.039
ValIle: 5.548 ± 0.087
ValLys: 4.776 ± 0.077
ValLeu: 7.036 ± 0.085
ValMet: 2.034 ± 0.054
ValAsn: 3.193 ± 0.069
ValPro: 2.754 ± 0.055
ValGln: 2.523 ± 0.047
ValArg: 2.829 ± 0.058
ValSer: 4.849 ± 0.079
ValThr: 4.504 ± 0.088
ValVal: 5.418 ± 0.085
ValTrp: 0.71 ± 0.029
ValTyr: 2.272 ± 0.048
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.619 ± 0.026
TrpCys: 0.078 ± 0.01
TrpAsp: 0.518 ± 0.025
TrpGlu: 0.647 ± 0.028
TrpPhe: 0.549 ± 0.027
TrpGly: 0.646 ± 0.028
TrpHis: 0.223 ± 0.015
TrpIle: 0.859 ± 0.032
TrpLys: 0.73 ± 0.028
TrpLeu: 1.184 ± 0.041
TrpMet: 0.362 ± 0.022
TrpAsn: 0.549 ± 0.026
TrpPro: 0.28 ± 0.02
TrpGln: 0.416 ± 0.022
TrpArg: 0.436 ± 0.021
TrpSer: 0.645 ± 0.03
TrpThr: 0.65 ± 0.03
TrpVal: 0.654 ± 0.028
TrpTrp: 0.149 ± 0.014
TrpTyr: 0.348 ± 0.02
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.299 ± 0.056
TyrCys: 0.258 ± 0.017
TyrAsp: 2.003 ± 0.046
TyrGlu: 2.582 ± 0.054
TyrPhe: 1.768 ± 0.047
TyrGly: 2.385 ± 0.048
TyrHis: 0.742 ± 0.03
TyrIle: 2.319 ± 0.049
TyrLys: 2.014 ± 0.047
TyrLeu: 3.252 ± 0.061
TyrMet: 0.902 ± 0.03
TyrAsn: 1.31 ± 0.038
TyrPro: 1.394 ± 0.034
TyrGln: 1.359 ± 0.04
TyrArg: 1.508 ± 0.045
TyrSer: 2.156 ± 0.057
TyrThr: 1.847 ± 0.045
TyrVal: 2.274 ± 0.047
TyrTrp: 0.402 ± 0.02
TyrTyr: 1.294 ± 0.042
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3445 proteins (977063 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
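A protein-level bootstrap of this kind might look like the sketch below. It rests on assumptions not confirmed by this page: whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 replicates, and the reported error is taken to be the standard deviation across replicates. The function names and the toy sequences are hypothetical.

```python
import random
import statistics

def pair_count(seq, pair):
    """Count overlapping occurrences of a two-letter pair in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)

def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so the unit of
    resampling is the protein rather than the individual dipeptide.
    """
    rng = random.Random(seed)
    # Per-protein (occurrences of the pair, total number of dipeptides).
    stats = [(pair_count(s, pair), max(len(s) - 1, 0)) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(stats, k=len(stats))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        if total:
            estimates.append(1000.0 * hits / total)
    return statistics.mean(estimates), statistics.stdev(estimates)

toy_proteome = ["MKLLA", "MALLK", "GGSLLV", "MKT"]  # made-up sequences
mean, err = bootstrap_error(toy_proteome, "LL")
print(f"LL: {mean:.1f} ± {err:.1f} per mille")
```

Resampling proteins instead of individual dipeptides keeps the pairs within a protein together, which is one plausible reading of "at the protein level".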

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski