Amino acid dipeptide frequency for Peptacetobacter hominis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
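As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, and the use of one-letter codes are assumptions made for this example; the page does not describe the exact counting procedure used by the database.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: pairs are counted as overlapping windows of length 2
    and never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy "proteome" in one-letter code (not real data).
toy = ["MKKLA", "MKA"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

By construction, the frequencies of all observed dipeptides add up to 1000‰.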

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins uniformly at random, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
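The arithmetic behind the expected value can be sketched as follows. The helper `classify` and the simple "above or below the uniform expectation" rule are assumptions for illustration; the page does not state the exact threshold used for the red and blue colouring. The four example values are taken from the table on this page.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1 / (20 * 20) = 0.25 % = 2.5 per mille
expected_permille = 1000.0 / (N_STANDARD ** 2)

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# Values in per mille, copied from the table on this page.
examples = {"IleIle": 8.858, "GluLys": 9.392, "TrpTrp": 0.058, "AlaAla": 4.772}
labels = {name: classify(value) for name, value in examples.items()}
print(expected_permille, labels)
```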

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
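A minimal sketch of the unit conversion, using the AlaAla value from the table; the function names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

ala_ala = 4.772  # per mille, from the table on this page
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)
print(f"{fraction:.6f} {percent:.4f}%")
```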

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first amino acid; second amino acid in the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.772 ± 0.121
AlaCys: 0.848 ± 0.042
AlaAsp: 3.645 ± 0.094
AlaGlu: 4.74 ± 0.107
AlaPhe: 2.35 ± 0.079
AlaGly: 4.78 ± 0.123
AlaHis: 0.865 ± 0.035
AlaIle: 5.573 ± 0.106
AlaLys: 4.511 ± 0.107
AlaLeu: 5.311 ± 0.113
AlaMet: 2.092 ± 0.054
AlaAsn: 2.334 ± 0.064
AlaPro: 1.491 ± 0.057
AlaGln: 1.45 ± 0.054
AlaArg: 2.086 ± 0.062
AlaSer: 3.716 ± 0.088
AlaThr: 2.837 ± 0.078
AlaVal: 5.419 ± 0.13
AlaTrp: 0.292 ± 0.023
AlaTyr: 2.128 ± 0.059
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.769 ± 0.038
CysCys: 0.238 ± 0.023
CysAsp: 0.845 ± 0.039
CysGlu: 0.933 ± 0.04
CysPhe: 0.527 ± 0.029
CysGly: 1.285 ± 0.049
CysHis: 0.248 ± 0.022
CysIle: 1.275 ± 0.048
CysLys: 0.832 ± 0.037
CysLeu: 0.792 ± 0.039
CysMet: 0.407 ± 0.027
CysAsn: 0.535 ± 0.029
CysPro: 0.573 ± 0.032
CysGln: 0.237 ± 0.02
CysArg: 0.59 ± 0.03
CysSer: 0.936 ± 0.043
CysThr: 0.642 ± 0.029
CysVal: 0.863 ± 0.04
CysTrp: 0.084 ± 0.012
CysTyr: 0.399 ± 0.027
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.535 ± 0.085
AspCys: 0.699 ± 0.035
AspAsp: 3.447 ± 0.077
AspGlu: 5.577 ± 0.12
AspPhe: 2.856 ± 0.077
AspGly: 4.112 ± 0.102
AspHis: 0.569 ± 0.034
AspIle: 7.243 ± 0.124
AspLys: 4.962 ± 0.102
AspLeu: 3.951 ± 0.086
AspMet: 2.096 ± 0.064
AspAsn: 2.895 ± 0.077
AspPro: 1.403 ± 0.053
AspGln: 0.741 ± 0.039
AspArg: 2.676 ± 0.07
AspSer: 3.63 ± 0.08
AspThr: 2.98 ± 0.064
AspVal: 4.096 ± 0.096
AspTrp: 0.333 ± 0.027
AspTyr: 2.766 ± 0.073
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.409 ± 0.112
GluCys: 0.96 ± 0.041
GluAsp: 4.717 ± 0.109
GluGlu: 7.205 ± 0.128
GluPhe: 3.277 ± 0.081
GluGly: 4.375 ± 0.103
GluHis: 0.972 ± 0.043
GluIle: 7.859 ± 0.152
GluLys: 9.392 ± 0.145
GluLeu: 6.518 ± 0.108
GluMet: 2.552 ± 0.07
GluAsn: 5.914 ± 0.108
GluPro: 1.491 ± 0.054
GluGln: 1.583 ± 0.058
GluArg: 3.16 ± 0.084
GluSer: 4.363 ± 0.091
GluThr: 3.432 ± 0.082
GluVal: 4.736 ± 0.098
GluTrp: 0.433 ± 0.031
GluTyr: 3.75 ± 0.082
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.602 ± 0.076
PheCys: 0.592 ± 0.035
PheAsp: 2.881 ± 0.075
PheGlu: 3.215 ± 0.08
PhePhe: 1.848 ± 0.067
PheGly: 2.975 ± 0.075
PheHis: 0.475 ± 0.026
PheIle: 4.436 ± 0.123
PheLys: 2.819 ± 0.064
PheLeu: 3.158 ± 0.096
PheMet: 1.439 ± 0.056
PheAsn: 2.091 ± 0.064
PhePro: 1.093 ± 0.044
PheGln: 0.649 ± 0.035
PheArg: 1.664 ± 0.056
PheSer: 3.223 ± 0.074
PheThr: 2.165 ± 0.073
PheVal: 3.041 ± 0.087
PheTrp: 0.24 ± 0.02
PheTyr: 1.489 ± 0.05
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.396 ± 0.107
GlyCys: 1.207 ± 0.045
GlyAsp: 3.437 ± 0.084
GlyGlu: 4.702 ± 0.085
GlyPhe: 3.2 ± 0.081
GlyGly: 4.712 ± 0.143
GlyHis: 1.072 ± 0.053
GlyIle: 7.343 ± 0.119
GlyLys: 5.956 ± 0.1
GlyLeu: 5.179 ± 0.094
GlyMet: 2.384 ± 0.063
GlyAsn: 3.535 ± 0.089
GlyPro: 1.231 ± 0.045
GlyGln: 1.348 ± 0.042
GlyArg: 2.513 ± 0.071
GlySer: 3.814 ± 0.097
GlyThr: 3.507 ± 0.088
GlyVal: 4.819 ± 0.096
GlyTrp: 0.457 ± 0.03
GlyTyr: 3.304 ± 0.073
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.803 ± 0.038
HisCys: 0.177 ± 0.019
HisAsp: 0.727 ± 0.035
HisGlu: 0.98 ± 0.044
HisPhe: 0.55 ± 0.031
HisGly: 1.017 ± 0.037
HisHis: 0.287 ± 0.026
HisIle: 1.343 ± 0.046
HisLys: 0.967 ± 0.043
HisLeu: 1.024 ± 0.043
HisMet: 0.449 ± 0.025
HisAsn: 0.689 ± 0.034
HisPro: 0.589 ± 0.033
HisGln: 0.305 ± 0.021
HisArg: 0.586 ± 0.03
HisSer: 0.861 ± 0.039
HisThr: 0.697 ± 0.032
HisVal: 0.71 ± 0.037
HisTrp: 0.089 ± 0.012
HisTyr: 0.488 ± 0.03
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.175 ± 0.106
IleCys: 1.35 ± 0.046
IleAsp: 6.399 ± 0.118
IleGlu: 7.852 ± 0.136
IlePhe: 4.029 ± 0.123
IleGly: 6.227 ± 0.101
IleHis: 1.161 ± 0.044
IleIle: 8.858 ± 0.163
IleLys: 8.487 ± 0.133
IleLeu: 7.763 ± 0.154
IleMet: 2.811 ± 0.075
IleAsn: 5.294 ± 0.126
IlePro: 3.379 ± 0.085
IleGln: 1.791 ± 0.054
IleArg: 3.393 ± 0.087
IleSer: 7.419 ± 0.143
IleThr: 4.597 ± 0.086
IleVal: 6.462 ± 0.11
IleTrp: 0.459 ± 0.032
IleTyr: 3.653 ± 0.089
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.848 ± 0.096
LysCys: 0.835 ± 0.043
LysAsp: 5.064 ± 0.099
LysGlu: 8.08 ± 0.143
LysPhe: 2.931 ± 0.073
LysGly: 4.884 ± 0.082
LysHis: 1.064 ± 0.043
LysIle: 7.582 ± 0.152
LysLys: 7.893 ± 0.138
LysLeu: 6.26 ± 0.099
LysMet: 2.826 ± 0.071
LysAsn: 5.827 ± 0.1
LysPro: 2.054 ± 0.056
LysGln: 1.672 ± 0.061
LysArg: 3.723 ± 0.088
LysSer: 5.299 ± 0.117
LysThr: 3.953 ± 0.08
LysVal: 5.384 ± 0.109
LysTrp: 0.504 ± 0.028
LysTyr: 4.058 ± 0.093
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.771 ± 0.101
LeuCys: 1.035 ± 0.042
LeuAsp: 4.975 ± 0.102
LeuGlu: 6.252 ± 0.127
LeuPhe: 3.223 ± 0.107
LeuGly: 5.369 ± 0.113
LeuHis: 0.972 ± 0.035
LeuIle: 6.568 ± 0.132
LeuLys: 7.111 ± 0.123
LeuLeu: 6.107 ± 0.119
LeuMet: 2.368 ± 0.053
LeuAsn: 4.201 ± 0.085
LeuPro: 2.457 ± 0.064
LeuGln: 1.486 ± 0.047
LeuArg: 2.882 ± 0.079
LeuSer: 6.117 ± 0.111
LeuThr: 3.672 ± 0.08
LeuVal: 5.015 ± 0.092
LeuTrp: 0.409 ± 0.029
LeuTyr: 2.85 ± 0.068
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.135 ± 0.072
MetCys: 0.435 ± 0.028
MetAsp: 2.007 ± 0.055
MetGlu: 2.292 ± 0.06
MetPhe: 1.254 ± 0.052
MetGly: 2.167 ± 0.065
MetHis: 0.373 ± 0.022
MetIle: 2.839 ± 0.067
MetLys: 3.16 ± 0.068
MetLeu: 2.466 ± 0.064
MetMet: 1.049 ± 0.043
MetAsn: 2.05 ± 0.054
MetPro: 0.989 ± 0.041
MetGln: 0.702 ± 0.033
MetArg: 1.161 ± 0.04
MetSer: 2.135 ± 0.056
MetThr: 1.556 ± 0.052
MetVal: 1.846 ± 0.065
MetTrp: 0.165 ± 0.018
MetTyr: 1.202 ± 0.049
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.014 ± 0.082
AsnCys: 0.663 ± 0.037
AsnAsp: 2.727 ± 0.068
AsnGlu: 3.81 ± 0.078
AsnPhe: 2.104 ± 0.069
AsnGly: 3.668 ± 0.089
AsnHis: 0.693 ± 0.035
AsnIle: 6.707 ± 0.132
AsnLys: 4.054 ± 0.097
AsnLeu: 4.328 ± 0.092
AsnMet: 1.859 ± 0.05
AsnAsn: 2.9 ± 0.099
AsnPro: 2.141 ± 0.059
AsnGln: 1.041 ± 0.04
AsnArg: 2.527 ± 0.063
AsnSer: 3.285 ± 0.088
AsnThr: 2.821 ± 0.074
AsnVal: 3.298 ± 0.077
AsnTrp: 0.326 ± 0.025
AsnTyr: 2.078 ± 0.072
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.645 ± 0.056
ProCys: 0.321 ± 0.022
ProAsp: 1.893 ± 0.057
ProGlu: 2.826 ± 0.074
ProPhe: 1.343 ± 0.048
ProGly: 2.075 ± 0.061
ProHis: 0.493 ± 0.031
ProIle: 2.404 ± 0.076
ProLys: 1.896 ± 0.066
ProLeu: 2.029 ± 0.064
ProMet: 0.78 ± 0.037
ProAsn: 1.157 ± 0.053
ProPro: 0.526 ± 0.034
ProGln: 0.683 ± 0.03
ProArg: 0.835 ± 0.037
ProSer: 1.778 ± 0.052
ProThr: 1.309 ± 0.051
ProVal: 2.444 ± 0.066
ProTrp: 0.193 ± 0.018
ProTyr: 1.139 ± 0.046
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.299 ± 0.051
GlnCys: 0.203 ± 0.016
GlnAsp: 0.928 ± 0.038
GlnGlu: 1.411 ± 0.046
GlnPhe: 0.808 ± 0.036
GlnGly: 1.268 ± 0.051
GlnHis: 0.256 ± 0.02
GlnIle: 1.934 ± 0.053
GlnLys: 1.888 ± 0.057
GlnLeu: 1.682 ± 0.06
GlnMet: 0.727 ± 0.035
GlnAsn: 1.173 ± 0.051
GlnPro: 0.522 ± 0.03
GlnGln: 0.475 ± 0.03
GlnArg: 0.925 ± 0.039
GlnSer: 1.142 ± 0.044
GlnThr: 0.946 ± 0.044
GlnVal: 1.213 ± 0.046
GlnTrp: 0.139 ± 0.016
GlnTyr: 0.824 ± 0.035
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.112 ± 0.062
ArgCys: 0.539 ± 0.033
ArgAsp: 2.18 ± 0.06
ArgGlu: 3.512 ± 0.079
ArgPhe: 1.703 ± 0.055
ArgGly: 2.302 ± 0.059
ArgHis: 0.534 ± 0.029
ArgIle: 3.841 ± 0.093
ArgLys: 4.047 ± 0.08
ArgLeu: 2.962 ± 0.078
ArgMet: 1.272 ± 0.047
ArgAsn: 2.357 ± 0.058
ArgPro: 0.994 ± 0.037
ArgGln: 0.938 ± 0.044
ArgArg: 1.619 ± 0.056
ArgSer: 1.573 ± 0.054
ArgThr: 1.802 ± 0.062
ArgVal: 2.529 ± 0.064
ArgTrp: 0.237 ± 0.018
ArgTyr: 1.82 ± 0.062
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.826 ± 0.102
SerCys: 0.745 ± 0.038
SerAsp: 4.488 ± 0.096
SerGlu: 5.376 ± 0.13
SerPhe: 2.9 ± 0.065
SerGly: 5.14 ± 0.114
SerHis: 0.842 ± 0.039
SerIle: 6.487 ± 0.129
SerLys: 4.84 ± 0.097
SerLeu: 5.014 ± 0.104
SerMet: 2.007 ± 0.067
SerAsn: 3.045 ± 0.077
SerPro: 1.599 ± 0.044
SerGln: 1.497 ± 0.052
SerArg: 2.482 ± 0.073
SerSer: 4.342 ± 0.125
SerThr: 2.994 ± 0.069
SerVal: 4.54 ± 0.094
SerTrp: 0.389 ± 0.029
SerTyr: 2.508 ± 0.066
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.38 ± 0.088
ThrCys: 0.493 ± 0.029
ThrAsp: 3.084 ± 0.073
ThrGlu: 3.656 ± 0.079
ThrPhe: 1.914 ± 0.05
ThrGly: 4.302 ± 0.091
ThrHis: 0.835 ± 0.036
ThrIle: 4.234 ± 0.077
ThrLys: 3.186 ± 0.064
ThrLeu: 3.927 ± 0.08
ThrMet: 1.306 ± 0.046
ThrAsn: 2.117 ± 0.066
ThrPro: 1.757 ± 0.064
ThrGln: 0.915 ± 0.036
ThrArg: 1.596 ± 0.056
ThrSer: 3.113 ± 0.085
ThrThr: 2.307 ± 0.073
ThrVal: 4.206 ± 0.094
ThrTrp: 0.248 ± 0.021
ThrTyr: 1.63 ± 0.051
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.224 ± 0.095
ValCys: 1.059 ± 0.042
ValAsp: 4.214 ± 0.095
ValGlu: 5.267 ± 0.103
ValPhe: 3.23 ± 0.085
ValGly: 4.161 ± 0.106
ValHis: 1.012 ± 0.048
ValIle: 6.282 ± 0.125
ValLys: 5.161 ± 0.089
ValLeu: 5.802 ± 0.109
ValMet: 2.057 ± 0.059
ValAsn: 3.181 ± 0.081
ValPro: 2.01 ± 0.06
ValGln: 1.332 ± 0.055
ValArg: 2.328 ± 0.068
ValSer: 5.137 ± 0.104
ValThr: 3.449 ± 0.082
ValVal: 5.046 ± 0.117
ValTrp: 0.336 ± 0.027
ValTyr: 2.936 ± 0.07
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.384 ± 0.022
TrpCys: 0.091 ± 0.014
TrpAsp: 0.358 ± 0.025
TrpGlu: 0.381 ± 0.025
TrpPhe: 0.248 ± 0.02
TrpGly: 0.459 ± 0.027
TrpHis: 0.084 ± 0.014
TrpIle: 0.517 ± 0.034
TrpLys: 0.477 ± 0.028
TrpLeu: 0.418 ± 0.026
TrpMet: 0.198 ± 0.019
TrpAsn: 0.365 ± 0.025
TrpPro: 0.118 ± 0.013
TrpGln: 0.133 ± 0.014
TrpArg: 0.195 ± 0.016
TrpSer: 0.282 ± 0.02
TrpThr: 0.273 ± 0.019
TrpVal: 0.352 ± 0.027
TrpTrp: 0.058 ± 0.01
TrpTyr: 0.227 ± 0.023
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.143 ± 0.068
TyrCys: 0.521 ± 0.025
TyrAsp: 2.552 ± 0.059
TyrGlu: 3.032 ± 0.067
TyrPhe: 1.801 ± 0.054
TyrGly: 2.84 ± 0.064
TyrHis: 0.53 ± 0.026
TyrIle: 4.237 ± 0.08
TyrLys: 3.103 ± 0.083
TyrLeu: 3.15 ± 0.064
TyrMet: 1.303 ± 0.048
TyrAsn: 2.383 ± 0.073
TyrPro: 1.351 ± 0.045
TyrGln: 0.785 ± 0.036
TyrArg: 1.93 ± 0.059
TyrSer: 2.895 ± 0.077
TyrThr: 2.266 ± 0.062
TyrVal: 2.201 ± 0.058
TyrTrp: 0.227 ± 0.019
TyrTyr: 1.599 ± 0.053
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1940 proteins (616493 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
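A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates is reported. Taking the standard deviation as the error, the function names, the fixed seed, and the toy sequences are all assumptions for illustration; the page only states that bootstrapping (×100) at the protein level was used.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille (overlapping pairs, per protein)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy sequences in one-letter code (not real data).
toy = ["MKKLA", "MKA", "AKKKM", "LLKKA"]
err = bootstrap_error(toy, "KK")
print(f"KK: {pair_permille(toy, 'KK'):.1f} ± {err:.1f} per mille")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the correlation between neighbouring residues within a protein is preserved in every resample.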

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski