Amino acid dipepetide frequency for Arcticibacterium luteifluviistationis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.186AlaAla: 5.186 ± 0.087
0.648AlaCys: 0.648 ± 0.022
3.652AlaAsp: 3.652 ± 0.057
4.369AlaGlu: 4.369 ± 0.062
3.592AlaPhe: 3.592 ± 0.049
4.871AlaGly: 4.871 ± 0.072
1.012AlaHis: 1.012 ± 0.026
5.099AlaIle: 5.099 ± 0.065
4.59AlaLys: 4.59 ± 0.072
6.3AlaLeu: 6.3 ± 0.081
1.612AlaMet: 1.612 ± 0.044
3.741AlaAsn: 3.741 ± 0.064
2.065AlaPro: 2.065 ± 0.045
2.214AlaGln: 2.214 ± 0.037
2.003AlaArg: 2.003 ± 0.039
4.861AlaSer: 4.861 ± 0.082
3.876AlaThr: 3.876 ± 0.12
4.379AlaVal: 4.379 ± 0.072
0.71AlaTrp: 0.71 ± 0.023
2.635AlaTyr: 2.635 ± 0.042
0.0AlaXaa: 0.0 ± 0.0
Cys
0.448CysAla: 0.448 ± 0.019
0.098CysCys: 0.098 ± 0.008
0.44CysAsp: 0.44 ± 0.017
0.539CysGlu: 0.539 ± 0.022
0.428CysPhe: 0.428 ± 0.016
0.663CysGly: 0.663 ± 0.052
0.216CysHis: 0.216 ± 0.012
0.506CysIle: 0.506 ± 0.019
0.468CysLys: 0.468 ± 0.019
0.742CysLeu: 0.742 ± 0.024
0.144CysMet: 0.144 ± 0.01
0.415CysAsn: 0.415 ± 0.019
0.369CysPro: 0.369 ± 0.021
0.297CysGln: 0.297 ± 0.015
0.241CysArg: 0.241 ± 0.014
0.663CysSer: 0.663 ± 0.033
0.56CysThr: 0.56 ± 0.033
0.483CysVal: 0.483 ± 0.021
0.062CysTrp: 0.062 ± 0.007
0.299CysTyr: 0.299 ± 0.017
0.0CysXaa: 0.0 ± 0.0
Asp
3.802AspAla: 3.802 ± 0.057
0.405AspCys: 0.405 ± 0.018
2.788AspAsp: 2.788 ± 0.053
3.503AspGlu: 3.503 ± 0.058
3.675AspPhe: 3.675 ± 0.057
4.437AspGly: 4.437 ± 0.086
0.787AspHis: 0.787 ± 0.023
4.319AspIle: 4.319 ± 0.059
3.923AspLys: 3.923 ± 0.057
5.492AspLeu: 5.492 ± 0.068
1.215AspMet: 1.215 ± 0.029
3.043AspAsn: 3.043 ± 0.052
1.764AspPro: 1.764 ± 0.04
1.463AspGln: 1.463 ± 0.031
1.861AspArg: 1.861 ± 0.04
3.545AspSer: 3.545 ± 0.065
2.553AspThr: 2.553 ± 0.051
3.455AspVal: 3.455 ± 0.052
0.859AspTrp: 0.859 ± 0.025
2.561AspTyr: 2.561 ± 0.051
0.0AspXaa: 0.0 ± 0.0
Glu
4.499GluAla: 4.499 ± 0.07
0.342GluCys: 0.342 ± 0.015
3.428GluAsp: 3.428 ± 0.047
4.785GluGlu: 4.785 ± 0.081
2.785GluPhe: 2.785 ± 0.046
4.213GluGly: 4.213 ± 0.056
1.118GluHis: 1.118 ± 0.032
5.377GluIle: 5.377 ± 0.07
5.508GluLys: 5.508 ± 0.078
6.186GluLeu: 6.186 ± 0.081
1.726GluMet: 1.726 ± 0.037
4.606GluAsn: 4.606 ± 0.064
1.875GluPro: 1.875 ± 0.04
2.137GluGln: 2.137 ± 0.046
2.485GluArg: 2.485 ± 0.043
3.873GluSer: 3.873 ± 0.048
3.627GluThr: 3.627 ± 0.051
4.375GluVal: 4.375 ± 0.06
0.706GluTrp: 0.706 ± 0.022
2.248GluTyr: 2.248 ± 0.043
0.0GluXaa: 0.0 ± 0.0
Phe
2.964PheAla: 2.964 ± 0.047
0.445PheCys: 0.445 ± 0.018
3.3PheAsp: 3.3 ± 0.049
3.568PheGlu: 3.568 ± 0.06
2.631PhePhe: 2.631 ± 0.054
3.678PheGly: 3.678 ± 0.055
0.836PheHis: 0.836 ± 0.024
3.614PheIle: 3.614 ± 0.063
3.598PheLys: 3.598 ± 0.057
4.886PheLeu: 4.886 ± 0.079
1.114PheMet: 1.114 ± 0.031
2.968PheAsn: 2.968 ± 0.052
1.703PhePro: 1.703 ± 0.037
1.62PheGln: 1.62 ± 0.03
1.665PheArg: 1.665 ± 0.034
4.137PheSer: 4.137 ± 0.064
2.971PheThr: 2.971 ± 0.055
3.038PheVal: 3.038 ± 0.05
0.651PheTrp: 0.651 ± 0.022
2.074PheTyr: 2.074 ± 0.039
0.0PheXaa: 0.0 ± 0.0
Gly
4.383GlyAla: 4.383 ± 0.071
0.695GlyCys: 0.695 ± 0.033
3.898GlyAsp: 3.898 ± 0.066
4.185GlyGlu: 4.185 ± 0.056
3.926GlyPhe: 3.926 ± 0.053
5.401GlyGly: 5.401 ± 0.097
1.223GlyHis: 1.223 ± 0.029
5.6GlyIle: 5.6 ± 0.061
5.009GlyLys: 5.009 ± 0.075
6.767GlyLeu: 6.767 ± 0.075
1.684GlyMet: 1.684 ± 0.038
4.053GlyAsn: 4.053 ± 0.077
1.562GlyPro: 1.562 ± 0.033
2.289GlyGln: 2.289 ± 0.039
2.426GlyArg: 2.426 ± 0.046
4.877GlySer: 4.877 ± 0.092
4.565GlyThr: 4.565 ± 0.164
4.696GlyVal: 4.696 ± 0.068
0.839GlyTrp: 0.839 ± 0.025
2.918GlyTyr: 2.918 ± 0.052
0.0GlyXaa: 0.0 ± 0.0
His
1.011HisAla: 1.011 ± 0.026
0.157HisCys: 0.157 ± 0.009
0.764HisAsp: 0.764 ± 0.022
1.011HisGlu: 1.011 ± 0.025
1.102HisPhe: 1.102 ± 0.027
1.202HisGly: 1.202 ± 0.033
0.447HisHis: 0.447 ± 0.017
1.165HisIle: 1.165 ± 0.029
1.015HisLys: 1.015 ± 0.028
1.667HisLeu: 1.667 ± 0.039
0.31HisMet: 0.31 ± 0.013
0.788HisAsn: 0.788 ± 0.022
0.856HisPro: 0.856 ± 0.023
0.605HisGln: 0.605 ± 0.022
0.59HisArg: 0.59 ± 0.021
1.015HisSer: 1.015 ± 0.027
0.904HisThr: 0.904 ± 0.028
0.889HisVal: 0.889 ± 0.025
0.232HisTrp: 0.232 ± 0.014
0.775HisTyr: 0.775 ± 0.022
0.0HisXaa: 0.0 ± 0.0
Ile
4.919IleAla: 4.919 ± 0.07
0.688IleCys: 0.688 ± 0.026
4.69IleAsp: 4.69 ± 0.06
4.944IleGlu: 4.944 ± 0.064
3.305IlePhe: 3.305 ± 0.063
5.288IleGly: 5.288 ± 0.075
1.162IleHis: 1.162 ± 0.03
5.083IleIle: 5.083 ± 0.063
5.239IleLys: 5.239 ± 0.064
6.76IleLeu: 6.76 ± 0.091
1.352IleMet: 1.352 ± 0.036
4.354IleAsn: 4.354 ± 0.058
3.157IlePro: 3.157 ± 0.05
2.345IleGln: 2.345 ± 0.04
2.493IleArg: 2.493 ± 0.045
5.878IleSer: 5.878 ± 0.069
4.181IleThr: 4.181 ± 0.081
3.988IleVal: 3.988 ± 0.049
0.748IleTrp: 0.748 ± 0.025
2.529IleTyr: 2.529 ± 0.044
0.0IleXaa: 0.0 ± 0.0
Lys
5.1LysAla: 5.1 ± 0.073
0.344LysCys: 0.344 ± 0.015
4.114LysAsp: 4.114 ± 0.058
5.717LysGlu: 5.717 ± 0.079
2.584LysPhe: 2.584 ± 0.041
4.815LysGly: 4.815 ± 0.07
1.201LysHis: 1.201 ± 0.032
5.152LysIle: 5.152 ± 0.067
5.813LysLys: 5.813 ± 0.095
6.277LysLeu: 6.277 ± 0.071
1.915LysMet: 1.915 ± 0.039
4.3LysAsn: 4.3 ± 0.055
2.673LysPro: 2.673 ± 0.049
1.968LysGln: 1.968 ± 0.039
2.726LysArg: 2.726 ± 0.051
4.728LysSer: 4.728 ± 0.062
4.341LysThr: 4.341 ± 0.066
4.778LysVal: 4.778 ± 0.071
0.921LysTrp: 0.921 ± 0.025
2.805LysTyr: 2.805 ± 0.045
0.0LysXaa: 0.0 ± 0.0
Leu
6.666LeuAla: 6.666 ± 0.082
0.717LeuCys: 0.717 ± 0.024
5.034LeuAsp: 5.034 ± 0.064
6.116LeuGlu: 6.116 ± 0.078
4.717LeuPhe: 4.717 ± 0.077
6.24LeuGly: 6.24 ± 0.074
1.404LeuHis: 1.404 ± 0.033
6.803LeuIle: 6.803 ± 0.091
7.567LeuLys: 7.567 ± 0.091
8.668LeuLeu: 8.668 ± 0.117
2.153LeuMet: 2.153 ± 0.04
5.665LeuAsn: 5.665 ± 0.063
3.739LeuPro: 3.739 ± 0.051
2.689LeuGln: 2.689 ± 0.044
3.323LeuArg: 3.323 ± 0.054
7.417LeuSer: 7.417 ± 0.071
5.305LeuThr: 5.305 ± 0.071
5.527LeuVal: 5.527 ± 0.064
0.908LeuTrp: 0.908 ± 0.026
3.064LeuTyr: 3.064 ± 0.051
0.0LeuXaa: 0.0 ± 0.0
Met
1.916MetAla: 1.916 ± 0.038
0.124MetCys: 0.124 ± 0.009
1.229MetAsp: 1.229 ± 0.03
1.507MetGlu: 1.507 ± 0.035
0.803MetPhe: 0.803 ± 0.027
1.579MetGly: 1.579 ± 0.037
0.37MetHis: 0.37 ± 0.016
1.368MetIle: 1.368 ± 0.034
2.078MetLys: 2.078 ± 0.038
1.968MetLeu: 1.968 ± 0.043
0.593MetMet: 0.593 ± 0.021
1.267MetAsn: 1.267 ± 0.033
1.02MetPro: 1.02 ± 0.024
0.706MetGln: 0.706 ± 0.022
0.8MetArg: 0.8 ± 0.025
1.413MetSer: 1.413 ± 0.031
1.192MetThr: 1.192 ± 0.031
1.446MetVal: 1.446 ± 0.03
0.181MetTrp: 0.181 ± 0.01
0.614MetTyr: 0.614 ± 0.022
0.0MetXaa: 0.0 ± 0.0
Asn
3.811AsnAla: 3.811 ± 0.06
0.522AsnCys: 0.522 ± 0.033
3.057AsnAsp: 3.057 ± 0.059
3.448AsnGlu: 3.448 ± 0.045
2.909AsnPhe: 2.909 ± 0.046
4.533AsnGly: 4.533 ± 0.089
0.993AsnHis: 0.993 ± 0.026
4.342AsnIle: 4.342 ± 0.066
3.757AsnLys: 3.757 ± 0.054
5.52AsnLeu: 5.52 ± 0.069
1.141AsnMet: 1.141 ± 0.027
3.568AsnAsn: 3.568 ± 0.094
2.819AsnPro: 2.819 ± 0.044
2.144AsnGln: 2.144 ± 0.042
2.001AsnArg: 2.001 ± 0.042
4.275AsnSer: 4.275 ± 0.084
3.605AsnThr: 3.605 ± 0.09
3.417AsnVal: 3.417 ± 0.054
0.792AsnTrp: 0.792 ± 0.021
2.683AsnTyr: 2.683 ± 0.044
0.0AsnXaa: 0.0 ± 0.0
Pro
2.354ProAla: 2.354 ± 0.043
0.209ProCys: 0.209 ± 0.01
2.207ProAsp: 2.207 ± 0.041
3.159ProGlu: 3.159 ± 0.056
2.004ProPhe: 2.004 ± 0.041
2.141ProGly: 2.141 ± 0.043
0.611ProHis: 0.611 ± 0.02
2.615ProIle: 2.615 ± 0.046
2.384ProLys: 2.384 ± 0.043
3.052ProLeu: 3.052 ± 0.051
0.746ProMet: 0.746 ± 0.024
2.35ProAsn: 2.35 ± 0.051
0.935ProPro: 0.935 ± 0.028
1.063ProGln: 1.063 ± 0.026
0.967ProArg: 0.967 ± 0.026
2.725ProSer: 2.725 ± 0.053
2.226ProThr: 2.226 ± 0.062
2.34ProVal: 2.34 ± 0.04
0.412ProTrp: 0.412 ± 0.018
1.438ProTyr: 1.438 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
2.045GlnAla: 2.045 ± 0.036
0.167GlnCys: 0.167 ± 0.01
1.57GlnAsp: 1.57 ± 0.033
2.124GlnGlu: 2.124 ± 0.044
1.523GlnPhe: 1.523 ± 0.036
1.874GlnGly: 1.874 ± 0.044
0.492GlnHis: 0.492 ± 0.02
2.505GlnIle: 2.505 ± 0.037
2.626GlnLys: 2.626 ± 0.04
2.873GlnLeu: 2.873 ± 0.045
0.776GlnMet: 0.776 ± 0.023
2.197GlnAsn: 2.197 ± 0.033
1.017GlnPro: 1.017 ± 0.025
1.017GlnGln: 1.017 ± 0.028
1.117GlnArg: 1.117 ± 0.029
2.092GlnSer: 2.092 ± 0.041
1.884GlnThr: 1.884 ± 0.033
1.986GlnVal: 1.986 ± 0.037
0.399GlnTrp: 0.399 ± 0.016
1.21GlnTyr: 1.21 ± 0.028
0.0GlnXaa: 0.0 ± 0.0
Arg
2.181ArgAla: 2.181 ± 0.04
0.205ArgCys: 0.205 ± 0.014
1.804ArgAsp: 1.804 ± 0.039
2.188ArgGlu: 2.188 ± 0.045
1.857ArgPhe: 1.857 ± 0.034
2.078ArgGly: 2.078 ± 0.042
0.564ArgHis: 0.564 ± 0.017
2.68ArgIle: 2.68 ± 0.038
2.671ArgLys: 2.671 ± 0.048
3.216ArgLeu: 3.216 ± 0.047
0.885ArgMet: 0.885 ± 0.022
2.056ArgAsn: 2.056 ± 0.041
1.177ArgPro: 1.177 ± 0.029
1.098ArgGln: 1.098 ± 0.031
1.39ArgArg: 1.39 ± 0.038
1.991ArgSer: 1.991 ± 0.037
1.87ArgThr: 1.87 ± 0.036
2.182ArgVal: 2.182 ± 0.04
0.42ArgTrp: 0.42 ± 0.018
1.401ArgTyr: 1.401 ± 0.035
0.0ArgXaa: 0.0 ± 0.0
Ser
4.246SerAla: 4.246 ± 0.077
0.789SerCys: 0.789 ± 0.031
3.874SerAsp: 3.874 ± 0.06
4.436SerGlu: 4.436 ± 0.059
4.211SerPhe: 4.211 ± 0.053
5.497SerGly: 5.497 ± 0.109
1.168SerHis: 1.168 ± 0.027
5.336SerIle: 5.336 ± 0.072
4.869SerLys: 4.869 ± 0.067
6.841SerLeu: 6.841 ± 0.071
1.441SerMet: 1.441 ± 0.035
4.102SerAsn: 4.102 ± 0.086
2.705SerPro: 2.705 ± 0.054
2.444SerGln: 2.444 ± 0.053
2.267SerArg: 2.267 ± 0.038
5.623SerSer: 5.623 ± 0.126
4.05SerThr: 4.05 ± 0.102
4.3SerVal: 4.3 ± 0.075
0.842SerTrp: 0.842 ± 0.027
2.885SerTyr: 2.885 ± 0.048
0.0SerXaa: 0.0 ± 0.0
Thr
4.284ThrAla: 4.284 ± 0.144
0.482ThrCys: 0.482 ± 0.034
3.308ThrAsp: 3.308 ± 0.057
3.615ThrGlu: 3.615 ± 0.06
3.087ThrPhe: 3.087 ± 0.054
4.586ThrGly: 4.586 ± 0.116
0.906ThrHis: 0.906 ± 0.027
4.12ThrIle: 4.12 ± 0.089
3.447ThrLys: 3.447 ± 0.043
5.537ThrLeu: 5.537 ± 0.064
0.979ThrMet: 0.979 ± 0.027
3.097ThrAsn: 3.097 ± 0.067
2.435ThrPro: 2.435 ± 0.062
1.671ThrGln: 1.671 ± 0.039
1.553ThrArg: 1.553 ± 0.032
4.121ThrSer: 4.121 ± 0.094
3.317ThrThr: 3.317 ± 0.101
3.813ThrVal: 3.813 ± 0.115
0.735ThrTrp: 0.735 ± 0.032
2.47ThrTyr: 2.47 ± 0.071
0.0ThrXaa: 0.0 ± 0.0
Val
4.23ValAla: 4.23 ± 0.066
0.625ValCys: 0.625 ± 0.029
3.344ValAsp: 3.344 ± 0.05
3.67ValGlu: 3.67 ± 0.053
3.412ValPhe: 3.412 ± 0.053
4.225ValGly: 4.225 ± 0.073
0.936ValHis: 0.936 ± 0.027
4.373ValIle: 4.373 ± 0.053
4.397ValLys: 4.397 ± 0.058
6.062ValLeu: 6.062 ± 0.076
1.342ValMet: 1.342 ± 0.03
3.833ValAsn: 3.833 ± 0.063
2.244ValPro: 2.244 ± 0.036
1.641ValGln: 1.641 ± 0.03
2.055ValArg: 2.055 ± 0.039
4.918ValSer: 4.918 ± 0.079
3.599ValThr: 3.599 ± 0.122
3.977ValVal: 3.977 ± 0.076
0.69ValTrp: 0.69 ± 0.021
2.307ValTyr: 2.307 ± 0.036
0.0ValXaa: 0.0 ± 0.0
Trp
0.782TrpAla: 0.782 ± 0.025
0.088TrpCys: 0.088 ± 0.008
0.672TrpAsp: 0.672 ± 0.023
0.738TrpGlu: 0.738 ± 0.022
0.593TrpPhe: 0.593 ± 0.02
0.85TrpGly: 0.85 ± 0.025
0.273TrpHis: 0.273 ± 0.013
0.754TrpIle: 0.754 ± 0.023
0.785TrpLys: 0.785 ± 0.023
1.083TrpLeu: 1.083 ± 0.031
0.31TrpMet: 0.31 ± 0.014
0.694TrpAsn: 0.694 ± 0.024
0.319TrpPro: 0.319 ± 0.014
0.51TrpGln: 0.51 ± 0.018
0.463TrpArg: 0.463 ± 0.018
0.836TrpSer: 0.836 ± 0.041
0.641TrpThr: 0.641 ± 0.024
0.723TrpVal: 0.723 ± 0.022
0.181TrpTrp: 0.181 ± 0.011
0.482TrpTyr: 0.482 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.59TyrAla: 2.59 ± 0.039
0.332TyrCys: 0.332 ± 0.015
2.278TyrAsp: 2.278 ± 0.042
2.277TyrGlu: 2.277 ± 0.04
2.318TyrPhe: 2.318 ± 0.043
2.824TyrGly: 2.824 ± 0.043
0.788TyrHis: 0.788 ± 0.023
2.293TyrIle: 2.293 ± 0.042
2.528TyrLys: 2.528 ± 0.043
3.825TyrLeu: 3.825 ± 0.058
0.735TyrMet: 0.735 ± 0.024
2.239TyrAsn: 2.239 ± 0.046
1.502TyrPro: 1.502 ± 0.035
1.574TyrGln: 1.574 ± 0.036
1.487TyrArg: 1.487 ± 0.039
2.937TyrSer: 2.937 ± 0.053
2.293TyrThr: 2.293 ± 0.064
2.086TyrVal: 2.086 ± 0.04
0.49TyrTrp: 0.49 ± 0.019
1.694TyrTyr: 1.694 ± 0.04
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4329 proteins (1600995 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski