Amino acid dipepetide frequency for Neisseria weaveri

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.466AlaAla: 12.466 ± 0.211
1.143AlaCys: 1.143 ± 0.048
5.859AlaAsp: 5.859 ± 0.107
7.471AlaGlu: 7.471 ± 0.136
3.685AlaPhe: 3.685 ± 0.087
7.752AlaGly: 7.752 ± 0.152
1.888AlaHis: 1.888 ± 0.065
4.469AlaIle: 4.469 ± 0.096
5.587AlaLys: 5.587 ± 0.097
10.439AlaLeu: 10.439 ± 0.149
2.55AlaMet: 2.55 ± 0.072
3.113AlaAsn: 3.113 ± 0.092
3.424AlaPro: 3.424 ± 0.126
4.443AlaGln: 4.443 ± 0.105
4.462AlaArg: 4.462 ± 0.094
4.561AlaSer: 4.561 ± 0.099
3.972AlaThr: 3.972 ± 0.099
8.711AlaVal: 8.711 ± 0.143
1.181AlaTrp: 1.181 ± 0.046
2.824AlaTyr: 2.824 ± 0.068
0.0AlaXaa: 0.0 ± 0.0
Cys
0.865CysAla: 0.865 ± 0.04
0.152CysCys: 0.152 ± 0.016
0.456CysAsp: 0.456 ± 0.027
0.518CysGlu: 0.518 ± 0.025
0.362CysPhe: 0.362 ± 0.024
1.014CysGly: 1.014 ± 0.047
0.324CysHis: 0.324 ± 0.025
0.542CysIle: 0.542 ± 0.032
0.382CysLys: 0.382 ± 0.028
0.961CysLeu: 0.961 ± 0.042
0.185CysMet: 0.185 ± 0.017
0.382CysAsn: 0.382 ± 0.026
0.44CysPro: 0.44 ± 0.03
0.347CysGln: 0.347 ± 0.022
0.6CysArg: 0.6 ± 0.035
0.631CysSer: 0.631 ± 0.038
0.466CysThr: 0.466 ± 0.025
0.589CysVal: 0.589 ± 0.033
0.098CysTrp: 0.098 ± 0.013
0.253CysTyr: 0.253 ± 0.02
0.0CysXaa: 0.0 ± 0.0
Asp
4.681AspAla: 4.681 ± 0.158
0.488AspCys: 0.488 ± 0.031
2.61AspAsp: 2.61 ± 0.091
3.451AspGlu: 3.451 ± 0.086
2.348AspPhe: 2.348 ± 0.072
4.31AspGly: 4.31 ± 0.142
0.968AspHis: 0.968 ± 0.041
3.873AspIle: 3.873 ± 0.092
3.282AspLys: 3.282 ± 0.111
5.112AspLeu: 5.112 ± 0.101
1.374AspMet: 1.374 ± 0.046
2.419AspAsn: 2.419 ± 0.076
1.853AspPro: 1.853 ± 0.056
1.359AspGln: 1.359 ± 0.048
2.169AspArg: 2.169 ± 0.069
2.87AspSer: 2.87 ± 0.067
2.864AspThr: 2.864 ± 0.076
3.712AspVal: 3.712 ± 0.085
0.857AspTrp: 0.857 ± 0.037
1.872AspTyr: 1.872 ± 0.058
0.0AspXaa: 0.0 ± 0.0
Glu
6.579GluAla: 6.579 ± 0.121
0.474GluCys: 0.474 ± 0.028
2.695GluAsp: 2.695 ± 0.068
3.778GluGlu: 3.778 ± 0.089
2.076GluPhe: 2.076 ± 0.056
3.8GluGly: 3.8 ± 0.118
1.55GluHis: 1.55 ± 0.054
4.002GluIle: 4.002 ± 0.094
4.091GluLys: 4.091 ± 0.088
5.811GluLeu: 5.811 ± 0.131
1.64GluMet: 1.64 ± 0.058
3.145GluAsn: 3.145 ± 0.072
2.127GluPro: 2.127 ± 0.064
3.42GluGln: 3.42 ± 0.09
3.473GluArg: 3.473 ± 0.084
3.094GluSer: 3.094 ± 0.079
3.595GluThr: 3.595 ± 0.076
3.865GluVal: 3.865 ± 0.101
0.846GluTrp: 0.846 ± 0.039
1.83GluTyr: 1.83 ± 0.056
0.0GluXaa: 0.0 ± 0.0
Phe
3.83PheAla: 3.83 ± 0.088
0.445PheCys: 0.445 ± 0.026
2.528PheAsp: 2.528 ± 0.061
2.332PheGlu: 2.332 ± 0.067
1.699PhePhe: 1.699 ± 0.061
3.489PheGly: 3.489 ± 0.089
0.789PheHis: 0.789 ± 0.037
2.37PheIle: 2.37 ± 0.071
1.893PheLys: 1.893 ± 0.058
3.396PheLeu: 3.396 ± 0.085
0.922PheMet: 0.922 ± 0.044
1.732PheAsn: 1.732 ± 0.053
1.415PhePro: 1.415 ± 0.055
1.424PheGln: 1.424 ± 0.049
1.686PheArg: 1.686 ± 0.057
2.684PheSer: 2.684 ± 0.074
2.09PheThr: 2.09 ± 0.059
2.55PheVal: 2.55 ± 0.071
0.57PheTrp: 0.57 ± 0.035
1.285PheTyr: 1.285 ± 0.047
0.0PheXaa: 0.0 ± 0.0
Gly
6.061GlyAla: 6.061 ± 0.125
0.799GlyCys: 0.799 ± 0.038
3.434GlyAsp: 3.434 ± 0.107
4.379GlyGlu: 4.379 ± 0.094
3.364GlyPhe: 3.364 ± 0.082
5.979GlyGly: 5.979 ± 0.14
1.623GlyHis: 1.623 ± 0.054
4.904GlyIle: 4.904 ± 0.088
5.453GlyLys: 5.453 ± 0.107
7.913GlyLeu: 7.913 ± 0.12
2.264GlyMet: 2.264 ± 0.066
3.344GlyAsn: 3.344 ± 0.128
1.374GlyPro: 1.374 ± 0.067
2.79GlyGln: 2.79 ± 0.066
4.181GlyArg: 4.181 ± 0.094
4.793GlySer: 4.793 ± 0.129
3.675GlyThr: 3.675 ± 0.089
5.301GlyVal: 5.301 ± 0.107
1.1GlyTrp: 1.1 ± 0.044
2.374GlyTyr: 2.374 ± 0.066
0.0GlyXaa: 0.0 ± 0.0
His
2.003HisAla: 2.003 ± 0.057
0.254HisCys: 0.254 ± 0.022
1.025HisAsp: 1.025 ± 0.04
1.279HisGlu: 1.279 ± 0.039
0.908HisPhe: 0.908 ± 0.041
1.787HisGly: 1.787 ± 0.057
0.714HisHis: 0.714 ± 0.043
1.65HisIle: 1.65 ± 0.06
1.031HisLys: 1.031 ± 0.039
2.108HisLeu: 2.108 ± 0.073
0.467HisMet: 0.467 ± 0.024
0.938HisAsn: 0.938 ± 0.034
1.32HisPro: 1.32 ± 0.055
0.892HisGln: 0.892 ± 0.042
1.148HisArg: 1.148 ± 0.05
1.238HisSer: 1.238 ± 0.045
1.457HisThr: 1.457 ± 0.047
1.222HisVal: 1.222 ± 0.045
0.313HisTrp: 0.313 ± 0.024
0.846HisTyr: 0.846 ± 0.04
0.0HisXaa: 0.0 ± 0.0
Ile
6.017IleAla: 6.017 ± 0.107
0.59IleCys: 0.59 ± 0.031
3.461IleAsp: 3.461 ± 0.079
3.896IleGlu: 3.896 ± 0.091
1.981IlePhe: 1.981 ± 0.061
4.842IleGly: 4.842 ± 0.095
1.261IleHis: 1.261 ± 0.049
3.172IleIle: 3.172 ± 0.089
2.775IleLys: 2.775 ± 0.073
5.185IleLeu: 5.185 ± 0.103
1.23IleMet: 1.23 ± 0.044
2.602IleAsn: 2.602 ± 0.072
2.526IlePro: 2.526 ± 0.065
2.169IleGln: 2.169 ± 0.07
3.238IleArg: 3.238 ± 0.074
3.513IleSer: 3.513 ± 0.079
3.32IleThr: 3.32 ± 0.095
3.781IleVal: 3.781 ± 0.089
0.606IleTrp: 0.606 ± 0.032
1.519IleTyr: 1.519 ± 0.055
0.0IleXaa: 0.0 ± 0.0
Lys
5.371LysAla: 5.371 ± 0.118
0.289LysCys: 0.289 ± 0.027
2.816LysAsp: 2.816 ± 0.108
3.172LysGlu: 3.172 ± 0.083
1.618LysPhe: 1.618 ± 0.048
3.666LysGly: 3.666 ± 0.097
1.394LysHis: 1.394 ± 0.048
3.246LysIle: 3.246 ± 0.083
3.08LysLys: 3.08 ± 0.092
5.527LysLeu: 5.527 ± 0.105
1.519LysMet: 1.519 ± 0.048
2.752LysAsn: 2.752 ± 0.083
2.823LysPro: 2.823 ± 0.073
3.178LysGln: 3.178 ± 0.07
2.779LysArg: 2.779 ± 0.078
2.543LysSer: 2.543 ± 0.066
3.397LysThr: 3.397 ± 0.076
3.667LysVal: 3.667 ± 0.084
0.589LysTrp: 0.589 ± 0.032
1.399LysTyr: 1.399 ± 0.046
0.0LysXaa: 0.0 ± 0.0
Leu
10.555LeuAla: 10.555 ± 0.149
0.993LeuCys: 0.993 ± 0.043
5.312LeuAsp: 5.312 ± 0.119
5.387LeuGlu: 5.387 ± 0.119
3.928LeuPhe: 3.928 ± 0.087
7.182LeuGly: 7.182 ± 0.129
2.248LeuHis: 2.248 ± 0.064
5.393LeuIle: 5.393 ± 0.093
5.941LeuLys: 5.941 ± 0.103
10.371LeuLeu: 10.371 ± 0.181
2.543LeuMet: 2.543 ± 0.077
4.75LeuAsn: 4.75 ± 0.094
5.672LeuPro: 5.672 ± 0.108
4.261LeuGln: 4.261 ± 0.101
4.761LeuArg: 4.761 ± 0.084
6.899LeuSer: 6.899 ± 0.123
5.241LeuThr: 5.241 ± 0.096
6.178LeuVal: 6.178 ± 0.117
1.227LeuTrp: 1.227 ± 0.055
2.449LeuTyr: 2.449 ± 0.055
0.0LeuXaa: 0.0 ± 0.0
Met
2.425MetAla: 2.425 ± 0.071
0.212MetCys: 0.212 ± 0.021
1.091MetAsp: 1.091 ± 0.042
1.252MetGlu: 1.252 ± 0.056
0.838MetPhe: 0.838 ± 0.041
1.773MetGly: 1.773 ± 0.063
0.493MetHis: 0.493 ± 0.027
1.298MetIle: 1.298 ± 0.048
1.533MetLys: 1.533 ± 0.047
2.758MetLeu: 2.758 ± 0.072
0.781MetMet: 0.781 ± 0.039
1.233MetAsn: 1.233 ± 0.041
1.312MetPro: 1.312 ± 0.049
1.235MetGln: 1.235 ± 0.047
1.366MetArg: 1.366 ± 0.052
1.639MetSer: 1.639 ± 0.053
1.377MetThr: 1.377 ± 0.047
1.691MetVal: 1.691 ± 0.06
0.242MetTrp: 0.242 ± 0.017
0.5MetTyr: 0.5 ± 0.028
0.0MetXaa: 0.0 ± 0.0
Asn
3.78AsnAla: 3.78 ± 0.082
0.339AsnCys: 0.339 ± 0.023
2.073AsnAsp: 2.073 ± 0.075
2.3AsnGlu: 2.3 ± 0.076
1.356AsnPhe: 1.356 ± 0.044
3.642AsnGly: 3.642 ± 0.098
1.059AsnHis: 1.059 ± 0.041
3.094AsnIle: 3.094 ± 0.086
2.226AsnLys: 2.226 ± 0.062
3.914AsnLeu: 3.914 ± 0.09
1.026AsnMet: 1.026 ± 0.04
1.907AsnAsn: 1.907 ± 0.078
2.507AsnPro: 2.507 ± 0.067
1.798AsnGln: 1.798 ± 0.059
2.565AsnArg: 2.565 ± 0.07
2.116AsnSer: 2.116 ± 0.075
2.458AsnThr: 2.458 ± 0.079
2.744AsnVal: 2.744 ± 0.082
0.535AsnTrp: 0.535 ± 0.027
1.167AsnTyr: 1.167 ± 0.053
0.0AsnXaa: 0.0 ± 0.0
Pro
4.045ProAla: 4.045 ± 0.09
0.32ProCys: 0.32 ± 0.024
2.684ProAsp: 2.684 ± 0.072
4.027ProGlu: 4.027 ± 0.093
1.765ProPhe: 1.765 ± 0.057
2.269ProGly: 2.269 ± 0.066
0.991ProHis: 0.991 ± 0.044
1.909ProIle: 1.909 ± 0.059
2.343ProLys: 2.343 ± 0.068
4.034ProLeu: 4.034 ± 0.081
0.922ProMet: 0.922 ± 0.035
1.749ProAsn: 1.749 ± 0.058
1.441ProPro: 1.441 ± 0.058
2.029ProGln: 2.029 ± 0.053
1.535ProArg: 1.535 ± 0.046
2.504ProSer: 2.504 ± 0.07
1.986ProThr: 1.986 ± 0.057
3.603ProVal: 3.603 ± 0.089
0.448ProTrp: 0.448 ± 0.031
1.348ProTyr: 1.348 ± 0.054
0.0ProXaa: 0.0 ± 0.0
Gln
4.951GlnAla: 4.951 ± 0.113
0.286GlnCys: 0.286 ± 0.021
1.964GlnAsp: 1.964 ± 0.06
2.461GlnGlu: 2.461 ± 0.069
1.449GlnPhe: 1.449 ± 0.049
2.771GlnGly: 2.771 ± 0.066
1.157GlnHis: 1.157 ± 0.044
2.586GlnIle: 2.586 ± 0.067
2.359GlnLys: 2.359 ± 0.065
3.697GlnLeu: 3.697 ± 0.09
1.062GlnMet: 1.062 ± 0.045
2.149GlnAsn: 2.149 ± 0.066
1.85GlnPro: 1.85 ± 0.057
2.378GlnGln: 2.378 ± 0.086
2.18GlnArg: 2.18 ± 0.065
2.534GlnSer: 2.534 ± 0.066
3.172GlnThr: 3.172 ± 0.086
2.654GlnVal: 2.654 ± 0.083
0.575GlnTrp: 0.575 ± 0.032
1.307GlnTyr: 1.307 ± 0.046
0.0GlnXaa: 0.0 ± 0.0
Arg
4.141ArgAla: 4.141 ± 0.085
0.412ArgCys: 0.412 ± 0.025
2.48ArgAsp: 2.48 ± 0.067
3.206ArgGlu: 3.206 ± 0.085
2.605ArgPhe: 2.605 ± 0.077
2.922ArgGly: 2.922 ± 0.072
1.345ArgHis: 1.345 ± 0.053
3.236ArgIle: 3.236 ± 0.074
2.635ArgLys: 2.635 ± 0.072
5.755ArgLeu: 5.755 ± 0.115
1.452ArgMet: 1.452 ± 0.043
2.235ArgAsn: 2.235 ± 0.069
2.139ArgPro: 2.139 ± 0.061
2.572ArgGln: 2.572 ± 0.075
3.072ArgArg: 3.072 ± 0.072
2.72ArgSer: 2.72 ± 0.062
2.292ArgThr: 2.292 ± 0.07
3.206ArgVal: 3.206 ± 0.063
0.643ArgTrp: 0.643 ± 0.035
1.988ArgTyr: 1.988 ± 0.061
0.0ArgXaa: 0.0 ± 0.0
Ser
5.44SerAla: 5.44 ± 0.104
0.542SerCys: 0.542 ± 0.031
3.356SerAsp: 3.356 ± 0.077
3.543SerGlu: 3.543 ± 0.082
2.286SerPhe: 2.286 ± 0.066
5.538SerGly: 5.538 ± 0.115
1.216SerHis: 1.216 ± 0.043
2.93SerIle: 2.93 ± 0.08
2.78SerLys: 2.78 ± 0.076
5.813SerLeu: 5.813 ± 0.11
1.353SerMet: 1.353 ± 0.05
2.237SerAsn: 2.237 ± 0.067
2.177SerPro: 2.177 ± 0.064
2.125SerGln: 2.125 ± 0.058
3.06SerArg: 3.06 ± 0.081
3.164SerSer: 3.164 ± 0.093
2.547SerThr: 2.547 ± 0.078
4.198SerVal: 4.198 ± 0.122
0.691SerTrp: 0.691 ± 0.032
1.604SerTyr: 1.604 ± 0.069
0.0SerXaa: 0.0 ± 0.0
Thr
6.364ThrAla: 6.364 ± 0.134
0.482ThrCys: 0.482 ± 0.027
3.015ThrAsp: 3.015 ± 0.109
3.102ThrGlu: 3.102 ± 0.071
1.882ThrPhe: 1.882 ± 0.055
4.269ThrGly: 4.269 ± 0.093
1.157ThrHis: 1.157 ± 0.043
2.731ThrIle: 2.731 ± 0.078
1.885ThrLys: 1.885 ± 0.055
5.843ThrLeu: 5.843 ± 0.104
0.949ThrMet: 0.949 ± 0.035
1.593ThrAsn: 1.593 ± 0.059
2.851ThrPro: 2.851 ± 0.081
2.049ThrGln: 2.049 ± 0.065
2.389ThrArg: 2.389 ± 0.077
2.229ThrSer: 2.229 ± 0.056
2.496ThrThr: 2.496 ± 0.121
4.902ThrVal: 4.902 ± 0.18
0.527ThrTrp: 0.527 ± 0.029
1.344ThrTyr: 1.344 ± 0.05
0.0ThrXaa: 0.0 ± 0.0
Val
6.762ValAla: 6.762 ± 0.118
0.835ValCys: 0.835 ± 0.039
3.571ValAsp: 3.571 ± 0.113
4.305ValGlu: 4.305 ± 0.099
3.025ValPhe: 3.025 ± 0.087
4.889ValGly: 4.889 ± 0.106
1.301ValHis: 1.301 ± 0.046
3.972ValIle: 3.972 ± 0.082
3.734ValLys: 3.734 ± 0.091
7.666ValLeu: 7.666 ± 0.141
1.934ValMet: 1.934 ± 0.061
2.815ValAsn: 2.815 ± 0.094
2.884ValPro: 2.884 ± 0.082
2.592ValGln: 2.592 ± 0.067
3.634ValArg: 3.634 ± 0.084
4.694ValSer: 4.694 ± 0.087
3.5ValThr: 3.5 ± 0.155
5.413ValVal: 5.413 ± 0.143
0.966ValTrp: 0.966 ± 0.047
2.164ValTyr: 2.164 ± 0.056
0.0ValXaa: 0.0 ± 0.0
Trp
1.006TrpAla: 1.006 ± 0.046
0.159TrpCys: 0.159 ± 0.017
0.494TrpAsp: 0.494 ± 0.029
0.542TrpGlu: 0.542 ± 0.033
0.66TrpPhe: 0.66 ± 0.032
0.78TrpGly: 0.78 ± 0.031
0.417TrpHis: 0.417 ± 0.024
0.685TrpIle: 0.685 ± 0.032
0.578TrpLys: 0.578 ± 0.033
1.885TrpLeu: 1.885 ± 0.063
0.316TrpMet: 0.316 ± 0.02
0.456TrpAsn: 0.456 ± 0.029
0.429TrpPro: 0.429 ± 0.029
1.048TrpGln: 1.048 ± 0.052
0.72TrpArg: 0.72 ± 0.032
0.573TrpSer: 0.573 ± 0.033
0.466TrpThr: 0.466 ± 0.027
0.889TrpVal: 0.889 ± 0.035
0.2TrpTrp: 0.2 ± 0.018
0.347TrpTyr: 0.347 ± 0.025
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.659TyrAla: 2.659 ± 0.065
0.352TyrCys: 0.352 ± 0.018
1.547TyrAsp: 1.547 ± 0.048
1.489TyrGlu: 1.489 ± 0.051
1.293TyrPhe: 1.293 ± 0.046
2.431TyrGly: 2.431 ± 0.078
0.703TyrHis: 0.703 ± 0.033
1.623TyrIle: 1.623 ± 0.051
1.238TyrLys: 1.238 ± 0.043
3.105TyrLeu: 3.105 ± 0.072
0.556TyrMet: 0.556 ± 0.033
1.056TyrAsn: 1.056 ± 0.042
1.386TyrPro: 1.386 ± 0.047
1.433TyrGln: 1.433 ± 0.051
2.041TyrArg: 2.041 ± 0.056
1.601TyrSer: 1.601 ± 0.058
1.688TyrThr: 1.688 ± 0.057
1.784TyrVal: 1.784 ± 0.057
0.436TyrTrp: 0.436 ± 0.025
0.853TyrTyr: 0.853 ± 0.041
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2011 proteins (633418 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski