Amino acid dipepetide frequency for Endomicrobium trichonymphae

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.179AlaAla: 6.179 ± 0.111
0.962AlaCys: 0.962 ± 0.04
4.06AlaAsp: 4.06 ± 0.075
4.929AlaGlu: 4.929 ± 0.096
2.994AlaPhe: 2.994 ± 0.07
5.792AlaGly: 5.792 ± 0.109
0.936AlaHis: 0.936 ± 0.038
4.862AlaIle: 4.862 ± 0.102
6.133AlaLys: 6.133 ± 0.102
6.646AlaLeu: 6.646 ± 0.093
1.721AlaMet: 1.721 ± 0.05
2.678AlaAsn: 2.678 ± 0.069
1.472AlaPro: 1.472 ± 0.051
1.845AlaGln: 1.845 ± 0.047
2.731AlaArg: 2.731 ± 0.069
4.03AlaSer: 4.03 ± 0.077
2.533AlaThr: 2.533 ± 0.061
6.805AlaVal: 6.805 ± 0.114
0.407AlaTrp: 0.407 ± 0.024
2.116AlaTyr: 2.116 ± 0.055
0.0AlaXaa: 0.0 ± 0.0
Cys
0.969CysAla: 0.969 ± 0.043
0.22CysCys: 0.22 ± 0.018
0.722CysAsp: 0.722 ± 0.029
0.8CysGlu: 0.8 ± 0.038
0.655CysPhe: 0.655 ± 0.03
1.593CysGly: 1.593 ± 0.049
0.175CysHis: 0.175 ± 0.015
1.107CysIle: 1.107 ± 0.048
0.956CysLys: 0.956 ± 0.041
1.107CysLeu: 1.107 ± 0.038
0.256CysMet: 0.256 ± 0.019
0.504CysAsn: 0.504 ± 0.031
0.634CysPro: 0.634 ± 0.033
0.253CysGln: 0.253 ± 0.018
0.712CysArg: 0.712 ± 0.032
1.0CysSer: 1.0 ± 0.045
0.523CysThr: 0.523 ± 0.029
0.881CysVal: 0.881 ± 0.031
0.103CysTrp: 0.103 ± 0.012
0.429CysTyr: 0.429 ± 0.023
0.0CysXaa: 0.0 ± 0.0
Asp
3.105AspAla: 3.105 ± 0.078
0.686AspCys: 0.686 ± 0.028
2.176AspAsp: 2.176 ± 0.07
3.555AspGlu: 3.555 ± 0.083
3.048AspPhe: 3.048 ± 0.063
3.36AspGly: 3.36 ± 0.075
0.637AspHis: 0.637 ± 0.029
6.013AspIle: 6.013 ± 0.095
4.654AspLys: 4.654 ± 0.091
4.796AspLeu: 4.796 ± 0.085
1.327AspMet: 1.327 ± 0.044
2.563AspAsn: 2.563 ± 0.062
1.331AspPro: 1.331 ± 0.05
0.601AspGln: 0.601 ± 0.028
1.892AspArg: 1.892 ± 0.047
2.858AspSer: 2.858 ± 0.065
2.317AspThr: 2.317 ± 0.063
3.166AspVal: 3.166 ± 0.075
0.407AspTrp: 0.407 ± 0.027
2.007AspTyr: 2.007 ± 0.053
0.0AspXaa: 0.0 ± 0.0
Glu
4.289GluAla: 4.289 ± 0.081
0.715GluCys: 0.715 ± 0.037
3.037GluAsp: 3.037 ± 0.079
4.629GluGlu: 4.629 ± 0.095
2.874GluPhe: 2.874 ± 0.065
2.876GluGly: 2.876 ± 0.071
1.071GluHis: 1.071 ± 0.042
7.676GluIle: 7.676 ± 0.131
8.053GluLys: 8.053 ± 0.123
5.693GluLeu: 5.693 ± 0.099
1.711GluMet: 1.711 ± 0.057
4.754GluAsn: 4.754 ± 0.092
1.291GluPro: 1.291 ± 0.046
1.741GluGln: 1.741 ± 0.046
3.161GluArg: 3.161 ± 0.075
3.636GluSer: 3.636 ± 0.071
3.703GluThr: 3.703 ± 0.088
3.921GluVal: 3.921 ± 0.082
0.407GluTrp: 0.407 ± 0.025
2.575GluTyr: 2.575 ± 0.067
0.0GluXaa: 0.0 ± 0.0
Phe
3.467PheAla: 3.467 ± 0.09
0.809PheCys: 0.809 ± 0.042
2.731PheAsp: 2.731 ± 0.058
2.978PheGlu: 2.978 ± 0.069
2.577PhePhe: 2.577 ± 0.08
3.148PheGly: 3.148 ± 0.076
0.597PheHis: 0.597 ± 0.032
4.149PheIle: 4.149 ± 0.086
4.243PheLys: 4.243 ± 0.085
4.629PheLeu: 4.629 ± 0.09
1.169PheMet: 1.169 ± 0.043
2.379PheAsn: 2.379 ± 0.068
1.554PhePro: 1.554 ± 0.047
1.087PheGln: 1.087 ± 0.036
1.679PheArg: 1.679 ± 0.047
3.547PheSer: 3.547 ± 0.083
2.294PheThr: 2.294 ± 0.06
2.909PheVal: 2.909 ± 0.077
0.357PheTrp: 0.357 ± 0.024
1.793PheTyr: 1.793 ± 0.055
0.0PheXaa: 0.0 ± 0.0
Gly
4.362GlyAla: 4.362 ± 0.094
1.038GlyCys: 1.038 ± 0.047
2.919GlyAsp: 2.919 ± 0.064
3.453GlyGlu: 3.453 ± 0.074
3.35GlyPhe: 3.35 ± 0.085
4.455GlyGly: 4.455 ± 0.089
1.232GlyHis: 1.232 ± 0.052
6.61GlyIle: 6.61 ± 0.105
6.096GlyLys: 6.096 ± 0.102
5.276GlyLeu: 5.276 ± 0.089
1.593GlyMet: 1.593 ± 0.061
2.901GlyAsn: 2.901 ± 0.063
1.403GlyPro: 1.403 ± 0.048
1.661GlyGln: 1.661 ± 0.055
3.389GlyArg: 3.389 ± 0.087
3.951GlySer: 3.951 ± 0.09
3.247GlyThr: 3.247 ± 0.077
4.237GlyVal: 4.237 ± 0.085
0.549GlyTrp: 0.549 ± 0.032
2.342GlyTyr: 2.342 ± 0.059
0.001GlyXaa: 0.001 ± 0.002
His
0.775HisAla: 0.775 ± 0.033
0.284HisCys: 0.284 ± 0.02
0.637HisAsp: 0.637 ± 0.034
0.787HisGlu: 0.787 ± 0.036
0.817HisPhe: 0.817 ± 0.035
1.143HisGly: 1.143 ± 0.044
0.292HisHis: 0.292 ± 0.023
1.59HisIle: 1.59 ± 0.054
1.279HisLys: 1.279 ± 0.04
1.313HisLeu: 1.313 ± 0.042
0.289HisMet: 0.289 ± 0.017
0.758HisAsn: 0.758 ± 0.036
0.791HisPro: 0.791 ± 0.032
0.304HisGln: 0.304 ± 0.024
0.621HisArg: 0.621 ± 0.03
1.15HisSer: 1.15 ± 0.04
0.779HisThr: 0.779 ± 0.038
0.707HisVal: 0.707 ± 0.032
0.084HisTrp: 0.084 ± 0.01
0.597HisTyr: 0.597 ± 0.026
0.0HisXaa: 0.0 ± 0.0
Ile
6.688IleAla: 6.688 ± 0.12
1.347IleCys: 1.347 ± 0.042
4.896IleAsp: 4.896 ± 0.077
6.495IleGlu: 6.495 ± 0.105
4.421IlePhe: 4.421 ± 0.099
5.352IleGly: 5.352 ± 0.095
1.125IleHis: 1.125 ± 0.039
8.206IleIle: 8.206 ± 0.123
8.946IleLys: 8.946 ± 0.109
7.937IleLeu: 7.937 ± 0.103
2.127IleMet: 2.127 ± 0.062
4.579IleAsn: 4.579 ± 0.098
3.342IlePro: 3.342 ± 0.07
1.742IleGln: 1.742 ± 0.053
3.419IleArg: 3.419 ± 0.07
6.821IleSer: 6.821 ± 0.109
4.306IleThr: 4.306 ± 0.082
5.8IleVal: 5.8 ± 0.1
0.543IleTrp: 0.543 ± 0.033
2.948IleTyr: 2.948 ± 0.072
0.0IleXaa: 0.0 ± 0.0
Lys
6.097LysAla: 6.097 ± 0.087
1.002LysCys: 1.002 ± 0.038
5.167LysAsp: 5.167 ± 0.099
7.581LysGlu: 7.581 ± 0.126
3.839LysPhe: 3.839 ± 0.078
4.584LysGly: 4.584 ± 0.093
1.365LysHis: 1.365 ± 0.052
9.908LysIle: 9.908 ± 0.131
10.449LysLys: 10.449 ± 0.129
7.356LysLeu: 7.356 ± 0.108
2.647LysMet: 2.647 ± 0.065
6.58LysAsn: 6.58 ± 0.109
2.496LysPro: 2.496 ± 0.068
2.704LysGln: 2.704 ± 0.071
3.969LysArg: 3.969 ± 0.082
5.795LysSer: 5.795 ± 0.099
5.412LysThr: 5.412 ± 0.093
5.381LysVal: 5.381 ± 0.082
0.61LysTrp: 0.61 ± 0.032
3.852LysTyr: 3.852 ± 0.083
0.001LysXaa: 0.001 ± 0.002
Leu
6.015LeuAla: 6.015 ± 0.094
1.292LeuCys: 1.292 ± 0.045
4.234LeuAsp: 4.234 ± 0.089
5.826LeuGlu: 5.826 ± 0.106
3.927LeuPhe: 3.927 ± 0.085
5.299LeuGly: 5.299 ± 0.088
1.355LeuHis: 1.355 ± 0.045
7.15LeuIle: 7.15 ± 0.116
9.65LeuLys: 9.65 ± 0.122
7.609LeuLeu: 7.609 ± 0.128
2.019LeuMet: 2.019 ± 0.057
5.032LeuAsn: 5.032 ± 0.102
3.236LeuPro: 3.236 ± 0.08
2.207LeuGln: 2.207 ± 0.058
4.03LeuArg: 4.03 ± 0.08
7.297LeuSer: 7.297 ± 0.113
4.243LeuThr: 4.243 ± 0.074
4.611LeuVal: 4.611 ± 0.088
0.677LeuTrp: 0.677 ± 0.03
2.791LeuTyr: 2.791 ± 0.066
0.0LeuXaa: 0.0 ± 0.0
Met
1.754MetAla: 1.754 ± 0.052
0.298MetCys: 0.298 ± 0.024
1.044MetAsp: 1.044 ± 0.043
1.536MetGlu: 1.536 ± 0.056
1.119MetPhe: 1.119 ± 0.041
1.44MetGly: 1.44 ± 0.046
0.449MetHis: 0.449 ± 0.024
1.709MetIle: 1.709 ± 0.048
2.276MetLys: 2.276 ± 0.046
2.518MetLeu: 2.518 ± 0.064
0.473MetMet: 0.473 ± 0.026
1.161MetAsn: 1.161 ± 0.042
1.235MetPro: 1.235 ± 0.045
0.902MetGln: 0.902 ± 0.036
1.095MetArg: 1.095 ± 0.038
1.819MetSer: 1.819 ± 0.054
1.075MetThr: 1.075 ± 0.041
1.347MetVal: 1.347 ± 0.046
0.123MetTrp: 0.123 ± 0.013
0.597MetTyr: 0.597 ± 0.033
0.0MetXaa: 0.0 ± 0.0
Asn
3.49AsnAla: 3.49 ± 0.066
0.715AsnCys: 0.715 ± 0.032
2.159AsnAsp: 2.159 ± 0.057
3.185AsnGlu: 3.185 ± 0.076
2.672AsnPhe: 2.672 ± 0.07
3.009AsnGly: 3.009 ± 0.074
0.625AsnHis: 0.625 ± 0.029
6.052AsnIle: 6.052 ± 0.098
4.654AsnLys: 4.654 ± 0.091
4.703AsnLeu: 4.703 ± 0.094
1.319AsnMet: 1.319 ± 0.045
2.592AsnAsn: 2.592 ± 0.071
2.273AsnPro: 2.273 ± 0.056
1.026AsnGln: 1.026 ± 0.046
2.112AsnArg: 2.112 ± 0.059
3.241AsnSer: 3.241 ± 0.073
2.297AsnThr: 2.297 ± 0.054
3.342AsnVal: 3.342 ± 0.071
0.389AsnTrp: 0.389 ± 0.025
1.947AsnTyr: 1.947 ± 0.058
0.001AsnXaa: 0.001 ± 0.002
Pro
2.173ProAla: 2.173 ± 0.059
0.374ProCys: 0.374 ± 0.024
1.932ProAsp: 1.932 ± 0.048
2.826ProGlu: 2.826 ± 0.064
1.516ProPhe: 1.516 ± 0.048
2.237ProGly: 2.237 ± 0.063
0.634ProHis: 0.634 ± 0.036
1.875ProIle: 1.875 ± 0.052
2.409ProLys: 2.409 ± 0.054
2.779ProLeu: 2.779 ± 0.073
0.573ProMet: 0.573 ± 0.03
1.159ProAsn: 1.159 ± 0.04
0.921ProPro: 0.921 ± 0.043
1.279ProGln: 1.279 ± 0.043
1.017ProArg: 1.017 ± 0.041
2.008ProSer: 2.008 ± 0.056
1.303ProThr: 1.303 ± 0.045
3.013ProVal: 3.013 ± 0.074
0.199ProTrp: 0.199 ± 0.018
1.158ProTyr: 1.158 ± 0.046
0.0ProXaa: 0.0 ± 0.0
Gln
1.694GlnAla: 1.694 ± 0.049
0.277GlnCys: 0.277 ± 0.023
1.059GlnAsp: 1.059 ± 0.039
1.745GlnGlu: 1.745 ± 0.055
1.005GlnPhe: 1.005 ± 0.039
1.246GlnGly: 1.246 ± 0.045
0.401GlnHis: 0.401 ± 0.023
2.412GlnIle: 2.412 ± 0.065
2.738GlnLys: 2.738 ± 0.07
1.842GlnLeu: 1.842 ± 0.056
0.692GlnMet: 0.692 ± 0.039
1.573GlnAsn: 1.573 ± 0.046
0.642GlnPro: 0.642 ± 0.034
0.837GlnGln: 0.837 ± 0.044
1.159GlnArg: 1.159 ± 0.045
1.751GlnSer: 1.751 ± 0.06
1.389GlnThr: 1.389 ± 0.051
1.279GlnVal: 1.279 ± 0.041
0.2GlnTrp: 0.2 ± 0.018
0.936GlnTyr: 0.936 ± 0.042
0.001GlnXaa: 0.001 ± 0.002
Arg
2.339ArgAla: 2.339 ± 0.063
0.618ArgCys: 0.618 ± 0.03
2.02ArgAsp: 2.02 ± 0.057
2.96ArgGlu: 2.96 ± 0.079
1.92ArgPhe: 1.92 ± 0.056
2.441ArgGly: 2.441 ± 0.064
0.829ArgHis: 0.829 ± 0.038
3.899ArgIle: 3.899 ± 0.07
4.231ArgLys: 4.231 ± 0.076
3.731ArgLeu: 3.731 ± 0.072
1.051ArgMet: 1.051 ± 0.039
2.45ArgAsn: 2.45 ± 0.055
1.283ArgPro: 1.283 ± 0.048
1.401ArgGln: 1.401 ± 0.047
1.964ArgArg: 1.964 ± 0.055
2.435ArgSer: 2.435 ± 0.067
2.17ArgThr: 2.17 ± 0.053
2.351ArgVal: 2.351 ± 0.065
0.317ArgTrp: 0.317 ± 0.022
1.51ArgTyr: 1.51 ± 0.047
0.0ArgXaa: 0.0 ± 0.0
Ser
5.15SerAla: 5.15 ± 0.08
0.918SerCys: 0.918 ± 0.043
3.698SerAsp: 3.698 ± 0.081
4.506SerGlu: 4.506 ± 0.097
3.648SerPhe: 3.648 ± 0.087
5.568SerGly: 5.568 ± 0.102
0.917SerHis: 0.917 ± 0.031
5.035SerIle: 5.035 ± 0.107
5.919SerLys: 5.919 ± 0.109
6.17SerLeu: 6.17 ± 0.111
1.37SerMet: 1.37 ± 0.048
2.674SerAsn: 2.674 ± 0.072
1.953SerPro: 1.953 ± 0.06
1.792SerGln: 1.792 ± 0.055
2.792SerArg: 2.792 ± 0.064
4.594SerSer: 4.594 ± 0.09
2.255SerThr: 2.255 ± 0.058
4.989SerVal: 4.989 ± 0.093
0.429SerTrp: 0.429 ± 0.024
2.276SerTyr: 2.276 ± 0.06
0.0SerXaa: 0.0 ± 0.0
Thr
4.31ThrAla: 4.31 ± 0.075
0.458ThrCys: 0.458 ± 0.028
2.672ThrAsp: 2.672 ± 0.058
2.835ThrGlu: 2.835 ± 0.063
1.983ThrPhe: 1.983 ± 0.053
4.056ThrGly: 4.056 ± 0.076
0.742ThrHis: 0.742 ± 0.034
3.438ThrIle: 3.438 ± 0.077
3.709ThrLys: 3.709 ± 0.076
4.149ThrLeu: 4.149 ± 0.079
1.039ThrMet: 1.039 ± 0.039
2.031ThrAsn: 2.031 ± 0.055
1.697ThrPro: 1.697 ± 0.054
1.092ThrGln: 1.092 ± 0.039
1.554ThrArg: 1.554 ± 0.046
2.753ThrSer: 2.753 ± 0.064
2.089ThrThr: 2.089 ± 0.063
4.28ThrVal: 4.28 ± 0.084
0.221ThrTrp: 0.221 ± 0.018
1.313ThrTyr: 1.313 ± 0.047
0.0ThrXaa: 0.0 ± 0.0
Val
4.282ValAla: 4.282 ± 0.062
0.965ValCys: 0.965 ± 0.038
3.118ValAsp: 3.118 ± 0.074
4.436ValGlu: 4.436 ± 0.091
3.477ValPhe: 3.477 ± 0.084
3.703ValGly: 3.703 ± 0.074
0.983ValHis: 0.983 ± 0.036
5.529ValIle: 5.529 ± 0.108
6.573ValLys: 6.573 ± 0.106
6.484ValLeu: 6.484 ± 0.103
1.642ValMet: 1.642 ± 0.054
3.124ValAsn: 3.124 ± 0.067
2.382ValPro: 2.382 ± 0.061
1.401ValGln: 1.401 ± 0.053
2.641ValArg: 2.641 ± 0.068
5.173ValSer: 5.173 ± 0.1
2.725ValThr: 2.725 ± 0.071
4.331ValVal: 4.331 ± 0.091
0.488ValTrp: 0.488 ± 0.026
2.297ValTyr: 2.297 ± 0.058
0.0ValXaa: 0.0 ± 0.0
Trp
0.36TrpAla: 0.36 ± 0.023
0.111TrpCys: 0.111 ± 0.016
0.356TrpAsp: 0.356 ± 0.025
0.502TrpGlu: 0.502 ± 0.028
0.393TrpPhe: 0.393 ± 0.028
0.384TrpGly: 0.384 ± 0.023
0.17TrpHis: 0.17 ± 0.016
0.664TrpIle: 0.664 ± 0.036
0.594TrpLys: 0.594 ± 0.031
0.577TrpLeu: 0.577 ± 0.035
0.206TrpMet: 0.206 ± 0.017
0.446TrpAsn: 0.446 ± 0.027
0.141TrpPro: 0.141 ± 0.014
0.254TrpGln: 0.254 ± 0.019
0.353TrpArg: 0.353 ± 0.023
0.351TrpSer: 0.351 ± 0.021
0.344TrpThr: 0.344 ± 0.023
0.416TrpVal: 0.416 ± 0.026
0.07TrpTrp: 0.07 ± 0.011
0.187TrpTyr: 0.187 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.333TyrAla: 2.333 ± 0.061
0.511TyrCys: 0.511 ± 0.029
1.859TyrAsp: 1.859 ± 0.051
2.249TyrGlu: 2.249 ± 0.058
1.86TyrPhe: 1.86 ± 0.057
2.379TyrGly: 2.379 ± 0.067
0.461TyrHis: 0.461 ± 0.024
3.154TyrIle: 3.154 ± 0.069
3.23TyrLys: 3.23 ± 0.064
3.184TyrLeu: 3.184 ± 0.069
0.743TyrMet: 0.743 ± 0.033
1.866TyrAsn: 1.866 ± 0.057
1.309TyrPro: 1.309 ± 0.043
0.751TyrGln: 0.751 ± 0.037
1.63TyrArg: 1.63 ± 0.045
2.444TyrSer: 2.444 ± 0.063
1.418TyrThr: 1.418 ± 0.043
1.988TyrVal: 1.988 ± 0.051
0.301TyrTrp: 0.301 ± 0.024
1.418TyrTyr: 1.418 ± 0.053
0.003TyrXaa: 0.003 ± 0.003
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.002
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.004XaaLys: 0.004 ± 0.005
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.001XaaVal: 0.001 ± 0.002
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.002
0.001XaaXaa: 0.001 ± 0.002
Statistics based on 2732 proteins (668676 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski