Amino acid dipeptide frequency for Bathymodiolus platifrons methanotrophic gill symbiont

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. the frequency of each ordered pair of adjacent amino acids.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
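The comparison described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the function names `dipeptide_permille` and `classify` are hypothetical, sequences are assumed to use one-letter codes, and counting overlapping adjacent pairs within each protein is an assumption about how the frequencies are defined.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
UNIFORM_EXPECTED = 1000.0 / 400       # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for the 400 standard dipeptides.

    Assumption: overlapping adjacent pairs are counted within each sequence,
    and pairs never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2 residues)")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > UNIFORM_EXPECTED:
        return "over"    # would be shown in red
    if freq_permille < UNIFORM_EXPECTED:
        return "under"   # would be shown in blue
    return "expected"


# Toy input, not real proteome data: pairs are AA, AA, AC, CA.
toy = dipeptide_permille(["AAAC", "CA"])
print(toy["AA"], classify(toy["AA"]))
```

With the values from the table, `classify(6.42)` (AlaAla) gives "over" and `classify(0.954)` (AlaCys) gives "under".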

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.42 ± 0.108
AlaCys: 0.954 ± 0.034
AlaAsp: 4.375 ± 0.077
AlaGlu: 5.507 ± 0.088
AlaPhe: 2.991 ± 0.062
AlaGly: 5.662 ± 0.092
AlaHis: 1.695 ± 0.041
AlaIle: 5.976 ± 0.081
AlaLys: 4.914 ± 0.072
AlaLeu: 8.458 ± 0.117
AlaMet: 2.178 ± 0.052
AlaAsn: 3.303 ± 0.062
AlaPro: 2.435 ± 0.06
AlaGln: 3.386 ± 0.061
AlaArg: 3.498 ± 0.062
AlaSer: 4.589 ± 0.073
AlaThr: 4.009 ± 0.073
AlaVal: 5.226 ± 0.081
AlaTrp: 0.941 ± 0.03
AlaTyr: 2.345 ± 0.053
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.815 ± 0.027
CysCys: 0.172 ± 0.014
CysAsp: 0.644 ± 0.027
CysGlu: 0.604 ± 0.026
CysPhe: 0.508 ± 0.022
CysGly: 0.777 ± 0.032
CysHis: 0.346 ± 0.022
CysIle: 0.71 ± 0.027
CysLys: 0.547 ± 0.022
CysLeu: 1.058 ± 0.037
CysMet: 0.227 ± 0.017
CysAsn: 0.368 ± 0.021
CysPro: 0.52 ± 0.025
CysGln: 0.448 ± 0.024
CysArg: 0.44 ± 0.022
CysSer: 0.72 ± 0.027
CysThr: 0.534 ± 0.027
CysVal: 0.664 ± 0.025
CysTrp: 0.111 ± 0.014
CysTyr: 0.367 ± 0.019
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.181 ± 0.076
AspCys: 0.596 ± 0.024
AspAsp: 3.029 ± 0.061
AspGlu: 3.847 ± 0.075
AspPhe: 2.888 ± 0.063
AspGly: 3.486 ± 0.076
AspHis: 1.05 ± 0.035
AspIle: 4.564 ± 0.068
AspLys: 3.962 ± 0.073
AspLeu: 5.594 ± 0.071
AspMet: 1.27 ± 0.041
AspAsn: 2.63 ± 0.058
AspPro: 2.025 ± 0.047
AspGln: 1.903 ± 0.048
AspArg: 2.245 ± 0.049
AspSer: 3.481 ± 0.069
AspThr: 2.833 ± 0.057
AspVal: 3.476 ± 0.066
AspTrp: 0.898 ± 0.029
AspTyr: 2.211 ± 0.052
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.623 ± 0.08
GluCys: 0.549 ± 0.026
GluAsp: 3.093 ± 0.064
GluGlu: 4.12 ± 0.084
GluPhe: 2.529 ± 0.051
GluGly: 3.561 ± 0.066
GluHis: 1.641 ± 0.049
GluIle: 4.91 ± 0.07
GluLys: 4.619 ± 0.087
GluLeu: 6.924 ± 0.096
GluMet: 1.643 ± 0.041
GluAsn: 3.114 ± 0.063
GluPro: 1.855 ± 0.045
GluGln: 4.036 ± 0.08
GluArg: 3.086 ± 0.065
GluSer: 3.783 ± 0.07
GluThr: 3.4 ± 0.06
GluVal: 4.136 ± 0.083
GluTrp: 0.707 ± 0.027
GluTyr: 2.049 ± 0.048
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.997 ± 0.066
PheCys: 0.55 ± 0.024
PheAsp: 2.621 ± 0.05
PheGlu: 2.308 ± 0.057
PhePhe: 2.03 ± 0.059
PheGly: 2.587 ± 0.069
PheHis: 0.84 ± 0.03
PheIle: 3.271 ± 0.068
PheLys: 2.609 ± 0.059
PheLeu: 3.781 ± 0.08
PheMet: 1.062 ± 0.041
PheAsn: 2.229 ± 0.047
PhePro: 1.667 ± 0.041
PheGln: 1.437 ± 0.044
PheArg: 1.629 ± 0.045
PheSer: 3.583 ± 0.062
PheThr: 2.274 ± 0.05
PheVal: 2.505 ± 0.053
PheTrp: 0.546 ± 0.026
PheTyr: 1.473 ± 0.044
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.623 ± 0.086
GlyCys: 0.883 ± 0.033
GlyAsp: 3.545 ± 0.078
GlyGlu: 3.981 ± 0.066
GlyPhe: 3.098 ± 0.059
GlyGly: 4.626 ± 0.099
GlyHis: 1.47 ± 0.042
GlyIle: 4.781 ± 0.082
GlyLys: 4.535 ± 0.078
GlyLeu: 6.58 ± 0.091
GlyMet: 1.847 ± 0.053
GlyAsn: 2.743 ± 0.064
GlyPro: 1.387 ± 0.043
GlyGln: 2.388 ± 0.053
GlyArg: 3.166 ± 0.07
GlySer: 4.025 ± 0.072
GlyThr: 3.235 ± 0.071
GlyVal: 4.6 ± 0.081
GlyTrp: 0.893 ± 0.034
GlyTyr: 2.378 ± 0.059
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.681 ± 0.043
HisCys: 0.335 ± 0.022
HisAsp: 1.23 ± 0.036
HisGlu: 1.177 ± 0.038
HisPhe: 1.093 ± 0.038
HisGly: 1.538 ± 0.041
HisHis: 0.567 ± 0.025
HisIle: 1.571 ± 0.04
HisLys: 1.37 ± 0.042
HisLeu: 2.279 ± 0.052
HisMet: 0.466 ± 0.021
HisAsn: 0.977 ± 0.035
HisPro: 1.117 ± 0.037
HisGln: 0.999 ± 0.032
HisArg: 1.068 ± 0.035
HisSer: 1.431 ± 0.039
HisThr: 1.097 ± 0.036
HisVal: 1.279 ± 0.039
HisTrp: 0.354 ± 0.019
HisTyr: 0.959 ± 0.033
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.321 ± 0.085
IleCys: 0.673 ± 0.026
IleAsp: 4.752 ± 0.073
IleGlu: 5.18 ± 0.084
IlePhe: 2.731 ± 0.064
IleGly: 4.39 ± 0.076
IleHis: 1.475 ± 0.041
IleIle: 5.065 ± 0.081
IleLys: 4.839 ± 0.083
IleLeu: 6.254 ± 0.086
IleMet: 1.428 ± 0.041
IleAsn: 3.815 ± 0.078
IlePro: 3.131 ± 0.057
IleGln: 2.707 ± 0.058
IleArg: 3.095 ± 0.053
IleSer: 5.284 ± 0.075
IleThr: 3.925 ± 0.067
IleVal: 4.256 ± 0.062
IleTrp: 0.648 ± 0.029
IleTyr: 2.236 ± 0.047
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.931 ± 0.084
LysCys: 0.452 ± 0.025
LysAsp: 3.402 ± 0.065
LysGlu: 4.133 ± 0.074
LysPhe: 1.919 ± 0.049
LysGly: 3.908 ± 0.079
LysHis: 1.557 ± 0.037
LysIle: 4.654 ± 0.075
LysLys: 4.755 ± 0.102
LysLeu: 5.939 ± 0.089
LysMet: 1.557 ± 0.045
LysAsn: 3.435 ± 0.067
LysPro: 2.494 ± 0.058
LysGln: 3.633 ± 0.065
LysArg: 3.016 ± 0.055
LysSer: 4.033 ± 0.072
LysThr: 3.782 ± 0.07
LysVal: 3.936 ± 0.062
LysTrp: 0.576 ± 0.028
LysTyr: 2.003 ± 0.051
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.799 ± 0.108
LeuCys: 1.107 ± 0.041
LeuAsp: 5.657 ± 0.082
LeuGlu: 6.215 ± 0.105
LeuPhe: 4.253 ± 0.075
LeuGly: 6.375 ± 0.098
LeuHis: 2.133 ± 0.05
LeuIle: 6.88 ± 0.086
LeuLys: 6.454 ± 0.085
LeuLeu: 10.377 ± 0.154
LeuMet: 2.446 ± 0.058
LeuAsn: 4.877 ± 0.082
LeuPro: 4.454 ± 0.081
LeuGln: 4.194 ± 0.079
LeuArg: 4.303 ± 0.064
LeuSer: 7.478 ± 0.109
LeuThr: 5.438 ± 0.082
LeuVal: 6.011 ± 0.084
LeuTrp: 1.07 ± 0.037
LeuTyr: 2.815 ± 0.058
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.199 ± 0.053
MetCys: 0.196 ± 0.013
MetAsp: 1.216 ± 0.036
MetGlu: 1.323 ± 0.038
MetPhe: 0.755 ± 0.028
MetGly: 1.586 ± 0.051
MetHis: 0.544 ± 0.025
MetIle: 1.582 ± 0.037
MetLys: 1.435 ± 0.046
MetLeu: 2.507 ± 0.053
MetMet: 0.659 ± 0.029
MetAsn: 1.106 ± 0.034
MetPro: 1.173 ± 0.037
MetGln: 1.337 ± 0.043
MetArg: 1.156 ± 0.04
MetSer: 1.661 ± 0.045
MetThr: 1.445 ± 0.036
MetVal: 1.42 ± 0.041
MetTrp: 0.174 ± 0.012
MetTyr: 0.55 ± 0.027
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.378 ± 0.065
AsnCys: 0.459 ± 0.021
AsnAsp: 2.401 ± 0.056
AsnGlu: 2.841 ± 0.058
AsnPhe: 1.817 ± 0.047
AsnGly: 2.895 ± 0.062
AsnHis: 1.045 ± 0.04
AsnIle: 3.708 ± 0.066
AsnLys: 3.293 ± 0.062
AsnLeu: 4.136 ± 0.071
AsnMet: 1.037 ± 0.03
AsnAsn: 2.512 ± 0.061
AsnPro: 2.246 ± 0.047
AsnGln: 2.022 ± 0.055
AsnArg: 1.992 ± 0.059
AsnSer: 3.135 ± 0.065
AsnThr: 2.626 ± 0.068
AsnVal: 2.475 ± 0.052
AsnTrp: 0.569 ± 0.024
AsnTyr: 1.597 ± 0.044
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.992 ± 0.055
ProCys: 0.358 ± 0.019
ProAsp: 2.606 ± 0.055
ProGlu: 3.482 ± 0.072
ProPhe: 1.624 ± 0.038
ProGly: 2.392 ± 0.056
ProHis: 0.783 ± 0.029
ProIle: 2.565 ± 0.054
ProLys: 2.264 ± 0.06
ProLeu: 3.65 ± 0.066
ProMet: 0.818 ± 0.029
ProAsn: 1.475 ± 0.043
ProPro: 1.242 ± 0.041
ProGln: 1.279 ± 0.04
ProArg: 1.376 ± 0.043
ProSer: 2.236 ± 0.056
ProThr: 1.78 ± 0.046
ProVal: 3.031 ± 0.053
ProTrp: 0.43 ± 0.026
ProTyr: 1.271 ± 0.038
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.14 ± 0.078
GlnCys: 0.433 ± 0.021
GlnAsp: 2.236 ± 0.047
GlnGlu: 2.745 ± 0.069
GlnPhe: 1.661 ± 0.046
GlnGly: 2.799 ± 0.059
GlnHis: 1.215 ± 0.036
GlnIle: 2.78 ± 0.058
GlnLys: 2.605 ± 0.061
GlnLeu: 4.801 ± 0.095
GlnMet: 1.056 ± 0.035
GlnAsn: 1.703 ± 0.043
GlnPro: 1.424 ± 0.036
GlnGln: 2.948 ± 0.079
GlnArg: 2.21 ± 0.049
GlnSer: 2.496 ± 0.054
GlnThr: 2.247 ± 0.054
GlnVal: 2.868 ± 0.065
GlnTrp: 0.646 ± 0.027
GlnTyr: 1.408 ± 0.042
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.157 ± 0.069
ArgCys: 0.492 ± 0.023
ArgAsp: 2.306 ± 0.047
ArgGlu: 2.885 ± 0.056
ArgPhe: 2.096 ± 0.047
ArgGly: 2.581 ± 0.055
ArgHis: 1.124 ± 0.04
ArgIle: 3.213 ± 0.055
ArgLys: 2.973 ± 0.058
ArgLeu: 4.78 ± 0.078
ArgMet: 1.102 ± 0.039
ArgAsn: 2.095 ± 0.048
ArgPro: 1.491 ± 0.043
ArgGln: 2.087 ± 0.059
ArgArg: 2.151 ± 0.051
ArgSer: 2.542 ± 0.058
ArgThr: 2.04 ± 0.048
ArgVal: 2.912 ± 0.057
ArgTrp: 0.678 ± 0.029
ArgTyr: 1.82 ± 0.045
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.118 ± 0.073
SerCys: 0.707 ± 0.031
SerAsp: 3.721 ± 0.068
SerGlu: 4.022 ± 0.07
SerPhe: 2.986 ± 0.06
SerGly: 5.028 ± 0.079
SerHis: 1.432 ± 0.04
SerIle: 4.735 ± 0.071
SerLys: 3.611 ± 0.075
SerLeu: 6.949 ± 0.089
SerMet: 1.616 ± 0.043
SerAsn: 2.881 ± 0.061
SerPro: 2.463 ± 0.05
SerGln: 2.466 ± 0.054
SerArg: 2.875 ± 0.052
SerSer: 4.607 ± 0.078
SerThr: 3.26 ± 0.059
SerVal: 4.194 ± 0.068
SerTrp: 0.835 ± 0.03
SerTyr: 2.168 ± 0.059
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.231 ± 0.071
ThrCys: 0.419 ± 0.022
ThrAsp: 3.03 ± 0.06
ThrGlu: 3.595 ± 0.065
ThrPhe: 2.024 ± 0.046
ThrGly: 4.069 ± 0.063
ThrHis: 1.252 ± 0.038
ThrIle: 3.566 ± 0.062
ThrLys: 2.878 ± 0.052
ThrLeu: 5.861 ± 0.088
ThrMet: 1.014 ± 0.033
ThrAsn: 2.128 ± 0.055
ThrPro: 2.437 ± 0.057
ThrGln: 2.257 ± 0.053
ThrArg: 2.236 ± 0.048
ThrSer: 3.1 ± 0.067
ThrThr: 2.775 ± 0.06
ThrVal: 3.581 ± 0.062
ThrTrp: 0.565 ± 0.024
ThrTyr: 1.515 ± 0.048
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.059 ± 0.089
ValCys: 0.732 ± 0.029
ValAsp: 3.8 ± 0.07
ValGlu: 4.011 ± 0.076
ValPhe: 2.85 ± 0.057
ValGly: 3.791 ± 0.07
ValHis: 1.261 ± 0.038
ValIle: 4.857 ± 0.078
ValLys: 3.867 ± 0.065
ValLeu: 6.393 ± 0.102
ValMet: 1.631 ± 0.043
ValAsn: 2.917 ± 0.059
ValPro: 2.277 ± 0.046
ValGln: 2.307 ± 0.055
ValArg: 2.66 ± 0.063
ValSer: 4.558 ± 0.073
ValThr: 3.586 ± 0.071
ValVal: 4.2 ± 0.082
ValTrp: 0.73 ± 0.031
ValTyr: 1.88 ± 0.047
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.788 ± 0.031
TrpCys: 0.128 ± 0.011
TrpAsp: 0.651 ± 0.03
TrpGlu: 0.658 ± 0.027
TrpPhe: 0.516 ± 0.025
TrpGly: 0.762 ± 0.031
TrpHis: 0.341 ± 0.02
TrpIle: 0.74 ± 0.028
TrpLys: 0.603 ± 0.031
TrpLeu: 1.51 ± 0.045
TrpMet: 0.317 ± 0.018
TrpAsn: 0.528 ± 0.025
TrpPro: 0.366 ± 0.018
TrpGln: 0.766 ± 0.032
TrpArg: 0.629 ± 0.025
TrpSer: 0.776 ± 0.032
TrpThr: 0.508 ± 0.028
TrpVal: 0.867 ± 0.033
TrpTrp: 0.184 ± 0.017
TrpTyr: 0.405 ± 0.023
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.411 ± 0.053
TyrCys: 0.383 ± 0.018
TyrAsp: 1.855 ± 0.062
TyrGlu: 1.744 ± 0.05
TyrPhe: 1.633 ± 0.043
TyrGly: 1.97 ± 0.043
TyrHis: 0.82 ± 0.027
TyrIle: 2.076 ± 0.046
TyrLys: 1.98 ± 0.055
TyrLeu: 3.506 ± 0.062
TyrMet: 0.65 ± 0.025
TyrAsn: 1.348 ± 0.039
TyrPro: 1.448 ± 0.046
TyrGln: 1.819 ± 0.05
TyrArg: 1.695 ± 0.048
TyrSer: 2.155 ± 0.056
TyrThr: 1.739 ± 0.054
TyrVal: 1.729 ± 0.043
TyrTrp: 0.49 ± 0.023
TyrTyr: 1.094 ± 0.041
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3127 proteins (912154 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
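A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: the function names `pair_counts` and `bootstrap_error` are hypothetical, whole proteins are resampled with replacement, and the reported "±" is assumed to be the standard deviation of the replicate estimates, which the page does not state explicitly.

```python
import random
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement (protein-level bootstrap).
    Assumption: the error is the sample standard deviation over replicates.
    """
    if n_boot < 2:
        raise ValueError("n_boot must be at least 2")
    rng = random.Random(seed)
    # Pre-compute (hits, total pairs) per protein so each replicate is cheap.
    per_protein = []
    for seq in sequences:
        counts = pair_counts(seq)
        per_protein.append((counts[dipeptide], sum(counts.values())))
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    mean = sum(estimates) / n_boot
    variance = sum((e - mean) ** 2 for e in estimates) / (n_boot - 1)
    return mean, variance ** 0.5


# Toy input, not real proteome data.
mean, err = bootstrap_error(["MKLLA", "AALLK", "LLLL", "MAAK"], "LL")
print(f"LeuLeu: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the spread reflects variation between proteins.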

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski