Amino acid dipeptide frequency for Solibacter usitatus (strain Ellin6076)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred uniformly at random in proteins, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
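The calculation described above can be sketched as follows. This is a minimal illustration, not the database's actual pipeline: it assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that sequences use one-letter codes, and that any non-standard letter is mapped to `X` (Xaa). The function name `dipeptide_per_mille` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; anything else becomes "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs (0,1), (1,2), ...; pairs never span two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }

# Toy input, not real Solibacter usitatus sequences.
toy_proteins = ["MAAL", "MALG"]
freqs = dipeptide_per_mille(toy_proteins)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25 %).
uniform_expectation = 1000.0 / 400
for pair in ("AL", "AA", "GG"):
    status = "over" if freqs[pair] > uniform_expectation else "under"
    print(pair, round(freqs[pair], 1), status + "represented")
```

Comparing each value with the uniform expectation of 2.5‰ mirrors the red/blue marking described above, under the assumption that "expected" refers to that uniform value.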

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
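As a small worked example of this unit conversion, the sketch below turns two values copied from the table (AlaAla and CysCys) into fractions and percentages and compares them with the uniform expectation of 1/400. The helper name `per_mille_to_fraction` is made up for illustration.

```python
PER_MILLE = 1e-3  # one per mille expressed as a fraction

def per_mille_to_fraction(value):
    """Convert a per mille value into a plain fraction."""
    return value * PER_MILLE

# Values copied from the table on this page (in per mille).
table_values = {"AlaAla": 15.843, "CysCys": 0.162}
uniform = 1.0 / 400  # 0.0025, i.e. 0.25 % or 2.5 per mille

for name, value in table_values.items():
    fraction = per_mille_to_fraction(value)
    print(f"{name}: {fraction:.6f} ({fraction * 100:.4f} %), "
          f"{fraction / uniform:.2f}x the uniform expectation")
```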

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.843 ± 0.107
AlaCys: 1.084 ± 0.022
AlaAsp: 5.252 ± 0.048
AlaGlu: 5.938 ± 0.061
AlaPhe: 4.071 ± 0.04
AlaGly: 10.829 ± 0.077
AlaHis: 2.031 ± 0.028
AlaIle: 5.841 ± 0.048
AlaLys: 3.656 ± 0.043
AlaLeu: 11.189 ± 0.077
AlaMet: 2.697 ± 0.03
AlaAsn: 3.514 ± 0.042
AlaPro: 5.804 ± 0.06
AlaGln: 4.101 ± 0.039
AlaArg: 7.364 ± 0.065
AlaSer: 6.348 ± 0.061
AlaThr: 6.019 ± 0.071
AlaVal: 8.206 ± 0.059
AlaTrp: 1.531 ± 0.028
AlaTyr: 2.599 ± 0.03
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.068 ± 0.019
CysCys: 0.162 ± 0.008
CysAsp: 0.461 ± 0.016
CysGlu: 0.446 ± 0.013
CysPhe: 0.335 ± 0.011
CysGly: 0.956 ± 0.022
CysHis: 0.319 ± 0.019
CysIle: 0.37 ± 0.012
CysLys: 0.223 ± 0.008
CysLeu: 0.86 ± 0.019
CysMet: 0.192 ± 0.008
CysAsn: 0.236 ± 0.009
CysPro: 0.459 ± 0.014
CysGln: 0.226 ± 0.009
CysArg: 0.607 ± 0.016
CysSer: 0.552 ± 0.015
CysThr: 0.486 ± 0.014
CysVal: 0.637 ± 0.014
CysTrp: 0.133 ± 0.007
CysTyr: 0.264 ± 0.009
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.417 ± 0.041
AspCys: 0.408 ± 0.013
AspAsp: 2.228 ± 0.031
AspGlu: 2.552 ± 0.032
AspPhe: 2.21 ± 0.028
AspGly: 4.571 ± 0.05
AspHis: 1.068 ± 0.018
AspIle: 2.25 ± 0.027
AspLys: 1.454 ± 0.022
AspLeu: 5.186 ± 0.048
AspMet: 0.928 ± 0.018
AspAsn: 1.392 ± 0.024
AspPro: 3.693 ± 0.038
AspGln: 1.691 ± 0.028
AspArg: 3.721 ± 0.038
AspSer: 2.759 ± 0.031
AspThr: 2.407 ± 0.032
AspVal: 3.486 ± 0.031
AspTrp: 0.85 ± 0.021
AspTyr: 1.579 ± 0.026
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.403 ± 0.06
GluCys: 0.426 ± 0.013
GluAsp: 2.231 ± 0.032
GluGlu: 2.835 ± 0.042
GluPhe: 2.256 ± 0.03
GluGly: 3.658 ± 0.04
GluHis: 1.166 ± 0.021
GluIle: 3.214 ± 0.038
GluLys: 2.289 ± 0.032
GluLeu: 5.315 ± 0.051
GluMet: 1.425 ± 0.021
GluAsn: 1.611 ± 0.023
GluPro: 2.46 ± 0.035
GluGln: 2.127 ± 0.028
GluArg: 4.081 ± 0.045
GluSer: 2.949 ± 0.033
GluThr: 2.792 ± 0.031
GluVal: 3.667 ± 0.042
GluTrp: 0.828 ± 0.017
GluTyr: 1.436 ± 0.025
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.392 ± 0.046
PheCys: 0.396 ± 0.013
PheAsp: 2.453 ± 0.031
PheGlu: 2.128 ± 0.031
PhePhe: 1.686 ± 0.029
PheGly: 3.771 ± 0.042
PheHis: 0.96 ± 0.02
PheIle: 1.352 ± 0.021
PheLys: 1.001 ± 0.019
PheLeu: 3.756 ± 0.043
PheMet: 0.638 ± 0.014
PheAsn: 1.462 ± 0.029
PhePro: 1.984 ± 0.028
PheGln: 1.5 ± 0.025
PheArg: 2.651 ± 0.034
PheSer: 2.699 ± 0.036
PheThr: 2.669 ± 0.039
PheVal: 2.678 ± 0.031
PheTrp: 0.574 ± 0.014
PheTyr: 1.202 ± 0.021
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.859 ± 0.063
GlyCys: 0.871 ± 0.018
GlyAsp: 4.113 ± 0.043
GlyGlu: 4.16 ± 0.044
GlyPhe: 3.588 ± 0.036
GlyGly: 7.808 ± 0.083
GlyHis: 1.567 ± 0.025
GlyIle: 4.542 ± 0.038
GlyLys: 3.823 ± 0.045
GlyLeu: 7.663 ± 0.066
GlyMet: 2.142 ± 0.03
GlyAsn: 3.047 ± 0.044
GlyPro: 3.657 ± 0.04
GlyGln: 2.951 ± 0.041
GlyArg: 5.656 ± 0.046
GlySer: 5.45 ± 0.062
GlyThr: 5.419 ± 0.077
GlyVal: 6.351 ± 0.056
GlyTrp: 1.403 ± 0.025
GlyTyr: 2.479 ± 0.032
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.027 ± 0.024
HisCys: 0.219 ± 0.009
HisAsp: 1.032 ± 0.021
HisGlu: 1.047 ± 0.02
HisPhe: 0.913 ± 0.019
HisGly: 1.816 ± 0.027
HisHis: 0.504 ± 0.017
HisIle: 0.919 ± 0.02
HisLys: 0.508 ± 0.014
HisLeu: 1.978 ± 0.028
HisMet: 0.433 ± 0.01
HisAsn: 0.627 ± 0.014
HisPro: 1.324 ± 0.023
HisGln: 0.653 ± 0.015
HisArg: 1.38 ± 0.023
HisSer: 1.18 ± 0.02
HisThr: 1.043 ± 0.021
HisVal: 1.4 ± 0.025
HisTrp: 0.353 ± 0.012
HisTyr: 0.64 ± 0.016
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.088 ± 0.052
IleCys: 0.485 ± 0.011
IleAsp: 2.88 ± 0.034
IleGlu: 2.762 ± 0.036
IlePhe: 1.8 ± 0.022
IleGly: 4.151 ± 0.038
IleHis: 1.003 ± 0.021
IleIle: 1.591 ± 0.028
IleLys: 1.313 ± 0.02
IleLeu: 4.387 ± 0.051
IleMet: 0.697 ± 0.018
IleAsn: 1.505 ± 0.029
IlePro: 2.815 ± 0.034
IleGln: 1.661 ± 0.027
IleArg: 3.233 ± 0.035
IleSer: 3.022 ± 0.039
IleThr: 2.976 ± 0.041
IleVal: 3.609 ± 0.036
IleTrp: 0.603 ± 0.014
IleTyr: 1.334 ± 0.023
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.508 ± 0.041
LysCys: 0.214 ± 0.01
LysAsp: 1.774 ± 0.032
LysGlu: 1.76 ± 0.03
LysPhe: 1.249 ± 0.023
LysGly: 2.485 ± 0.036
LysHis: 0.622 ± 0.015
LysIle: 1.783 ± 0.025
LysLys: 1.49 ± 0.031
LysLeu: 3.661 ± 0.037
LysMet: 1.022 ± 0.022
LysAsn: 1.147 ± 0.022
LysPro: 2.175 ± 0.035
LysGln: 1.296 ± 0.023
LysArg: 2.196 ± 0.027
LysSer: 2.024 ± 0.028
LysThr: 2.067 ± 0.025
LysVal: 2.63 ± 0.031
LysTrp: 0.489 ± 0.013
LysTyr: 0.921 ± 0.019
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.248 ± 0.079
LeuCys: 0.914 ± 0.019
LeuAsp: 5.149 ± 0.038
LeuGlu: 5.227 ± 0.051
LeuPhe: 3.624 ± 0.04
LeuGly: 7.664 ± 0.058
LeuHis: 1.967 ± 0.03
LeuIle: 4.145 ± 0.042
LeuLys: 3.611 ± 0.041
LeuLeu: 9.966 ± 0.092
LeuMet: 2.019 ± 0.028
LeuAsn: 3.168 ± 0.038
LeuPro: 5.653 ± 0.049
LeuGln: 3.236 ± 0.034
LeuArg: 7.083 ± 0.073
LeuSer: 5.916 ± 0.052
LeuThr: 6.018 ± 0.068
LeuVal: 6.684 ± 0.061
LeuTrp: 1.327 ± 0.021
LeuTyr: 2.51 ± 0.029
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.572 ± 0.035
MetCys: 0.152 ± 0.007
MetAsp: 0.99 ± 0.018
MetGlu: 1.155 ± 0.021
MetPhe: 0.698 ± 0.015
MetGly: 1.616 ± 0.027
MetHis: 0.423 ± 0.013
MetIle: 1.094 ± 0.021
MetLys: 1.094 ± 0.019
MetLeu: 2.259 ± 0.028
MetMet: 0.558 ± 0.013
MetAsn: 0.766 ± 0.017
MetPro: 1.36 ± 0.023
MetGln: 0.806 ± 0.017
MetArg: 1.757 ± 0.023
MetSer: 1.322 ± 0.023
MetThr: 1.409 ± 0.026
MetVal: 1.537 ± 0.026
MetTrp: 0.249 ± 0.009
MetTyr: 0.449 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.45 ± 0.046
AsnCys: 0.323 ± 0.012
AsnAsp: 1.43 ± 0.026
AsnGlu: 1.315 ± 0.02
AsnPhe: 1.541 ± 0.027
AsnGly: 3.154 ± 0.044
AsnHis: 0.712 ± 0.017
AsnIle: 1.594 ± 0.027
AsnLys: 0.819 ± 0.021
AsnLeu: 3.441 ± 0.037
AsnMet: 0.635 ± 0.014
AsnAsn: 1.295 ± 0.032
AsnPro: 2.444 ± 0.034
AsnGln: 1.27 ± 0.024
AsnArg: 2.124 ± 0.027
AsnSer: 2.084 ± 0.036
AsnThr: 1.839 ± 0.032
AsnVal: 2.406 ± 0.033
AsnTrp: 0.564 ± 0.013
AsnTyr: 1.056 ± 0.023
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.253 ± 0.065
ProCys: 0.361 ± 0.013
ProAsp: 3.358 ± 0.037
ProGlu: 3.627 ± 0.04
ProPhe: 2.004 ± 0.027
ProGly: 5.457 ± 0.05
ProHis: 1.013 ± 0.017
ProIle: 2.211 ± 0.028
ProLys: 1.873 ± 0.032
ProLeu: 4.894 ± 0.042
ProMet: 1.147 ± 0.021
ProAsn: 2.018 ± 0.031
ProPro: 3.176 ± 0.05
ProGln: 1.877 ± 0.029
ProArg: 3.067 ± 0.036
ProSer: 3.073 ± 0.038
ProThr: 2.767 ± 0.035
ProVal: 4.54 ± 0.046
ProTrp: 0.73 ± 0.015
ProTyr: 1.466 ± 0.027
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.888 ± 0.037
GlnCys: 0.264 ± 0.012
GlnAsp: 1.516 ± 0.025
GlnGlu: 1.656 ± 0.027
GlnPhe: 1.511 ± 0.025
GlnGly: 2.512 ± 0.031
GlnHis: 0.695 ± 0.015
GlnIle: 1.972 ± 0.026
GlnLys: 1.284 ± 0.022
GlnLeu: 3.404 ± 0.041
GlnMet: 0.929 ± 0.017
GlnAsn: 1.19 ± 0.023
GlnPro: 2.078 ± 0.029
GlnGln: 1.664 ± 0.034
GlnArg: 2.46 ± 0.028
GlnSer: 2.159 ± 0.027
GlnThr: 2.121 ± 0.028
GlnVal: 2.725 ± 0.028
GlnTrp: 0.532 ± 0.014
GlnTyr: 1.064 ± 0.02
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.875 ± 0.062
ArgCys: 0.57 ± 0.016
ArgAsp: 3.517 ± 0.04
ArgGlu: 4.234 ± 0.053
ArgPhe: 2.934 ± 0.035
ArgGly: 4.763 ± 0.04
ArgHis: 1.426 ± 0.023
ArgIle: 3.781 ± 0.04
ArgLys: 2.653 ± 0.033
ArgLeu: 6.82 ± 0.06
ArgMet: 1.901 ± 0.029
ArgAsn: 2.309 ± 0.032
ArgPro: 3.351 ± 0.034
ArgGln: 2.555 ± 0.032
ArgArg: 5.272 ± 0.059
ArgSer: 3.688 ± 0.036
ArgThr: 3.619 ± 0.038
ArgVal: 4.872 ± 0.046
ArgTrp: 1.066 ± 0.02
ArgTyr: 2.135 ± 0.025
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.61 ± 0.056
SerCys: 0.542 ± 0.017
SerAsp: 2.817 ± 0.029
SerGlu: 2.625 ± 0.035
SerPhe: 2.481 ± 0.035
SerGly: 6.109 ± 0.076
SerHis: 1.122 ± 0.02
SerIle: 2.929 ± 0.035
SerLys: 1.814 ± 0.025
SerLeu: 5.856 ± 0.05
SerMet: 1.273 ± 0.023
SerAsn: 1.999 ± 0.032
SerPro: 3.601 ± 0.045
SerGln: 1.997 ± 0.031
SerArg: 3.809 ± 0.041
SerSer: 3.685 ± 0.046
SerThr: 3.423 ± 0.047
SerVal: 4.535 ± 0.05
SerTrp: 0.89 ± 0.017
SerTyr: 1.654 ± 0.027
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.618 ± 0.081
ThrCys: 0.502 ± 0.015
ThrAsp: 2.636 ± 0.035
ThrGlu: 2.47 ± 0.03
ThrPhe: 2.387 ± 0.042
ThrGly: 5.784 ± 0.06
ThrHis: 1.032 ± 0.016
ThrIle: 2.975 ± 0.04
ThrLys: 1.537 ± 0.025
ThrLeu: 6.01 ± 0.062
ThrMet: 1.084 ± 0.021
ThrAsn: 1.897 ± 0.04
ThrPro: 3.85 ± 0.05
ThrGln: 1.789 ± 0.028
ThrArg: 3.32 ± 0.032
ThrSer: 3.392 ± 0.052
ThrThr: 3.313 ± 0.052
ThrVal: 4.877 ± 0.061
ThrTrp: 0.916 ± 0.02
ThrTyr: 1.569 ± 0.025
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.991 ± 0.062
ValCys: 0.673 ± 0.016
ValAsp: 3.736 ± 0.037
ValGlu: 4.169 ± 0.047
ValPhe: 2.778 ± 0.029
ValGly: 5.047 ± 0.05
ValHis: 1.41 ± 0.024
ValIle: 3.534 ± 0.033
ValLys: 2.587 ± 0.037
ValLeu: 7.224 ± 0.053
ValMet: 1.632 ± 0.029
ValAsn: 2.676 ± 0.038
ValPro: 3.956 ± 0.036
ValGln: 2.432 ± 0.029
ValArg: 5.196 ± 0.051
ValSer: 4.568 ± 0.053
ValThr: 5.006 ± 0.066
ValVal: 5.638 ± 0.051
ValTrp: 1.0 ± 0.019
ValTyr: 1.988 ± 0.029
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.212 ± 0.023
TrpCys: 0.132 ± 0.007
TrpAsp: 0.719 ± 0.014
TrpGlu: 0.688 ± 0.015
TrpPhe: 0.598 ± 0.017
TrpGly: 0.995 ± 0.02
TrpHis: 0.348 ± 0.012
TrpIle: 0.792 ± 0.016
TrpLys: 0.716 ± 0.017
TrpLeu: 1.506 ± 0.024
TrpMet: 0.426 ± 0.012
TrpAsn: 0.612 ± 0.017
TrpPro: 0.654 ± 0.016
TrpGln: 0.662 ± 0.017
TrpArg: 1.157 ± 0.022
TrpSer: 1.012 ± 0.022
TrpThr: 0.915 ± 0.018
TrpVal: 0.948 ± 0.018
TrpTrp: 0.252 ± 0.01
TrpTyr: 0.4 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.596 ± 0.029
TyrCys: 0.296 ± 0.01
TyrAsp: 1.556 ± 0.027
TyrGlu: 1.373 ± 0.026
TyrPhe: 1.288 ± 0.02
TyrGly: 2.402 ± 0.031
TyrHis: 0.61 ± 0.013
TyrIle: 1.025 ± 0.019
TyrLys: 0.767 ± 0.018
TyrLeu: 2.754 ± 0.034
TyrMet: 0.489 ± 0.014
TyrAsn: 1.025 ± 0.025
TyrPro: 1.418 ± 0.021
TyrGln: 1.131 ± 0.024
TyrArg: 2.208 ± 0.03
TyrSer: 1.889 ± 0.028
TyrThr: 1.649 ± 0.026
TyrVal: 1.817 ± 0.026
TyrTrp: 0.452 ± 0.012
TyrTyr: 0.914 ± 0.017
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7761 proteins (2980018 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
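A protein-level bootstrap of this kind can be sketched as follows. The page does not state exactly how the error is derived from the replicates, so this sketch assumes it is the standard deviation of the per-replicate frequencies. The function names, the fixed seed, and the toy sequences are illustrative only, and dipeptides are again assumed to be overlapping adjacent pairs written in one-letter code.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Resample whole proteins with replacement and return the spread."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Protein-level resampling: each replicate draws len(proteins) proteins.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the reported error is the standard deviation over replicates.
    return statistics.stdev(estimates)

# Toy input, not real Solibacter usitatus sequences.
toy = ["MAALG", "MKAAV", "MLLGA", "MAAAK", "MGLVA"]
point = pair_per_mille(toy, "AA")
err = bootstrap_error(toy, "AA")
print(f"AlaAla: {point:.3f} ± {err:.3f} per mille")
```

Resampling whole proteins rather than individual residue pairs keeps the pairs of one protein together, which is what "at the protein level" suggests.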

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski