Amino acid dipeptide frequency for Desulfurispirillum indicum (strain ATCC BAA-1389 / DSM 22839 / S5)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred in proteins with equal probability, each would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
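The calculation described above can be sketched in a few lines of Python. This is only a minimal illustration, not the code used by Proteome-pI: the function name `dipeptide_permille` is hypothetical, sequences are assumed to be given in one-letter code (the table uses three-letter codes), and dipeptides are assumed to be counted as overlapping pairs that never span two proteins.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping windows of length 2 are counted within each
    protein; pairs are never formed across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (made-up sequences, not taken from the proteome).
freqs = dipeptide_permille(["MKLLA", "ALLK"])
over = sorted(p for p, f in freqs.items() if f > UNIFORM_EXPECTATION)
```

In this toy input every observed pair exceeds the uniform expectation simply because only seven pairs are counted; on a full proteome the same comparison separates over- from underrepresented dipeptides.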

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.774 ± 0.128
AlaCys: 1.176 ± 0.038
AlaAsp: 4.448 ± 0.081
AlaGlu: 5.1 ± 0.085
AlaPhe: 3.654 ± 0.069
AlaGly: 6.547 ± 0.105
AlaHis: 2.194 ± 0.051
AlaIle: 5.988 ± 0.089
AlaLys: 3.014 ± 0.072
AlaLeu: 10.18 ± 0.123
AlaMet: 2.83 ± 0.064
AlaAsn: 2.548 ± 0.064
AlaPro: 3.303 ± 0.072
AlaGln: 3.732 ± 0.076
AlaArg: 5.877 ± 0.08
AlaSer: 5.601 ± 0.102
AlaThr: 4.638 ± 0.082
AlaVal: 6.125 ± 0.094
AlaTrp: 0.911 ± 0.035
AlaTyr: 2.425 ± 0.06
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.066 ± 0.034
CysCys: 0.183 ± 0.016
CysAsp: 0.624 ± 0.027
CysGlu: 0.615 ± 0.028
CysPhe: 0.474 ± 0.024
CysGly: 1.098 ± 0.041
CysHis: 0.487 ± 0.036
CysIle: 0.628 ± 0.026
CysLys: 0.32 ± 0.019
CysLeu: 1.091 ± 0.034
CysMet: 0.257 ± 0.017
CysAsn: 0.37 ± 0.024
CysPro: 0.649 ± 0.036
CysGln: 0.545 ± 0.022
CysArg: 0.805 ± 0.035
CysSer: 0.791 ± 0.035
CysThr: 0.631 ± 0.029
CysVal: 0.757 ± 0.029
CysTrp: 0.089 ± 0.012
CysTyr: 0.33 ± 0.022
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.476 ± 0.08
AspCys: 0.54 ± 0.029
AspAsp: 2.685 ± 0.054
AspGlu: 3.813 ± 0.067
AspPhe: 2.485 ± 0.04
AspGly: 3.828 ± 0.071
AspHis: 1.235 ± 0.045
AspIle: 4.279 ± 0.077
AspLys: 1.73 ± 0.047
AspLeu: 4.784 ± 0.085
AspMet: 1.464 ± 0.047
AspAsn: 1.658 ± 0.048
AspPro: 2.462 ± 0.058
AspGln: 1.851 ± 0.047
AspArg: 2.981 ± 0.059
AspSer: 2.877 ± 0.058
AspThr: 2.711 ± 0.055
AspVal: 3.482 ± 0.062
AspTrp: 0.552 ± 0.025
AspTyr: 1.588 ± 0.05
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.726 ± 0.102
GluCys: 0.548 ± 0.028
GluAsp: 3.026 ± 0.068
GluGlu: 4.83 ± 0.094
GluPhe: 2.343 ± 0.055
GluGly: 4.167 ± 0.073
GluHis: 1.847 ± 0.05
GluIle: 4.127 ± 0.068
GluLys: 3.732 ± 0.076
GluLeu: 7.0 ± 0.093
GluMet: 1.815 ± 0.051
GluAsn: 2.613 ± 0.063
GluPro: 2.419 ± 0.058
GluGln: 3.348 ± 0.073
GluArg: 4.456 ± 0.079
GluSer: 3.67 ± 0.067
GluThr: 2.833 ± 0.065
GluVal: 4.648 ± 0.09
GluTrp: 0.507 ± 0.023
GluTyr: 2.143 ± 0.056
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.77 ± 0.071
PheCys: 0.588 ± 0.026
PheAsp: 2.351 ± 0.058
PheGlu: 2.344 ± 0.05
PhePhe: 2.113 ± 0.058
PheGly: 3.091 ± 0.066
PheHis: 1.248 ± 0.038
PheIle: 2.371 ± 0.056
PheLys: 1.119 ± 0.036
PheLeu: 4.426 ± 0.075
PheMet: 1.092 ± 0.041
PheAsn: 1.285 ± 0.045
PhePro: 1.777 ± 0.051
PheGln: 1.478 ± 0.041
PheArg: 2.482 ± 0.06
PheSer: 3.137 ± 0.062
PheThr: 2.431 ± 0.058
PheVal: 2.646 ± 0.058
PheTrp: 0.505 ± 0.028
PheTyr: 1.189 ± 0.035
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.902 ± 0.09
GlyCys: 1.002 ± 0.041
GlyAsp: 3.517 ± 0.059
GlyGlu: 4.523 ± 0.075
GlyPhe: 3.163 ± 0.06
GlyGly: 5.234 ± 0.097
GlyHis: 1.676 ± 0.045
GlyIle: 5.342 ± 0.084
GlyLys: 3.677 ± 0.075
GlyLeu: 6.751 ± 0.103
GlyMet: 2.378 ± 0.068
GlyAsn: 2.628 ± 0.062
GlyPro: 1.902 ± 0.049
GlyGln: 2.913 ± 0.066
GlyArg: 4.283 ± 0.079
GlySer: 4.745 ± 0.082
GlyThr: 4.125 ± 0.081
GlyVal: 5.416 ± 0.074
GlyTrp: 0.766 ± 0.031
GlyTyr: 2.59 ± 0.055
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.86 ± 0.051
HisCys: 0.409 ± 0.023
HisAsp: 1.341 ± 0.037
HisGlu: 1.571 ± 0.041
HisPhe: 1.216 ± 0.037
HisGly: 2.166 ± 0.056
HisHis: 0.902 ± 0.04
HisIle: 1.797 ± 0.05
HisLys: 0.809 ± 0.031
HisLeu: 2.674 ± 0.062
HisMet: 0.697 ± 0.029
HisAsn: 0.808 ± 0.033
HisPro: 1.48 ± 0.039
HisGln: 1.173 ± 0.038
HisArg: 1.601 ± 0.05
HisSer: 1.62 ± 0.044
HisThr: 1.292 ± 0.038
HisVal: 1.558 ± 0.047
HisTrp: 0.347 ± 0.021
HisTyr: 0.902 ± 0.038
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.321 ± 0.088
IleCys: 0.757 ± 0.032
IleAsp: 3.626 ± 0.066
IleGlu: 3.962 ± 0.072
IlePhe: 2.585 ± 0.067
IleGly: 4.591 ± 0.086
IleHis: 1.651 ± 0.046
IleIle: 4.123 ± 0.077
IleLys: 2.164 ± 0.054
IleLeu: 6.368 ± 0.097
IleMet: 1.46 ± 0.041
IleAsn: 2.243 ± 0.054
IlePro: 3.17 ± 0.066
IleGln: 2.235 ± 0.045
IleArg: 3.761 ± 0.076
IleSer: 4.423 ± 0.073
IleThr: 3.807 ± 0.07
IleVal: 4.283 ± 0.082
IleTrp: 0.492 ± 0.024
IleTyr: 1.651 ± 0.041
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.446 ± 0.069
LysCys: 0.329 ± 0.022
LysAsp: 1.995 ± 0.062
LysGlu: 2.687 ± 0.068
LysPhe: 1.161 ± 0.039
LysGly: 2.738 ± 0.067
LysHis: 0.882 ± 0.033
LysIle: 2.497 ± 0.057
LysLys: 2.236 ± 0.071
LysLeu: 3.723 ± 0.072
LysMet: 1.046 ± 0.037
LysAsn: 1.467 ± 0.044
LysPro: 1.793 ± 0.048
LysGln: 1.564 ± 0.043
LysArg: 2.563 ± 0.058
LysSer: 2.345 ± 0.062
LysThr: 2.121 ± 0.051
LysVal: 2.747 ± 0.067
LysTrp: 0.33 ± 0.021
LysTyr: 1.133 ± 0.038
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.416 ± 0.114
LeuCys: 1.361 ± 0.041
LeuAsp: 5.341 ± 0.088
LeuGlu: 7.591 ± 0.101
LeuPhe: 4.346 ± 0.084
LeuGly: 7.202 ± 0.097
LeuHis: 2.544 ± 0.062
LeuIle: 5.401 ± 0.097
LeuLys: 4.023 ± 0.075
LeuLeu: 11.566 ± 0.154
LeuMet: 2.612 ± 0.061
LeuAsn: 3.235 ± 0.07
LeuPro: 5.161 ± 0.094
LeuGln: 4.981 ± 0.081
LeuArg: 6.752 ± 0.093
LeuSer: 7.164 ± 0.098
LeuThr: 4.745 ± 0.073
LeuVal: 6.705 ± 0.108
LeuTrp: 1.081 ± 0.034
LeuTyr: 2.731 ± 0.071
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.841 ± 0.058
MetCys: 0.22 ± 0.017
MetAsp: 1.563 ± 0.048
MetGlu: 2.322 ± 0.054
MetPhe: 0.828 ± 0.032
MetGly: 2.35 ± 0.056
MetHis: 0.597 ± 0.025
MetIle: 1.455 ± 0.04
MetLys: 1.355 ± 0.037
MetLeu: 2.617 ± 0.054
MetMet: 0.63 ± 0.026
MetAsn: 1.04 ± 0.035
MetPro: 1.219 ± 0.036
MetGln: 1.108 ± 0.035
MetArg: 1.557 ± 0.042
MetSer: 1.633 ± 0.043
MetThr: 1.454 ± 0.043
MetVal: 2.072 ± 0.056
MetTrp: 0.178 ± 0.014
MetTyr: 0.56 ± 0.027
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.768 ± 0.059
AsnCys: 0.354 ± 0.024
AsnAsp: 1.503 ± 0.042
AsnGlu: 1.646 ± 0.048
AsnPhe: 1.362 ± 0.04
AsnGly: 2.205 ± 0.061
AsnHis: 0.824 ± 0.03
AsnIle: 2.42 ± 0.063
AsnLys: 1.009 ± 0.039
AsnLeu: 3.429 ± 0.069
AsnMet: 0.911 ± 0.029
AsnAsn: 1.094 ± 0.04
AsnPro: 2.093 ± 0.049
AsnGln: 1.352 ± 0.042
AsnArg: 2.298 ± 0.053
AsnSer: 1.877 ± 0.046
AsnThr: 1.921 ± 0.054
AsnVal: 2.194 ± 0.055
AsnTrp: 0.343 ± 0.02
AsnTyr: 0.971 ± 0.037
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.217 ± 0.074
ProCys: 0.465 ± 0.024
ProAsp: 2.693 ± 0.058
ProGlu: 3.191 ± 0.067
ProPhe: 1.9 ± 0.046
ProGly: 3.409 ± 0.066
ProHis: 1.177 ± 0.035
ProIle: 2.08 ± 0.05
ProLys: 1.389 ± 0.043
ProLeu: 4.569 ± 0.083
ProMet: 1.27 ± 0.033
ProAsn: 1.027 ± 0.034
ProPro: 1.734 ± 0.057
ProGln: 1.974 ± 0.049
ProArg: 2.379 ± 0.054
ProSer: 2.397 ± 0.05
ProThr: 1.99 ± 0.055
ProVal: 3.688 ± 0.067
ProTrp: 0.604 ± 0.025
ProTyr: 1.289 ± 0.036
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.048 ± 0.08
GlnCys: 0.473 ± 0.027
GlnAsp: 1.854 ± 0.056
GlnGlu: 3.175 ± 0.062
GlnPhe: 1.372 ± 0.039
GlnGly: 2.941 ± 0.06
GlnHis: 1.27 ± 0.041
GlnIle: 2.331 ± 0.059
GlnLys: 1.934 ± 0.052
GlnLeu: 4.252 ± 0.088
GlnMet: 1.262 ± 0.04
GlnAsn: 1.301 ± 0.039
GlnPro: 1.784 ± 0.054
GlnGln: 3.043 ± 0.075
GlnArg: 3.336 ± 0.071
GlnSer: 2.583 ± 0.057
GlnThr: 2.013 ± 0.049
GlnVal: 3.12 ± 0.06
GlnTrp: 0.74 ± 0.031
GlnTyr: 1.222 ± 0.04
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.599 ± 0.078
ArgCys: 0.678 ± 0.029
ArgAsp: 3.371 ± 0.069
ArgGlu: 4.968 ± 0.088
ArgPhe: 2.813 ± 0.057
ArgGly: 3.814 ± 0.072
ArgHis: 1.898 ± 0.052
ArgIle: 4.287 ± 0.068
ArgLys: 3.014 ± 0.063
ArgLeu: 6.296 ± 0.114
ArgMet: 1.838 ± 0.046
ArgAsn: 2.262 ± 0.051
ArgPro: 2.207 ± 0.052
ArgGln: 3.719 ± 0.073
ArgArg: 4.011 ± 0.078
ArgSer: 3.78 ± 0.065
ArgThr: 3.127 ± 0.062
ArgVal: 4.205 ± 0.079
ArgTrp: 0.712 ± 0.031
ArgTyr: 2.112 ± 0.052
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.884 ± 0.088
SerCys: 0.736 ± 0.036
SerAsp: 3.097 ± 0.059
SerGlu: 3.544 ± 0.076
SerPhe: 2.759 ± 0.06
SerGly: 5.33 ± 0.088
SerHis: 1.769 ± 0.045
SerIle: 4.04 ± 0.068
SerLys: 1.879 ± 0.052
SerLeu: 6.917 ± 0.095
SerMet: 1.861 ± 0.052
SerAsn: 1.686 ± 0.044
SerPro: 2.741 ± 0.064
SerGln: 2.548 ± 0.056
SerArg: 4.204 ± 0.084
SerSer: 4.265 ± 0.085
SerThr: 3.307 ± 0.063
SerVal: 4.174 ± 0.075
SerTrp: 0.706 ± 0.029
SerTyr: 1.783 ± 0.049
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.618 ± 0.094
ThrCys: 0.627 ± 0.029
ThrAsp: 2.411 ± 0.053
ThrGlu: 2.676 ± 0.059
ThrPhe: 2.172 ± 0.054
ThrGly: 4.442 ± 0.082
ThrHis: 1.228 ± 0.038
ThrIle: 3.822 ± 0.075
ThrLys: 1.484 ± 0.044
ThrLeu: 5.773 ± 0.095
ThrMet: 1.465 ± 0.044
ThrAsn: 1.484 ± 0.046
ThrPro: 2.941 ± 0.065
ThrGln: 1.743 ± 0.046
ThrArg: 2.991 ± 0.05
ThrSer: 3.206 ± 0.065
ThrThr: 2.903 ± 0.058
ThrVal: 3.668 ± 0.069
ThrTrp: 0.526 ± 0.023
ThrTyr: 1.544 ± 0.049
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.321 ± 0.095
ValCys: 0.84 ± 0.034
ValAsp: 3.875 ± 0.08
ValGlu: 4.899 ± 0.074
ValPhe: 2.863 ± 0.072
ValGly: 4.433 ± 0.077
ValHis: 1.566 ± 0.044
ValIle: 4.471 ± 0.085
ValLys: 2.563 ± 0.06
ValLeu: 7.235 ± 0.097
ValMet: 1.755 ± 0.045
ValAsn: 2.537 ± 0.054
ValPro: 2.881 ± 0.06
ValGln: 2.672 ± 0.058
ValArg: 4.333 ± 0.075
ValSer: 4.53 ± 0.071
ValThr: 3.663 ± 0.074
ValVal: 5.25 ± 0.091
ValTrp: 0.528 ± 0.023
ValTyr: 1.872 ± 0.05
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.661 ± 0.028
TrpCys: 0.135 ± 0.013
TrpAsp: 0.477 ± 0.025
TrpGlu: 0.644 ± 0.025
TrpPhe: 0.422 ± 0.021
TrpGly: 0.664 ± 0.031
TrpHis: 0.312 ± 0.02
TrpIle: 0.614 ± 0.025
TrpLys: 0.512 ± 0.023
TrpLeu: 1.211 ± 0.045
TrpMet: 0.271 ± 0.02
TrpAsn: 0.432 ± 0.019
TrpPro: 0.376 ± 0.02
TrpGln: 0.719 ± 0.034
TrpArg: 0.693 ± 0.033
TrpSer: 0.671 ± 0.032
TrpThr: 0.431 ± 0.026
TrpVal: 0.621 ± 0.026
TrpTrp: 0.146 ± 0.015
TrpTyr: 0.378 ± 0.026
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.352 ± 0.057
TyrCys: 0.389 ± 0.02
TyrAsp: 1.683 ± 0.044
TyrGlu: 1.699 ± 0.051
TyrPhe: 1.413 ± 0.043
TyrGly: 2.282 ± 0.049
TyrHis: 0.918 ± 0.034
TyrIle: 1.595 ± 0.05
TyrLys: 0.841 ± 0.032
TyrLeu: 3.098 ± 0.064
TyrMet: 0.685 ± 0.028
TyrAsn: 0.854 ± 0.036
TyrPro: 1.348 ± 0.04
TyrGln: 1.357 ± 0.048
TyrArg: 2.348 ± 0.051
TyrSer: 1.914 ± 0.057
TyrThr: 1.581 ± 0.053
TyrVal: 1.729 ± 0.045
TyrTrp: 0.324 ± 0.02
TyrTyr: 0.98 ± 0.036
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2551 proteins (853656 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
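Bootstrapping at the protein level can be illustrated as follows. This is a rough sketch under stated assumptions rather than the database's actual procedure: whole proteins are resampled with replacement 100 times, the per mille frequency of one dipeptide is recomputed for each resample, and the standard deviation of those estimates is taken as the error. The function names, the one-letter sequences, and the use of the standard deviation as the reported ± value are all assumptions made for this example.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per mille frequency.

    Assumption: the resampling unit is the whole protein, so each replicate
    draws len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (made-up sequences, not taken from the proteome).
mean_ll, err_ll = bootstrap_error(["MKLLA", "ALLK", "LLLL", "MKA"], "LL")
```

Resampling proteins rather than individual residue pairs keeps pairs from the same protein together, which is one plausible reading of "at the protein level".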

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski