Amino acid dipeptide frequency for Thermodesulfovibrio yellowstonii (strain ATCC 51303 / DSM 11347 / YP87)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are under-represented are marked in blue in the table.
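The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it counts overlapping residue pairs within each protein, pools any non-standard letter into X (Xaa), and compares each frequency with the uniform expectation of 2.5‰. The function names and the toy sequences are hypothetical, and the exact counting and colouring rules used by the database are not given on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is pooled as X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to X, mirroring the Xaa class.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation for 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


toy = ["MKKLLE", "AKKL"]  # hypothetical sequences, not taken from the proteome
freqs = dipeptide_permille(toy)
```

Under this rule, the AlaAla value of 3.436‰ from the table would be labelled "over" and the AlaCys value of 0.831‰ "under".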

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
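As a small worked example of this conversion, the sketch below reads one table entry and rescales it from per mille to a plain fraction. The regular expression and the function name are illustrative assumptions about the entry layout ("AlaAla: 3.436 ± 0.096"), not part of the database.

```python
import re

# Matches the dipeptide label, value and error in an entry such as
# "AlaAla: 3.436 ± 0.096"; any text before the label is ignored.
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)")


def parse_entry(line):
    """Return (first, second, fraction, fraction_error) or None if no match."""
    match = ENTRY.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    # Per mille to fraction: multiply by 10^-3.
    return first, second, float(value) * 1e-3, float(error) * 1e-3


parsed = parse_entry("AlaAla: 3.436 ± 0.096")
```

For this entry, 3.436‰ corresponds to a fraction of about 0.003436, that is, roughly 0.34% of all dipeptides.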

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first residue. Residue order: Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa
Ala
AlaAla: 3.436 ± 0.096
AlaCys: 0.831 ± 0.04
AlaAsp: 2.569 ± 0.067
AlaGlu: 4.502 ± 0.09
AlaPhe: 3.068 ± 0.084
AlaGly: 4.441 ± 0.111
AlaHis: 1.064 ± 0.046
AlaIle: 5.898 ± 0.105
AlaLys: 5.124 ± 0.096
AlaLeu: 6.98 ± 0.133
AlaMet: 1.618 ± 0.053
AlaAsn: 1.971 ± 0.06
AlaPro: 1.669 ± 0.07
AlaGln: 1.999 ± 0.064
AlaArg: 2.411 ± 0.065
AlaSer: 3.475 ± 0.083
AlaThr: 3.015 ± 0.074
AlaVal: 4.525 ± 0.098
AlaTrp: 0.526 ± 0.029
AlaTyr: 2.317 ± 0.056
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.618 ± 0.035
CysCys: 0.168 ± 0.018
CysAsp: 0.445 ± 0.027
CysGlu: 0.611 ± 0.033
CysPhe: 0.557 ± 0.03
CysGly: 0.979 ± 0.05
CysHis: 0.407 ± 0.06
CysIle: 0.921 ± 0.045
CysLys: 0.82 ± 0.036
CysLeu: 0.982 ± 0.045
CysMet: 0.273 ± 0.025
CysAsn: 0.442 ± 0.027
CysPro: 0.755 ± 0.045
CysGln: 0.263 ± 0.019
CysArg: 0.429 ± 0.029
CysSer: 0.765 ± 0.039
CysThr: 0.439 ± 0.025
CysVal: 0.631 ± 0.034
CysTrp: 0.113 ± 0.014
CysTyr: 0.47 ± 0.028
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.838 ± 0.07
AspCys: 0.53 ± 0.035
AspAsp: 1.894 ± 0.06
AspGlu: 4.117 ± 0.114
AspPhe: 2.889 ± 0.078
AspGly: 2.661 ± 0.075
AspHis: 0.504 ± 0.031
AspIle: 5.707 ± 0.097
AspLys: 4.024 ± 0.091
AspLeu: 4.227 ± 0.091
AspMet: 1.16 ± 0.046
AspAsn: 1.651 ± 0.055
AspPro: 1.94 ± 0.068
AspGln: 0.672 ± 0.035
AspArg: 2.161 ± 0.07
AspSer: 2.457 ± 0.063
AspThr: 2.087 ± 0.059
AspVal: 2.881 ± 0.084
AspTrp: 0.463 ± 0.028
AspTyr: 2.055 ± 0.055
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.04 ± 0.109
GluCys: 0.578 ± 0.031
GluAsp: 3.456 ± 0.083
GluGlu: 7.53 ± 0.14
GluPhe: 3.375 ± 0.076
GluGly: 4.595 ± 0.085
GluHis: 1.14 ± 0.044
GluIle: 9.051 ± 0.157
GluLys: 8.603 ± 0.122
GluLeu: 7.708 ± 0.134
GluMet: 1.738 ± 0.053
GluAsn: 3.853 ± 0.097
GluPro: 2.375 ± 0.067
GluGln: 2.166 ± 0.066
GluArg: 3.735 ± 0.087
GluSer: 3.106 ± 0.073
GluThr: 3.313 ± 0.073
GluVal: 4.796 ± 0.113
GluTrp: 0.614 ± 0.03
GluTyr: 2.229 ± 0.061
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.733 ± 0.069
PheCys: 0.619 ± 0.032
PheAsp: 2.541 ± 0.074
PheGlu: 3.211 ± 0.07
PhePhe: 2.731 ± 0.081
PheGly: 3.076 ± 0.073
PheHis: 0.843 ± 0.035
PheIle: 5.251 ± 0.113
PheLys: 3.938 ± 0.092
PheLeu: 5.398 ± 0.127
PheMet: 1.153 ± 0.045
PheAsn: 2.161 ± 0.062
PhePro: 1.736 ± 0.055
PheGln: 1.251 ± 0.051
PheArg: 1.761 ± 0.05
PheSer: 3.539 ± 0.087
PheThr: 2.414 ± 0.065
PheVal: 2.815 ± 0.071
PheTrp: 0.496 ± 0.031
PheTyr: 2.175 ± 0.072
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.035 ± 0.106
GlyCys: 0.843 ± 0.047
GlyAsp: 2.784 ± 0.064
GlyGlu: 3.968 ± 0.088
GlyPhe: 3.513 ± 0.085
GlyGly: 4.244 ± 0.117
GlyHis: 1.054 ± 0.04
GlyIle: 7.024 ± 0.123
GlyLys: 6.031 ± 0.098
GlyLeu: 5.648 ± 0.114
GlyMet: 1.608 ± 0.05
GlyAsn: 2.424 ± 0.066
GlyPro: 1.452 ± 0.049
GlyGln: 1.629 ± 0.056
GlyArg: 2.782 ± 0.077
GlySer: 3.423 ± 0.087
GlyThr: 3.209 ± 0.082
GlyVal: 4.533 ± 0.085
GlyTrp: 0.668 ± 0.038
GlyTyr: 2.692 ± 0.063
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.949 ± 0.037
HisCys: 0.228 ± 0.021
HisAsp: 0.687 ± 0.036
HisGlu: 1.094 ± 0.04
HisPhe: 0.788 ± 0.037
HisGly: 1.206 ± 0.044
HisHis: 0.343 ± 0.023
HisIle: 1.634 ± 0.049
HisLys: 1.105 ± 0.049
HisLeu: 1.618 ± 0.049
HisMet: 0.314 ± 0.027
HisAsn: 0.614 ± 0.029
HisPro: 0.946 ± 0.039
HisGln: 0.348 ± 0.028
HisArg: 0.754 ± 0.035
HisSer: 1.023 ± 0.045
HisThr: 0.793 ± 0.034
HisVal: 0.866 ± 0.037
HisTrp: 0.156 ± 0.014
HisTyr: 0.664 ± 0.037
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.502 ± 0.11
IleCys: 1.02 ± 0.044
IleAsp: 5.413 ± 0.104
IleGlu: 8.077 ± 0.123
IlePhe: 5.001 ± 0.121
IleGly: 6.172 ± 0.132
IleHis: 1.567 ± 0.048
IleIle: 8.416 ± 0.161
IleLys: 9.649 ± 0.147
IleLeu: 9.394 ± 0.152
IleMet: 1.836 ± 0.057
IleAsn: 4.737 ± 0.095
IlePro: 4.471 ± 0.092
IleGln: 2.473 ± 0.064
IleArg: 3.584 ± 0.075
IleSer: 6.287 ± 0.096
IleThr: 4.807 ± 0.085
IleVal: 6.129 ± 0.116
IleTrp: 0.609 ± 0.036
IleTyr: 3.316 ± 0.077
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.942 ± 0.107
LysCys: 0.777 ± 0.048
LysAsp: 4.829 ± 0.106
LysGlu: 9.468 ± 0.139
LysPhe: 3.474 ± 0.071
LysGly: 5.656 ± 0.098
LysHis: 1.178 ± 0.042
LysIle: 10.017 ± 0.142
LysLys: 9.59 ± 0.149
LysLeu: 7.425 ± 0.119
LysMet: 2.06 ± 0.06
LysAsn: 4.819 ± 0.096
LysPro: 3.048 ± 0.078
LysGln: 2.386 ± 0.071
LysArg: 3.712 ± 0.091
LysSer: 4.08 ± 0.092
LysThr: 4.512 ± 0.077
LysVal: 5.224 ± 0.103
LysTrp: 0.7 ± 0.034
LysTyr: 2.994 ± 0.071
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.704 ± 0.121
LeuCys: 1.079 ± 0.043
LeuAsp: 3.84 ± 0.079
LeuGlu: 6.21 ± 0.102
LeuPhe: 4.743 ± 0.112
LeuGly: 5.551 ± 0.1
LeuHis: 1.449 ± 0.052
LeuIle: 9.559 ± 0.159
LeuLys: 10.728 ± 0.154
LeuLeu: 8.69 ± 0.144
LeuMet: 2.05 ± 0.066
LeuAsn: 4.898 ± 0.104
LeuPro: 4.024 ± 0.079
LeuGln: 2.735 ± 0.073
LeuArg: 4.548 ± 0.1
LeuSer: 7.203 ± 0.118
LeuThr: 4.687 ± 0.103
LeuVal: 4.812 ± 0.094
LeuTrp: 0.92 ± 0.041
LeuTyr: 3.362 ± 0.077
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.539 ± 0.057
MetCys: 0.189 ± 0.017
MetAsp: 1.079 ± 0.041
MetGlu: 1.545 ± 0.049
MetPhe: 0.841 ± 0.043
MetGly: 1.485 ± 0.056
MetHis: 0.42 ± 0.025
MetIle: 2.015 ± 0.057
MetLys: 2.406 ± 0.063
MetLeu: 2.147 ± 0.059
MetMet: 0.42 ± 0.031
MetAsn: 1.045 ± 0.04
MetPro: 1.104 ± 0.045
MetGln: 0.701 ± 0.036
MetArg: 1.084 ± 0.039
MetSer: 1.314 ± 0.046
MetThr: 0.936 ± 0.041
MetVal: 1.406 ± 0.049
MetTrp: 0.143 ± 0.016
MetTyr: 0.475 ± 0.027
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.46 ± 0.063
AsnCys: 0.501 ± 0.034
AsnAsp: 1.616 ± 0.056
AsnGlu: 3.089 ± 0.084
AsnPhe: 2.441 ± 0.063
AsnGly: 2.14 ± 0.059
AsnHis: 0.622 ± 0.03
AsnIle: 4.669 ± 0.099
AsnLys: 3.649 ± 0.076
AsnLeu: 4.912 ± 0.097
AsnMet: 0.91 ± 0.039
AsnAsn: 1.836 ± 0.066
AsnPro: 2.455 ± 0.065
AsnGln: 1.079 ± 0.045
AsnArg: 1.69 ± 0.057
AsnSer: 2.649 ± 0.075
AsnThr: 1.793 ± 0.057
AsnVal: 2.383 ± 0.071
AsnTrp: 0.409 ± 0.025
AsnTyr: 1.777 ± 0.057
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.032 ± 0.058
ProCys: 0.443 ± 0.028
ProAsp: 2.219 ± 0.054
ProGlu: 3.97 ± 0.081
ProPhe: 2.263 ± 0.059
ProGly: 2.022 ± 0.06
ProHis: 0.778 ± 0.039
ProIle: 2.817 ± 0.079
ProLys: 2.577 ± 0.083
ProLeu: 3.741 ± 0.082
ProMet: 0.746 ± 0.034
ProAsn: 1.245 ± 0.048
ProPro: 1.478 ± 0.054
ProGln: 1.314 ± 0.047
ProArg: 1.271 ± 0.047
ProSer: 2.354 ± 0.063
ProThr: 1.679 ± 0.053
ProVal: 3.173 ± 0.073
ProTrp: 0.35 ± 0.025
ProTyr: 1.733 ± 0.051
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.877 ± 0.058
GlnCys: 0.253 ± 0.022
GlnAsp: 1.164 ± 0.048
GlnGlu: 2.204 ± 0.061
GlnPhe: 1.046 ± 0.039
GlnGly: 1.613 ± 0.055
GlnHis: 0.401 ± 0.028
GlnIle: 2.822 ± 0.069
GlnLys: 2.964 ± 0.075
GlnLeu: 2.176 ± 0.063
GlnMet: 0.721 ± 0.037
GlnAsn: 1.297 ± 0.053
GlnPro: 0.893 ± 0.041
GlnGln: 0.806 ± 0.051
GlnArg: 1.481 ± 0.051
GlnSer: 1.513 ± 0.051
GlnThr: 1.337 ± 0.052
GlnVal: 1.391 ± 0.05
GlnTrp: 0.315 ± 0.023
GlnTyr: 0.905 ± 0.038
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.675 ± 0.07
ArgCys: 0.485 ± 0.032
ArgAsp: 2.232 ± 0.069
ArgGlu: 3.625 ± 0.083
ArgPhe: 1.953 ± 0.049
ArgGly: 2.542 ± 0.075
ArgHis: 0.705 ± 0.03
ArgIle: 4.071 ± 0.089
ArgLys: 3.819 ± 0.076
ArgLeu: 3.96 ± 0.088
ArgMet: 0.964 ± 0.04
ArgAsn: 1.775 ± 0.064
ArgPro: 1.212 ± 0.049
ArgGln: 1.375 ± 0.057
ArgArg: 1.882 ± 0.056
ArgSer: 1.578 ± 0.061
ArgThr: 1.72 ± 0.056
ArgVal: 2.983 ± 0.069
ArgTrp: 0.394 ± 0.025
ArgTyr: 1.547 ± 0.049
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.152 ± 0.08
SerCys: 0.696 ± 0.04
SerAsp: 2.664 ± 0.061
SerGlu: 4.249 ± 0.086
SerPhe: 3.433 ± 0.093
SerGly: 4.025 ± 0.083
SerHis: 1.066 ± 0.042
SerIle: 5.311 ± 0.116
SerLys: 4.403 ± 0.093
SerLeu: 6.614 ± 0.114
SerMet: 1.412 ± 0.051
SerAsn: 2.035 ± 0.065
SerPro: 2.434 ± 0.062
SerGln: 1.795 ± 0.051
SerArg: 2.16 ± 0.066
SerSer: 3.64 ± 0.086
SerThr: 2.475 ± 0.061
SerVal: 3.421 ± 0.074
SerTrp: 0.519 ± 0.03
SerTyr: 2.109 ± 0.066
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.096 ± 0.088
ThrCys: 0.445 ± 0.031
ThrAsp: 2.168 ± 0.065
ThrGlu: 3.597 ± 0.079
ThrPhe: 2.101 ± 0.061
ThrGly: 4.116 ± 0.081
ThrHis: 0.821 ± 0.037
ThrIle: 4.001 ± 0.082
ThrLys: 3.497 ± 0.068
ThrLeu: 4.687 ± 0.082
ThrMet: 0.925 ± 0.039
ThrAsn: 1.682 ± 0.053
ThrPro: 2.165 ± 0.06
ThrGln: 1.302 ± 0.049
ThrArg: 1.506 ± 0.056
ThrSer: 2.495 ± 0.06
ThrThr: 2.34 ± 0.065
ThrVal: 3.617 ± 0.077
ThrTrp: 0.36 ± 0.026
ThrTyr: 1.618 ± 0.055
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.07 ± 0.101
ValCys: 0.815 ± 0.04
ValAsp: 3.196 ± 0.074
ValGlu: 4.592 ± 0.089
ValPhe: 3.224 ± 0.075
ValGly: 3.851 ± 0.09
ValHis: 0.913 ± 0.039
ValIle: 6.195 ± 0.093
ValLys: 5.477 ± 0.095
ValLeu: 5.674 ± 0.11
ValMet: 1.444 ± 0.048
ValAsn: 2.613 ± 0.063
ValPro: 2.102 ± 0.064
ValGln: 1.442 ± 0.054
ValArg: 2.493 ± 0.067
ValSer: 3.948 ± 0.084
ValThr: 3.078 ± 0.072
ValVal: 4.27 ± 0.109
ValTrp: 0.486 ± 0.028
ValTyr: 2.329 ± 0.074
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.588 ± 0.029
TrpCys: 0.085 ± 0.013
TrpAsp: 0.442 ± 0.027
TrpGlu: 0.576 ± 0.036
TrpPhe: 0.452 ± 0.026
TrpGly: 0.545 ± 0.03
TrpHis: 0.209 ± 0.018
TrpIle: 0.869 ± 0.041
TrpLys: 0.723 ± 0.037
TrpLeu: 0.961 ± 0.039
TrpMet: 0.197 ± 0.018
TrpAsn: 0.43 ± 0.028
TrpPro: 0.205 ± 0.019
TrpGln: 0.411 ± 0.024
TrpArg: 0.381 ± 0.023
TrpSer: 0.452 ± 0.03
TrpThr: 0.342 ± 0.025
TrpVal: 0.402 ± 0.024
TrpTrp: 0.12 ± 0.014
TrpTyr: 0.315 ± 0.025
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.152 ± 0.064
TyrCys: 0.501 ± 0.028
TyrAsp: 1.682 ± 0.05
TyrGlu: 2.764 ± 0.077
TyrPhe: 1.951 ± 0.063
TyrGly: 2.644 ± 0.071
TyrHis: 0.645 ± 0.031
TyrIle: 3.212 ± 0.073
TyrLys: 2.762 ± 0.066
TyrLeu: 3.846 ± 0.095
TyrMet: 0.746 ± 0.034
TyrAsn: 1.555 ± 0.053
TyrPro: 1.739 ± 0.052
TyrGln: 1.033 ± 0.043
TyrArg: 1.674 ± 0.055
TyrSer: 2.312 ± 0.065
TyrThr: 1.56 ± 0.043
TyrVal: 1.956 ± 0.059
TyrTrp: 0.343 ± 0.023
TyrTyr: 1.476 ± 0.059
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1982 proteins (608875 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
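A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual procedure: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 replicates, and that the error is the standard deviation across replicates. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(proteins, pair, replicates=100, seed=0):
    """Mean and standard deviation (per mille) of one dipeptide's frequency
    across bootstrap replicates that resample whole proteins."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]  # count once, reuse
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as the input holds, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy = ["MKKLLE", "AKKL", "GKKA", "LLEKK"]  # hypothetical sequences
mean, error = bootstrap_permille(toy, "KK")
```

Resampling whole proteins rather than individual residues keeps neighbouring residues together, so each replicate still contains only dipeptides that occur in real sequences.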

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski