Amino acid dipeptide frequency for Desulfovibrio ferrophilus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
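The counting described above can be sketched in a few lines of Python. This is a minimal illustration and not the code used to build the table: the function names, the use of one-letter codes, the mapping of every non-standard letter to X (Xaa), and the use of the uniform 2.5‰ value as the only reference point are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids, one-letter codes
UNIFORM_PER_MILLE = 1000.0 / 400    # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters is treated as X (Xaa).
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per-mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"
```

Applied to values from the table, `classify(9.702)` (AlaAla) gives "over" and `classify(0.233)` (CysCys) gives "under".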

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
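As a small worked example of the unit conversion, the helpers below turn a per-mille value into a fraction or a percentage. The function names are hypothetical and only serve the illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 per mille = 0.1 percent)."""
    return value / 10.0


# Example with the AlaAla entry of the table (9.702 per mille).
ala_ala = 9.702
print(f"fraction: {per_mille_to_fraction(ala_ala):.6f}")
print(f"percent:  {per_mille_to_percent(ala_ala):.4f}")
```

For AlaAla this corresponds to a fraction of about 0.0097, or roughly 0.97%.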

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.702 ± 0.126
AlaCys: 1.309 ± 0.035
AlaAsp: 5.04 ± 0.071
AlaGlu: 6.43 ± 0.091
AlaPhe: 3.513 ± 0.056
AlaGly: 8.004 ± 0.101
AlaHis: 1.877 ± 0.046
AlaIle: 4.72 ± 0.076
AlaLys: 4.413 ± 0.079
AlaLeu: 10.982 ± 0.108
AlaMet: 3.207 ± 0.056
AlaAsn: 2.535 ± 0.05
AlaPro: 3.95 ± 0.066
AlaGln: 3.601 ± 0.06
AlaArg: 5.615 ± 0.072
AlaSer: 5.277 ± 0.069
AlaThr: 4.579 ± 0.069
AlaVal: 7.161 ± 0.101
AlaTrp: 1.207 ± 0.033
AlaTyr: 2.375 ± 0.044
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.134 ± 0.029
CysCys: 0.233 ± 0.017
CysAsp: 0.62 ± 0.024
CysGlu: 0.639 ± 0.026
CysPhe: 0.475 ± 0.02
CysGly: 1.339 ± 0.043
CysHis: 0.534 ± 0.039
CysIle: 0.709 ± 0.031
CysLys: 0.519 ± 0.022
CysLeu: 1.266 ± 0.032
CysMet: 0.335 ± 0.019
CysAsn: 0.389 ± 0.02
CysPro: 0.852 ± 0.025
CysGln: 0.357 ± 0.018
CysArg: 0.742 ± 0.026
CysSer: 0.8 ± 0.03
CysThr: 0.642 ± 0.027
CysVal: 0.869 ± 0.03
CysTrp: 0.158 ± 0.014
CysTyr: 0.291 ± 0.016
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.019 ± 0.079
AspCys: 0.741 ± 0.028
AspAsp: 3.325 ± 0.094
AspGlu: 4.035 ± 0.068
AspPhe: 2.569 ± 0.054
AspGly: 4.494 ± 0.079
AspHis: 1.199 ± 0.032
AspIle: 3.629 ± 0.059
AspLys: 2.715 ± 0.055
AspLeu: 5.95 ± 0.089
AspMet: 1.738 ± 0.039
AspAsn: 1.786 ± 0.044
AspPro: 2.705 ± 0.054
AspGln: 1.818 ± 0.044
AspArg: 3.106 ± 0.056
AspSer: 3.136 ± 0.064
AspThr: 2.722 ± 0.074
AspVal: 4.13 ± 0.071
AspTrp: 0.804 ± 0.027
AspTyr: 1.773 ± 0.04
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.696 ± 0.079
GluCys: 0.616 ± 0.026
GluAsp: 3.571 ± 0.066
GluGlu: 4.261 ± 0.079
GluPhe: 2.435 ± 0.049
GluGly: 4.477 ± 0.063
GluHis: 1.479 ± 0.034
GluIle: 3.883 ± 0.058
GluLys: 3.066 ± 0.061
GluLeu: 7.052 ± 0.091
GluMet: 2.112 ± 0.042
GluAsn: 2.176 ± 0.046
GluPro: 2.43 ± 0.053
GluGln: 2.981 ± 0.056
GluArg: 4.332 ± 0.081
GluSer: 3.336 ± 0.05
GluThr: 3.49 ± 0.059
GluVal: 4.614 ± 0.069
GluTrp: 0.606 ± 0.023
GluTyr: 1.622 ± 0.041
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.468 ± 0.058
PheCys: 0.575 ± 0.024
PheAsp: 2.391 ± 0.049
PheGlu: 2.473 ± 0.054
PhePhe: 1.767 ± 0.048
PheGly: 3.247 ± 0.068
PheHis: 0.794 ± 0.027
PheIle: 2.124 ± 0.042
PheLys: 1.788 ± 0.044
PheLeu: 3.937 ± 0.056
PheMet: 1.227 ± 0.037
PheAsn: 1.339 ± 0.036
PhePro: 1.737 ± 0.042
PheGln: 1.105 ± 0.03
PheArg: 2.003 ± 0.039
PheSer: 2.702 ± 0.053
PheThr: 2.263 ± 0.045
PheVal: 2.732 ± 0.058
PheTrp: 0.602 ± 0.023
PheTyr: 1.105 ± 0.032
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.902 ± 0.104
GlyCys: 1.325 ± 0.037
GlyAsp: 4.338 ± 0.119
GlyGlu: 4.904 ± 0.078
GlyPhe: 3.399 ± 0.053
GlyGly: 6.631 ± 0.148
GlyHis: 1.85 ± 0.047
GlyIle: 4.656 ± 0.071
GlyLys: 4.206 ± 0.072
GlyLeu: 9.011 ± 0.108
GlyMet: 2.7 ± 0.053
GlyAsn: 2.531 ± 0.074
GlyPro: 2.88 ± 0.056
GlyGln: 3.038 ± 0.058
GlyArg: 4.806 ± 0.076
GlySer: 4.298 ± 0.072
GlyThr: 4.418 ± 0.073
GlyVal: 6.066 ± 0.079
GlyTrp: 1.06 ± 0.034
GlyTyr: 2.441 ± 0.049
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.791 ± 0.043
HisCys: 0.384 ± 0.018
HisAsp: 1.235 ± 0.035
HisGlu: 1.356 ± 0.035
HisPhe: 0.99 ± 0.029
HisGly: 1.782 ± 0.038
HisHis: 0.537 ± 0.022
HisIle: 1.117 ± 0.034
HisLys: 0.888 ± 0.031
HisLeu: 2.171 ± 0.052
HisMet: 0.526 ± 0.022
HisAsn: 0.655 ± 0.025
HisPro: 1.295 ± 0.037
HisGln: 0.678 ± 0.024
HisArg: 1.212 ± 0.038
HisSer: 1.179 ± 0.036
HisThr: 1.005 ± 0.034
HisVal: 1.423 ± 0.034
HisTrp: 0.288 ± 0.017
HisTyr: 0.614 ± 0.026
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.09 ± 0.06
IleCys: 0.762 ± 0.028
IleAsp: 2.878 ± 0.058
IleGlu: 3.415 ± 0.062
IlePhe: 2.148 ± 0.048
IleGly: 4.151 ± 0.066
IleHis: 1.138 ± 0.032
IleIle: 3.188 ± 0.06
IleLys: 2.562 ± 0.055
IleLeu: 5.684 ± 0.07
IleMet: 1.556 ± 0.044
IleAsn: 1.906 ± 0.041
IlePro: 2.91 ± 0.057
IleGln: 1.7 ± 0.042
IleArg: 3.222 ± 0.062
IleSer: 3.45 ± 0.057
IleThr: 3.022 ± 0.054
IleVal: 3.952 ± 0.063
IleTrp: 0.534 ± 0.023
IleTyr: 1.347 ± 0.035
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.937 ± 0.071
LysCys: 0.415 ± 0.021
LysAsp: 2.877 ± 0.064
LysGlu: 2.86 ± 0.065
LysPhe: 1.4 ± 0.041
LysGly: 3.645 ± 0.064
LysHis: 0.951 ± 0.035
LysIle: 2.596 ± 0.056
LysLys: 2.748 ± 0.076
LysLeu: 4.353 ± 0.065
LysMet: 1.241 ± 0.036
LysAsn: 1.623 ± 0.044
LysPro: 2.128 ± 0.045
LysGln: 1.656 ± 0.045
LysArg: 2.852 ± 0.051
LysSer: 2.44 ± 0.047
LysThr: 2.663 ± 0.05
LysVal: 3.194 ± 0.058
LysTrp: 0.452 ± 0.022
LysTyr: 1.157 ± 0.035
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.171 ± 0.124
LeuCys: 1.318 ± 0.036
LeuAsp: 6.389 ± 0.086
LeuGlu: 6.924 ± 0.09
LeuPhe: 4.141 ± 0.083
LeuGly: 8.769 ± 0.102
LeuHis: 2.062 ± 0.047
LeuIle: 5.141 ± 0.072
LeuLys: 4.901 ± 0.068
LeuLeu: 10.218 ± 0.12
LeuMet: 2.786 ± 0.048
LeuAsn: 3.312 ± 0.062
LeuPro: 5.273 ± 0.076
LeuGln: 2.98 ± 0.05
LeuArg: 6.01 ± 0.081
LeuSer: 6.849 ± 0.086
LeuThr: 6.118 ± 0.075
LeuVal: 7.257 ± 0.083
LeuTrp: 1.134 ± 0.034
LeuTyr: 2.449 ± 0.057
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.196 ± 0.056
MetCys: 0.285 ± 0.015
MetAsp: 2.01 ± 0.045
MetGlu: 1.876 ± 0.042
MetPhe: 0.947 ± 0.031
MetGly: 2.557 ± 0.054
MetHis: 0.593 ± 0.02
MetIle: 1.513 ± 0.043
MetLys: 1.372 ± 0.038
MetLeu: 2.853 ± 0.054
MetMet: 0.738 ± 0.028
MetAsn: 1.104 ± 0.034
MetPro: 1.47 ± 0.041
MetGln: 0.996 ± 0.03
MetArg: 1.745 ± 0.045
MetSer: 1.974 ± 0.043
MetThr: 1.792 ± 0.042
MetVal: 1.99 ± 0.048
MetTrp: 0.219 ± 0.014
MetTyr: 0.556 ± 0.026
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.802 ± 0.054
AsnCys: 0.433 ± 0.02
AsnAsp: 1.764 ± 0.063
AsnGlu: 1.802 ± 0.043
AsnPhe: 1.2 ± 0.034
AsnGly: 2.313 ± 0.051
AsnHis: 0.66 ± 0.024
AsnIle: 2.049 ± 0.05
AsnLys: 1.397 ± 0.037
AsnLeu: 3.429 ± 0.059
AsnMet: 0.984 ± 0.029
AsnAsn: 0.996 ± 0.039
AsnPro: 1.941 ± 0.037
AsnGln: 0.966 ± 0.033
AsnArg: 1.776 ± 0.038
AsnSer: 1.641 ± 0.044
AsnThr: 1.632 ± 0.046
AsnVal: 2.242 ± 0.047
AsnTrp: 0.418 ± 0.019
AsnTyr: 0.86 ± 0.028
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.236 ± 0.062
ProCys: 0.581 ± 0.026
ProAsp: 3.121 ± 0.055
ProGlu: 4.313 ± 0.069
ProPhe: 1.8 ± 0.04
ProGly: 4.216 ± 0.064
ProHis: 0.969 ± 0.032
ProIle: 1.997 ± 0.041
ProLys: 2.136 ± 0.046
ProLeu: 4.543 ± 0.076
ProMet: 1.225 ± 0.036
ProAsn: 1.191 ± 0.034
ProPro: 1.97 ± 0.046
ProGln: 1.63 ± 0.041
ProArg: 2.255 ± 0.047
ProSer: 2.487 ± 0.05
ProThr: 2.055 ± 0.053
ProVal: 3.702 ± 0.068
ProTrp: 0.667 ± 0.025
ProTyr: 1.178 ± 0.037
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.141 ± 0.073
GlnCys: 0.403 ± 0.019
GlnAsp: 2.06 ± 0.044
GlnGlu: 2.19 ± 0.044
GlnPhe: 1.182 ± 0.031
GlnGly: 3.019 ± 0.051
GlnHis: 0.65 ± 0.026
GlnIle: 1.746 ± 0.043
GlnLys: 1.403 ± 0.037
GlnLeu: 3.123 ± 0.056
GlnMet: 1.009 ± 0.034
GlnAsn: 1.098 ± 0.032
GlnPro: 1.318 ± 0.04
GlnGln: 1.385 ± 0.036
GlnArg: 2.156 ± 0.044
GlnSer: 1.947 ± 0.044
GlnThr: 1.823 ± 0.047
GlnVal: 2.336 ± 0.046
GlnTrp: 0.413 ± 0.022
GlnTyr: 0.838 ± 0.029
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.923 ± 0.08
ArgCys: 0.661 ± 0.021
ArgAsp: 3.198 ± 0.054
ArgGlu: 4.346 ± 0.074
ArgPhe: 2.548 ± 0.048
ArgGly: 3.806 ± 0.062
ArgHis: 1.341 ± 0.038
ArgIle: 3.749 ± 0.058
ArgLys: 3.026 ± 0.06
ArgLeu: 6.485 ± 0.078
ArgMet: 1.858 ± 0.042
ArgAsn: 1.839 ± 0.044
ArgPro: 2.494 ± 0.049
ArgGln: 2.209 ± 0.049
ArgArg: 3.845 ± 0.079
ArgSer: 3.185 ± 0.062
ArgThr: 2.9 ± 0.054
ArgVal: 4.004 ± 0.066
ArgTrp: 0.699 ± 0.025
ArgTyr: 1.658 ± 0.037
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.918 ± 0.08
SerCys: 0.777 ± 0.03
SerAsp: 3.141 ± 0.056
SerGlu: 3.494 ± 0.063
SerPhe: 2.346 ± 0.05
SerGly: 5.603 ± 0.096
SerHis: 1.164 ± 0.033
SerIle: 3.232 ± 0.057
SerLys: 2.482 ± 0.049
SerLeu: 6.178 ± 0.083
SerMet: 1.93 ± 0.036
SerAsn: 1.576 ± 0.042
SerPro: 2.76 ± 0.053
SerGln: 1.879 ± 0.046
SerArg: 3.427 ± 0.055
SerSer: 3.36 ± 0.059
SerThr: 2.934 ± 0.062
SerVal: 4.071 ± 0.069
SerTrp: 0.753 ± 0.029
SerTyr: 1.482 ± 0.04
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.059 ± 0.076
ThrCys: 0.626 ± 0.022
ThrAsp: 2.763 ± 0.051
ThrGlu: 3.079 ± 0.057
ThrPhe: 1.954 ± 0.048
ThrGly: 5.0 ± 0.071
ThrHis: 1.059 ± 0.032
ThrIle: 2.91 ± 0.055
ThrLys: 2.058 ± 0.045
ThrLeu: 5.905 ± 0.087
ThrMet: 1.574 ± 0.043
ThrAsn: 1.543 ± 0.039
ThrPro: 3.065 ± 0.054
ThrGln: 1.59 ± 0.037
ThrArg: 2.913 ± 0.055
ThrSer: 2.904 ± 0.058
ThrThr: 2.926 ± 0.065
ThrVal: 4.134 ± 0.064
ThrTrp: 0.609 ± 0.025
ThrTyr: 1.344 ± 0.031
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.967 ± 0.096
ValCys: 0.936 ± 0.031
ValAsp: 4.42 ± 0.07
ValGlu: 4.579 ± 0.059
ValPhe: 2.921 ± 0.06
ValGly: 5.216 ± 0.072
ValHis: 1.503 ± 0.036
ValIle: 3.89 ± 0.058
ValLys: 2.804 ± 0.055
ValLeu: 7.828 ± 0.086
ValMet: 2.086 ± 0.048
ValAsn: 2.417 ± 0.044
ValPro: 3.235 ± 0.063
ValGln: 2.398 ± 0.054
ValArg: 4.474 ± 0.061
ValSer: 4.312 ± 0.065
ValThr: 4.02 ± 0.061
ValVal: 5.789 ± 0.097
ValTrp: 0.739 ± 0.029
ValTyr: 1.7 ± 0.036
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.03 ± 0.03
TrpCys: 0.138 ± 0.012
TrpAsp: 0.702 ± 0.027
TrpGlu: 0.728 ± 0.024
TrpPhe: 0.48 ± 0.023
TrpGly: 0.924 ± 0.029
TrpHis: 0.235 ± 0.016
TrpIle: 0.613 ± 0.026
TrpLys: 0.633 ± 0.023
TrpLeu: 1.42 ± 0.042
TrpMet: 0.34 ± 0.017
TrpAsn: 0.446 ± 0.021
TrpPro: 0.568 ± 0.025
TrpGln: 0.447 ± 0.019
TrpArg: 0.715 ± 0.028
TrpSer: 0.641 ± 0.026
TrpThr: 0.648 ± 0.024
TrpVal: 0.755 ± 0.029
TrpTrp: 0.2 ± 0.014
TrpTyr: 0.283 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.316 ± 0.049
TyrCys: 0.386 ± 0.019
TyrAsp: 1.552 ± 0.037
TyrGlu: 1.532 ± 0.035
TyrPhe: 1.164 ± 0.035
TyrGly: 2.263 ± 0.052
TyrHis: 0.531 ± 0.021
TyrIle: 1.154 ± 0.032
TyrLys: 1.072 ± 0.036
TyrLeu: 2.814 ± 0.056
TyrMet: 0.611 ± 0.023
TyrAsn: 0.844 ± 0.029
TyrPro: 1.289 ± 0.034
TyrGln: 0.828 ± 0.032
TyrArg: 1.569 ± 0.036
TyrSer: 1.589 ± 0.036
TyrThr: 1.363 ± 0.035
TyrVal: 1.812 ± 0.035
TyrTrp: 0.382 ± 0.02
TyrTyr: 0.842 ± 0.032
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3428 proteins (1088601 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
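A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions, not the procedure used for the table: the function names, the fixed seed, the use of one-letter sequences, and the choice of the sample standard deviation as the reported error are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Std. dev. of the per-mille frequency of `pair` over protein-level resamples."""
    rng = random.Random(seed)
    # Count once per protein: (occurrences of `pair`, number of pairs in the protein).
    per_protein = [(pair_counts(s)[pair], max(len(s) - 1, 0)) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the sample standard deviation of the resampled estimates.
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residue pairs keeps the pairs within each protein together, so the estimate reflects variation between proteins.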

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski