Amino acid dipeptide frequency for Magnetospirillum sp. UT-4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 standard dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
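As an illustration of the calculation described above, the following minimal sketch counts overlapping dipeptides within each protein and compares the result with the uniform expectation of 2.5‰. The function names, the one-letter input alphabet, the mapping of non-standard residues to "X", and the use of 2.5‰ as the over/under threshold are all assumptions made for this example; the page does not state how the database computes or colours its values.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation for one of 400 dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000 / 400  # 2.5


def dipeptide_per_mille(sequences):
    """Return dipeptide frequencies in per mille over all given proteins.

    Dipeptides are counted as overlapping windows inside each protein;
    windows never span two proteins. Non-standard residues become 'X'.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the (assumed) uniform expectation."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy_proteome = ["AAAC", "ACA"]  # hypothetical toy sequences
    for pair, value in sorted(dipeptide_per_mille(toy_proteome).items()):
        print(pair, round(value, 3), classify(value))
```

With the values from the table, `classify(22.765)` for AlaAla yields "over", while `classify(0.135)` for CysCys yields "under".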

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
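The unit conversion can be sketched as follows. The helper names are hypothetical, and the estimate of the number of dipeptide windows (total amino acids minus the number of proteins, using the totals reported at the end of this page) assumes that dipeptides are counted as overlapping windows that do not span protein boundaries, which the page does not state explicitly.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def approximate_count(per_mille, n_amino_acids, n_proteins):
    """Estimate the raw number of occurrences behind a per mille value.

    Assumption: a protein of length L contributes L - 1 dipeptide windows,
    so the proteome contributes n_amino_acids - n_proteins windows in total.
    """
    n_windows = n_amino_acids - n_proteins
    return per_mille_to_fraction(per_mille) * n_windows


if __name__ == "__main__":
    # AlaAla value from the table and proteome totals from this page.
    print(per_mille_to_fraction(22.765))
    print(round(approximate_count(22.765, 1230408, 4085)))
```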

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 22.765 ± 0.272
AlaCys: 1.384 ± 0.039
AlaAsp: 7.74 ± 0.105
AlaGlu: 9.47 ± 0.112
AlaPhe: 4.347 ± 0.059
AlaGly: 12.653 ± 0.129
AlaHis: 2.452 ± 0.05
AlaIle: 5.393 ± 0.073
AlaLys: 3.934 ± 0.073
AlaLeu: 14.448 ± 0.137
AlaMet: 3.751 ± 0.061
AlaAsn: 2.72 ± 0.08
AlaPro: 6.2 ± 0.102
AlaGln: 3.725 ± 0.067
AlaArg: 10.404 ± 0.141
AlaSer: 5.392 ± 0.081
AlaThr: 6.104 ± 0.182
AlaVal: 10.869 ± 0.111
AlaTrp: 1.821 ± 0.052
AlaTyr: 2.321 ± 0.047
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.057 ± 0.032
CysCys: 0.135 ± 0.012
CysAsp: 0.582 ± 0.019
CysGlu: 0.445 ± 0.02
CysPhe: 0.323 ± 0.017
CysGly: 1.068 ± 0.032
CysHis: 0.341 ± 0.021
CysIle: 0.348 ± 0.018
CysLys: 0.237 ± 0.014
CysLeu: 0.937 ± 0.03
CysMet: 0.158 ± 0.011
CysAsn: 0.23 ± 0.016
CysPro: 0.636 ± 0.025
CysGln: 0.264 ± 0.014
CysArg: 0.927 ± 0.031
CysSer: 0.456 ± 0.022
CysThr: 0.471 ± 0.021
CysVal: 0.667 ± 0.025
CysTrp: 0.126 ± 0.011
CysTyr: 0.21 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.634 ± 0.102
AspCys: 0.572 ± 0.025
AspAsp: 3.331 ± 0.07
AspGlu: 3.551 ± 0.058
AspPhe: 2.089 ± 0.042
AspGly: 5.982 ± 0.108
AspHis: 1.416 ± 0.038
AspIle: 2.813 ± 0.049
AspLys: 1.608 ± 0.043
AspLeu: 6.371 ± 0.08
AspMet: 1.344 ± 0.037
AspAsn: 1.207 ± 0.043
AspPro: 3.578 ± 0.059
AspGln: 1.704 ± 0.038
AspArg: 4.58 ± 0.089
AspSer: 2.575 ± 0.049
AspThr: 2.757 ± 0.108
AspVal: 4.015 ± 0.061
AspTrp: 0.988 ± 0.03
AspTyr: 1.322 ± 0.038
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.544 ± 0.097
GluCys: 0.484 ± 0.021
GluAsp: 2.887 ± 0.057
GluGlu: 3.045 ± 0.059
GluPhe: 1.688 ± 0.037
GluGly: 4.616 ± 0.061
GluHis: 1.165 ± 0.033
GluIle: 2.804 ± 0.052
GluLys: 1.881 ± 0.046
GluLeu: 5.411 ± 0.091
GluMet: 1.755 ± 0.039
GluAsn: 1.098 ± 0.032
GluPro: 2.858 ± 0.073
GluGln: 1.825 ± 0.04
GluArg: 5.368 ± 0.096
GluSer: 2.297 ± 0.038
GluThr: 3.166 ± 0.058
GluVal: 4.415 ± 0.067
GluTrp: 0.705 ± 0.025
GluTyr: 1.017 ± 0.031
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.595 ± 0.062
PheCys: 0.397 ± 0.017
PheAsp: 2.424 ± 0.04
PheGlu: 1.935 ± 0.045
PhePhe: 1.183 ± 0.036
PheGly: 3.269 ± 0.059
PheHis: 0.753 ± 0.025
PheIle: 1.278 ± 0.036
PheLys: 0.898 ± 0.029
PheLeu: 3.283 ± 0.062
PheMet: 0.653 ± 0.021
PheAsn: 0.892 ± 0.026
PhePro: 1.549 ± 0.042
PheGln: 0.953 ± 0.031
PheArg: 2.394 ± 0.051
PheSer: 1.719 ± 0.048
PheThr: 1.862 ± 0.054
PheVal: 2.493 ± 0.05
PheTrp: 0.437 ± 0.021
PheTyr: 0.705 ± 0.026
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.161 ± 0.117
GlyCys: 1.051 ± 0.034
GlyAsp: 4.994 ± 0.075
GlyGlu: 4.983 ± 0.072
GlyPhe: 3.244 ± 0.056
GlyGly: 8.505 ± 0.154
GlyHis: 2.138 ± 0.047
GlyIle: 4.485 ± 0.062
GlyLys: 3.032 ± 0.061
GlyLeu: 9.399 ± 0.112
GlyMet: 2.542 ± 0.051
GlyAsn: 2.025 ± 0.068
GlyPro: 3.867 ± 0.067
GlyGln: 2.739 ± 0.051
GlyArg: 7.888 ± 0.117
GlySer: 4.304 ± 0.104
GlyThr: 4.824 ± 0.209
GlyVal: 5.869 ± 0.07
GlyTrp: 1.481 ± 0.036
GlyTyr: 2.01 ± 0.044
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.527 ± 0.054
HisCys: 0.258 ± 0.013
HisAsp: 1.295 ± 0.036
HisGlu: 0.986 ± 0.033
HisPhe: 0.853 ± 0.028
HisGly: 2.122 ± 0.045
HisHis: 0.751 ± 0.029
HisIle: 0.801 ± 0.027
HisLys: 0.512 ± 0.021
HisLeu: 2.44 ± 0.057
HisMet: 0.489 ± 0.022
HisAsn: 0.445 ± 0.018
HisPro: 1.616 ± 0.044
HisGln: 0.643 ± 0.023
HisArg: 1.894 ± 0.046
HisSer: 0.93 ± 0.029
HisThr: 0.875 ± 0.025
HisVal: 1.584 ± 0.038
HisTrp: 0.343 ± 0.017
HisTyr: 0.568 ± 0.022
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.998 ± 0.068
IleCys: 0.423 ± 0.017
IleAsp: 3.271 ± 0.048
IleGlu: 3.011 ± 0.056
IlePhe: 1.139 ± 0.034
IleGly: 4.172 ± 0.068
IleHis: 0.878 ± 0.027
IleIle: 1.598 ± 0.037
IleLys: 1.296 ± 0.04
IleLeu: 3.696 ± 0.063
IleMet: 0.755 ± 0.028
IleAsn: 1.132 ± 0.035
IlePro: 2.007 ± 0.048
IleGln: 1.044 ± 0.03
IleArg: 3.084 ± 0.06
IleSer: 1.999 ± 0.045
IleThr: 2.181 ± 0.054
IleVal: 3.19 ± 0.053
IleTrp: 0.401 ± 0.02
IleTyr: 0.817 ± 0.027
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.384 ± 0.076
LysCys: 0.19 ± 0.016
LysAsp: 1.768 ± 0.048
LysGlu: 1.396 ± 0.041
LysPhe: 0.773 ± 0.026
LysGly: 2.808 ± 0.055
LysHis: 0.56 ± 0.022
LysIle: 1.167 ± 0.032
LysLys: 1.087 ± 0.035
LysLeu: 2.744 ± 0.055
LysMet: 0.703 ± 0.028
LysAsn: 0.656 ± 0.024
LysPro: 1.916 ± 0.042
LysGln: 0.837 ± 0.028
LysArg: 2.151 ± 0.051
LysSer: 1.495 ± 0.039
LysThr: 1.558 ± 0.041
LysVal: 2.718 ± 0.054
LysTrp: 0.315 ± 0.016
LysTyr: 0.59 ± 0.022
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 16.872 ± 0.189
LeuCys: 0.875 ± 0.029
LeuAsp: 6.572 ± 0.08
LeuGlu: 5.404 ± 0.075
LeuPhe: 3.475 ± 0.067
LeuGly: 8.965 ± 0.108
LeuHis: 2.063 ± 0.044
LeuIle: 3.808 ± 0.075
LeuLys: 3.679 ± 0.07
LeuLeu: 9.88 ± 0.161
LeuMet: 2.411 ± 0.051
LeuAsn: 2.111 ± 0.055
LeuPro: 6.028 ± 0.092
LeuGln: 2.401 ± 0.052
LeuArg: 7.198 ± 0.112
LeuSer: 5.592 ± 0.067
LeuThr: 5.058 ± 0.088
LeuVal: 8.192 ± 0.096
LeuTrp: 1.261 ± 0.036
LeuTyr: 1.873 ± 0.036
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.743 ± 0.063
MetCys: 0.164 ± 0.011
MetAsp: 1.316 ± 0.037
MetGlu: 1.209 ± 0.034
MetPhe: 0.68 ± 0.024
MetGly: 1.962 ± 0.041
MetHis: 0.423 ± 0.021
MetIle: 1.039 ± 0.029
MetLys: 0.87 ± 0.027
MetLeu: 2.434 ± 0.043
MetMet: 0.653 ± 0.025
MetAsn: 0.669 ± 0.023
MetPro: 1.496 ± 0.042
MetGln: 0.652 ± 0.022
MetArg: 1.802 ± 0.04
MetSer: 1.429 ± 0.032
MetThr: 1.7 ± 0.043
MetVal: 2.208 ± 0.051
MetTrp: 0.223 ± 0.015
MetTyr: 0.307 ± 0.017
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.802 ± 0.087
AsnCys: 0.245 ± 0.015
AsnAsp: 1.255 ± 0.058
AsnGlu: 0.963 ± 0.031
AsnPhe: 0.731 ± 0.021
AsnGly: 1.981 ± 0.075
AsnHis: 0.514 ± 0.024
AsnIle: 1.018 ± 0.033
AsnLys: 0.553 ± 0.021
AsnLeu: 2.372 ± 0.058
AsnMet: 0.493 ± 0.023
AsnAsn: 0.519 ± 0.031
AsnPro: 1.604 ± 0.037
AsnGln: 0.664 ± 0.027
AsnArg: 1.723 ± 0.042
AsnSer: 0.952 ± 0.037
AsnThr: 1.101 ± 0.068
AsnVal: 1.666 ± 0.044
AsnTrp: 0.295 ± 0.021
AsnTyr: 0.501 ± 0.021
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.789 ± 0.134
ProCys: 0.425 ± 0.021
ProAsp: 3.855 ± 0.066
ProGlu: 3.933 ± 0.07
ProPhe: 1.892 ± 0.045
ProGly: 5.328 ± 0.083
ProHis: 1.202 ± 0.034
ProIle: 1.792 ± 0.037
ProLys: 1.509 ± 0.041
ProLeu: 5.174 ± 0.09
ProMet: 1.265 ± 0.034
ProAsn: 1.097 ± 0.029
ProPro: 3.971 ± 0.115
ProGln: 1.43 ± 0.044
ProArg: 3.422 ± 0.07
ProSer: 2.354 ± 0.053
ProThr: 2.247 ± 0.05
ProVal: 4.672 ± 0.074
ProTrp: 0.749 ± 0.022
ProTyr: 1.086 ± 0.031
ProXaa: 0.001 ± 0.001
Gln
GlnAla: 4.344 ± 0.078
GlnCys: 0.246 ± 0.015
GlnAsp: 1.296 ± 0.035
GlnGlu: 1.296 ± 0.036
GlnPhe: 0.858 ± 0.028
GlnGly: 2.517 ± 0.05
GlnHis: 0.582 ± 0.022
GlnIle: 1.242 ± 0.033
GlnLys: 0.733 ± 0.028
GlnLeu: 2.658 ± 0.051
GlnMet: 0.814 ± 0.027
GlnAsn: 0.566 ± 0.022
GlnPro: 1.718 ± 0.038
GlnGln: 0.944 ± 0.034
GlnArg: 2.272 ± 0.045
GlnSer: 1.369 ± 0.035
GlnThr: 1.395 ± 0.04
GlnVal: 2.557 ± 0.051
GlnTrp: 0.384 ± 0.018
GlnTyr: 0.535 ± 0.025
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.139 ± 0.136
ArgCys: 0.68 ± 0.027
ArgAsp: 4.459 ± 0.078
ArgGlu: 4.056 ± 0.073
ArgPhe: 2.933 ± 0.061
ArgGly: 5.028 ± 0.08
ArgHis: 2.277 ± 0.055
ArgIle: 3.794 ± 0.059
ArgLys: 2.284 ± 0.057
ArgLeu: 9.885 ± 0.141
ArgMet: 2.27 ± 0.05
ArgAsn: 1.649 ± 0.038
ArgPro: 4.681 ± 0.098
ArgGln: 2.8 ± 0.048
ArgArg: 7.689 ± 0.168
ArgSer: 3.352 ± 0.065
ArgThr: 3.423 ± 0.05
ArgVal: 5.414 ± 0.081
ArgTrp: 1.087 ± 0.034
ArgTyr: 1.551 ± 0.035
ArgXaa: 0.001 ± 0.001
Ser
SerAla: 5.734 ± 0.107
SerCys: 0.484 ± 0.024
SerAsp: 2.485 ± 0.051
SerGlu: 2.272 ± 0.041
SerPhe: 1.777 ± 0.043
SerGly: 5.094 ± 0.121
SerHis: 1.061 ± 0.032
SerIle: 1.831 ± 0.042
SerLys: 1.236 ± 0.035
SerLeu: 4.828 ± 0.069
SerMet: 1.087 ± 0.032
SerAsn: 1.024 ± 0.04
SerPro: 2.716 ± 0.051
SerGln: 1.37 ± 0.035
SerArg: 3.437 ± 0.055
SerSer: 2.337 ± 0.064
SerThr: 2.206 ± 0.061
SerVal: 3.562 ± 0.054
SerTrp: 0.685 ± 0.027
SerTyr: 1.015 ± 0.033
SerXaa: 0.001 ± 0.001
Thr
ThrAla: 6.182 ± 0.181
ThrCys: 0.471 ± 0.021
ThrAsp: 2.663 ± 0.099
ThrGlu: 2.529 ± 0.049
ThrPhe: 1.791 ± 0.052
ThrGly: 4.956 ± 0.151
ThrHis: 0.945 ± 0.027
ThrIle: 2.401 ± 0.064
ThrLys: 1.16 ± 0.034
ThrLeu: 5.709 ± 0.112
ThrMet: 1.115 ± 0.03
ThrAsn: 1.133 ± 0.073
ThrPro: 2.976 ± 0.052
ThrGln: 1.226 ± 0.042
ThrArg: 3.223 ± 0.054
ThrSer: 2.094 ± 0.067
ThrThr: 2.491 ± 0.127
ThrVal: 4.605 ± 0.125
ThrTrp: 0.683 ± 0.036
ThrTyr: 1.084 ± 0.052
ThrXaa: 0.0 ± 0.0
Val
ValAla: 11.088 ± 0.105
ValCys: 0.73 ± 0.027
ValAsp: 4.233 ± 0.064
ValGlu: 5.023 ± 0.07
ValPhe: 2.647 ± 0.048
ValGly: 5.676 ± 0.077
ValHis: 1.551 ± 0.038
ValIle: 3.274 ± 0.059
ValLys: 2.359 ± 0.05
ValLeu: 8.364 ± 0.111
ValMet: 1.974 ± 0.042
ValAsn: 1.921 ± 0.055
ValPro: 4.147 ± 0.067
ValGln: 1.898 ± 0.037
ValArg: 5.755 ± 0.079
ValSer: 3.83 ± 0.063
ValThr: 4.389 ± 0.13
ValVal: 7.327 ± 0.09
ValTrp: 0.953 ± 0.027
ValTyr: 1.381 ± 0.034
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.391 ± 0.039
TrpCys: 0.159 ± 0.012
TrpAsp: 0.704 ± 0.025
TrpGlu: 0.566 ± 0.024
TrpPhe: 0.476 ± 0.019
TrpGly: 0.967 ± 0.03
TrpHis: 0.377 ± 0.019
TrpIle: 0.522 ± 0.019
TrpLys: 0.433 ± 0.021
TrpLeu: 1.64 ± 0.048
TrpMet: 0.365 ± 0.018
TrpAsn: 0.365 ± 0.021
TrpPro: 0.69 ± 0.026
TrpGln: 0.552 ± 0.019
TrpArg: 1.41 ± 0.034
TrpSer: 0.74 ± 0.028
TrpThr: 0.684 ± 0.033
TrpVal: 0.936 ± 0.032
TrpTrp: 0.229 ± 0.016
TrpTyr: 0.286 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.142 ± 0.04
TyrCys: 0.245 ± 0.014
TyrAsp: 1.305 ± 0.032
TyrGlu: 0.953 ± 0.026
TyrPhe: 0.772 ± 0.027
TyrGly: 1.869 ± 0.037
TyrHis: 0.48 ± 0.02
TyrIle: 0.684 ± 0.027
TyrLys: 0.514 ± 0.023
TyrLeu: 2.044 ± 0.04
TyrMet: 0.384 ± 0.017
TyrAsn: 0.493 ± 0.022
TyrPro: 0.94 ± 0.03
TyrGln: 0.657 ± 0.024
TyrArg: 1.832 ± 0.042
TyrSer: 1.03 ± 0.034
TyrThr: 0.945 ± 0.034
TyrVal: 1.559 ± 0.039
TyrTrp: 0.328 ± 0.018
TyrTyr: 0.491 ± 0.02
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.001 ± 0.001
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.001 ± 0.001
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.001 ± 0.001
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4085 proteins (1230408 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
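A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is taken as the error. This is only an illustration under stated assumptions; the function names, the fixed random seed, and the use of the sample standard deviation as the error measure are not taken from the source, which gives only the number of rounds (100) and the resampling level (proteins).

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide in per mille, using overlapping windows."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each round draws len(sequences) proteins with replacement, so whole
    proteins (not individual residues) are the resampling unit.
    """
    rng = random.Random(seed)  # fixed seed only for reproducibility here
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["AAAC", "ACAA", "CCAA", "ACCA"]  # hypothetical sequences
    print(pair_per_mille(toy_proteome, "AA"), bootstrap_error(toy_proteome, "AA"))
```

Resampling at the protein level keeps residues from the same protein together, so the error reflects variation between proteins rather than between individual positions.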

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski