Amino acid dipepetide frequency for Loktanella sp. 3ANDIMAR09

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.525AlaAla: 17.525 ± 0.213
1.169AlaCys: 1.169 ± 0.035
8.347AlaAsp: 8.347 ± 0.093
7.018AlaGlu: 7.018 ± 0.094
4.287AlaPhe: 4.287 ± 0.066
10.972AlaGly: 10.972 ± 0.124
2.276AlaHis: 2.276 ± 0.05
6.274AlaIle: 6.274 ± 0.084
3.47AlaLys: 3.47 ± 0.071
13.615AlaLeu: 13.615 ± 0.168
3.884AlaMet: 3.884 ± 0.062
2.929AlaAsn: 2.929 ± 0.049
5.859AlaPro: 5.859 ± 0.103
5.235AlaGln: 5.235 ± 0.085
8.843AlaArg: 8.843 ± 0.123
5.424AlaSer: 5.424 ± 0.074
6.991AlaThr: 6.991 ± 0.091
8.775AlaVal: 8.775 ± 0.106
1.394AlaTrp: 1.394 ± 0.038
2.582AlaTyr: 2.582 ± 0.049
0.0AlaXaa: 0.0 ± 0.0
Cys
1.198CysAla: 1.198 ± 0.034
0.101CysCys: 0.101 ± 0.01
0.705CysAsp: 0.705 ± 0.026
0.405CysGlu: 0.405 ± 0.022
0.299CysPhe: 0.299 ± 0.02
0.941CysGly: 0.941 ± 0.03
0.256CysHis: 0.256 ± 0.016
0.446CysIle: 0.446 ± 0.022
0.2CysLys: 0.2 ± 0.015
0.8CysLeu: 0.8 ± 0.029
0.172CysMet: 0.172 ± 0.013
0.218CysAsn: 0.218 ± 0.015
0.481CysPro: 0.481 ± 0.028
0.251CysGln: 0.251 ± 0.016
0.534CysArg: 0.534 ± 0.024
0.366CysSer: 0.366 ± 0.017
0.472CysThr: 0.472 ± 0.022
0.664CysVal: 0.664 ± 0.027
0.102CysTrp: 0.102 ± 0.01
0.193CysTyr: 0.193 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
8.97AspAla: 8.97 ± 0.106
0.554AspCys: 0.554 ± 0.026
4.686AspAsp: 4.686 ± 0.115
3.084AspGlu: 3.084 ± 0.057
2.466AspPhe: 2.466 ± 0.049
6.425AspGly: 6.425 ± 0.103
1.573AspHis: 1.573 ± 0.042
3.447AspIle: 3.447 ± 0.053
1.585AspLys: 1.585 ± 0.043
7.129AspLeu: 7.129 ± 0.098
1.974AspMet: 1.974 ± 0.041
1.513AspAsn: 1.513 ± 0.044
4.033AspPro: 4.033 ± 0.069
2.248AspGln: 2.248 ± 0.047
5.141AspArg: 5.141 ± 0.08
2.45AspSer: 2.45 ± 0.053
3.806AspThr: 3.806 ± 0.093
4.882AspVal: 4.882 ± 0.075
1.275AspTrp: 1.275 ± 0.041
1.633AspTyr: 1.633 ± 0.042
0.0AspXaa: 0.0 ± 0.0
Glu
6.434GluAla: 6.434 ± 0.101
0.319GluCys: 0.319 ± 0.019
2.978GluAsp: 2.978 ± 0.054
2.638GluGlu: 2.638 ± 0.067
1.524GluPhe: 1.524 ± 0.033
3.804GluGly: 3.804 ± 0.066
0.929GluHis: 0.929 ± 0.032
3.064GluIle: 3.064 ± 0.06
1.677GluLys: 1.677 ± 0.047
4.342GluLeu: 4.342 ± 0.081
1.572GluMet: 1.572 ± 0.039
1.689GluAsn: 1.689 ± 0.044
2.077GluPro: 2.077 ± 0.05
1.744GluGln: 1.744 ± 0.041
3.455GluArg: 3.455 ± 0.057
1.85GluSer: 1.85 ± 0.043
3.409GluThr: 3.409 ± 0.055
3.951GluVal: 3.951 ± 0.063
0.574GluTrp: 0.574 ± 0.025
0.919GluTyr: 0.919 ± 0.027
0.0GluXaa: 0.0 ± 0.0
Phe
4.68PheAla: 4.68 ± 0.062
0.406PheCys: 0.406 ± 0.019
3.066PheAsp: 3.066 ± 0.054
1.942PheGlu: 1.942 ± 0.045
1.324PhePhe: 1.324 ± 0.04
3.761PheGly: 3.761 ± 0.067
0.714PheHis: 0.714 ± 0.021
1.687PheIle: 1.687 ± 0.043
0.933PheLys: 0.933 ± 0.032
3.081PheLeu: 3.081 ± 0.065
0.922PheMet: 0.922 ± 0.032
1.051PheAsn: 1.051 ± 0.035
1.474PhePro: 1.474 ± 0.035
1.044PheGln: 1.044 ± 0.033
2.057PheArg: 2.057 ± 0.052
1.924PheSer: 1.924 ± 0.045
2.048PheThr: 2.048 ± 0.047
2.778PheVal: 2.778 ± 0.055
0.56PheTrp: 0.56 ± 0.026
0.942PheTyr: 0.942 ± 0.032
0.0PheXaa: 0.0 ± 0.0
Gly
9.758GlyAla: 9.758 ± 0.121
0.862GlyCys: 0.862 ± 0.027
5.535GlyAsp: 5.535 ± 0.128
4.107GlyGlu: 4.107 ± 0.065
3.579GlyPhe: 3.579 ± 0.057
7.483GlyGly: 7.483 ± 0.131
1.898GlyHis: 1.898 ± 0.044
4.563GlyIle: 4.563 ± 0.072
2.786GlyLys: 2.786 ± 0.062
8.846GlyLeu: 8.846 ± 0.11
2.583GlyMet: 2.583 ± 0.054
2.214GlyAsn: 2.214 ± 0.101
3.854GlyPro: 3.854 ± 0.064
3.384GlyGln: 3.384 ± 0.059
5.699GlyArg: 5.699 ± 0.072
4.149GlySer: 4.149 ± 0.063
5.197GlyThr: 5.197 ± 0.084
6.557GlyVal: 6.557 ± 0.092
1.5GlyTrp: 1.5 ± 0.037
2.234GlyTyr: 2.234 ± 0.054
0.0GlyXaa: 0.0 ± 0.0
His
2.461HisAla: 2.461 ± 0.056
0.237HisCys: 0.237 ± 0.016
1.475HisAsp: 1.475 ± 0.042
0.839HisGlu: 0.839 ± 0.028
0.775HisPhe: 0.775 ± 0.027
1.798HisGly: 1.798 ± 0.047
0.568HisHis: 0.568 ± 0.028
0.987HisIle: 0.987 ± 0.028
0.498HisLys: 0.498 ± 0.021
2.029HisLeu: 2.029 ± 0.045
0.531HisMet: 0.531 ± 0.023
0.456HisAsn: 0.456 ± 0.024
1.354HisPro: 1.354 ± 0.034
0.541HisGln: 0.541 ± 0.024
1.327HisArg: 1.327 ± 0.038
0.797HisSer: 0.797 ± 0.03
0.857HisThr: 0.857 ± 0.028
1.502HisVal: 1.502 ± 0.043
0.329HisTrp: 0.329 ± 0.016
0.535HisTyr: 0.535 ± 0.023
0.0HisXaa: 0.0 ± 0.0
Ile
7.652IleAla: 7.652 ± 0.093
0.619IleCys: 0.619 ± 0.021
3.949IleAsp: 3.949 ± 0.065
2.978IleGlu: 2.978 ± 0.066
1.691IlePhe: 1.691 ± 0.043
4.945IleGly: 4.945 ± 0.078
0.936IleHis: 0.936 ± 0.028
2.411IleIle: 2.411 ± 0.055
1.453IleLys: 1.453 ± 0.04
4.514IleLeu: 4.514 ± 0.071
1.271IleMet: 1.271 ± 0.035
1.455IleAsn: 1.455 ± 0.037
2.392IlePro: 2.392 ± 0.05
1.28IleGln: 1.28 ± 0.032
3.117IleArg: 3.117 ± 0.056
2.67IleSer: 2.67 ± 0.052
3.243IleThr: 3.243 ± 0.064
4.065IleVal: 4.065 ± 0.061
0.711IleTrp: 0.711 ± 0.027
1.267IleTyr: 1.267 ± 0.038
0.0IleXaa: 0.0 ± 0.0
Lys
3.37LysAla: 3.37 ± 0.066
0.164LysCys: 0.164 ± 0.011
1.64LysAsp: 1.64 ± 0.045
1.229LysGlu: 1.229 ± 0.044
0.89LysPhe: 0.89 ± 0.032
2.379LysGly: 2.379 ± 0.052
0.555LysHis: 0.555 ± 0.026
1.446LysIle: 1.446 ± 0.042
1.013LysLys: 1.013 ± 0.033
2.626LysLeu: 2.626 ± 0.056
0.832LysMet: 0.832 ± 0.033
0.717LysAsn: 0.717 ± 0.029
1.631LysPro: 1.631 ± 0.039
0.829LysGln: 0.829 ± 0.03
2.057LysArg: 2.057 ± 0.047
1.796LysSer: 1.796 ± 0.045
1.861LysThr: 1.861 ± 0.044
2.043LysVal: 2.043 ± 0.056
0.345LysTrp: 0.345 ± 0.018
0.555LysTyr: 0.555 ± 0.026
0.0LysXaa: 0.0 ± 0.0
Leu
12.308LeuAla: 12.308 ± 0.138
0.914LeuCys: 0.914 ± 0.033
6.552LeuAsp: 6.552 ± 0.083
4.212LeuGlu: 4.212 ± 0.068
3.511LeuPhe: 3.511 ± 0.063
8.136LeuGly: 8.136 ± 0.088
1.84LeuHis: 1.84 ± 0.045
5.35LeuIle: 5.35 ± 0.088
2.666LeuLys: 2.666 ± 0.061
8.476LeuLeu: 8.476 ± 0.119
2.745LeuMet: 2.745 ± 0.053
2.683LeuAsn: 2.683 ± 0.048
5.272LeuPro: 5.272 ± 0.08
2.762LeuGln: 2.762 ± 0.056
7.011LeuArg: 7.011 ± 0.099
6.266LeuSer: 6.266 ± 0.071
6.945LeuThr: 6.945 ± 0.101
6.524LeuVal: 6.524 ± 0.086
1.255LeuTrp: 1.255 ± 0.042
1.795LeuTyr: 1.795 ± 0.047
0.0LeuXaa: 0.0 ± 0.0
Met
3.509MetAla: 3.509 ± 0.058
0.21MetCys: 0.21 ± 0.016
1.701MetAsp: 1.701 ± 0.043
1.193MetGlu: 1.193 ± 0.037
0.857MetPhe: 0.857 ± 0.031
2.357MetGly: 2.357 ± 0.05
0.479MetHis: 0.479 ± 0.022
1.657MetIle: 1.657 ± 0.037
0.999MetLys: 0.999 ± 0.031
2.531MetLeu: 2.531 ± 0.049
0.863MetMet: 0.863 ± 0.033
0.896MetAsn: 0.896 ± 0.032
1.572MetPro: 1.572 ± 0.039
1.094MetGln: 1.094 ± 0.033
1.957MetArg: 1.957 ± 0.039
1.6MetSer: 1.6 ± 0.035
2.506MetThr: 2.506 ± 0.055
1.857MetVal: 1.857 ± 0.044
0.287MetTrp: 0.287 ± 0.017
0.323MetTyr: 0.323 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
3.433AsnAla: 3.433 ± 0.06
0.249AsnCys: 0.249 ± 0.018
1.733AsnAsp: 1.733 ± 0.083
1.115AsnGlu: 1.115 ± 0.033
0.936AsnPhe: 0.936 ± 0.031
2.421AsnGly: 2.421 ± 0.066
0.493AsnHis: 0.493 ± 0.024
1.409AsnIle: 1.409 ± 0.039
0.644AsnLys: 0.644 ± 0.027
2.452AsnLeu: 2.452 ± 0.056
0.768AsnMet: 0.768 ± 0.03
0.723AsnAsn: 0.723 ± 0.03
1.915AsnPro: 1.915 ± 0.039
0.712AsnGln: 0.712 ± 0.026
1.794AsnArg: 1.794 ± 0.037
1.086AsnSer: 1.086 ± 0.028
1.469AsnThr: 1.469 ± 0.043
1.944AsnVal: 1.944 ± 0.049
0.418AsnTrp: 0.418 ± 0.022
0.649AsnTyr: 0.649 ± 0.023
0.0AsnXaa: 0.0 ± 0.0
Pro
6.317ProAla: 6.317 ± 0.099
0.352ProCys: 0.352 ± 0.02
4.481ProAsp: 4.481 ± 0.07
3.118ProGlu: 3.118 ± 0.065
1.876ProPhe: 1.876 ± 0.048
4.343ProGly: 4.343 ± 0.066
1.038ProHis: 1.038 ± 0.03
2.264ProIle: 2.264 ± 0.053
1.461ProLys: 1.461 ± 0.038
4.59ProLeu: 4.59 ± 0.073
1.302ProMet: 1.302 ± 0.034
1.262ProAsn: 1.262 ± 0.033
2.211ProPro: 2.211 ± 0.062
1.965ProGln: 1.965 ± 0.044
2.879ProArg: 2.879 ± 0.059
2.282ProSer: 2.282 ± 0.047
2.639ProThr: 2.639 ± 0.053
4.436ProVal: 4.436 ± 0.068
0.661ProTrp: 0.661 ± 0.023
1.089ProTyr: 1.089 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
4.573GlnAla: 4.573 ± 0.067
0.205GlnCys: 0.205 ± 0.014
2.111GlnAsp: 2.111 ± 0.052
1.428GlnGlu: 1.428 ± 0.04
1.124GlnPhe: 1.124 ± 0.028
2.791GlnGly: 2.791 ± 0.056
0.609GlnHis: 0.609 ± 0.025
2.25GlnIle: 2.25 ± 0.051
0.93GlnLys: 0.93 ± 0.026
2.933GlnLeu: 2.933 ± 0.051
1.197GlnMet: 1.197 ± 0.032
0.932GlnAsn: 0.932 ± 0.031
1.709GlnPro: 1.709 ± 0.043
1.273GlnGln: 1.273 ± 0.042
2.361GlnArg: 2.361 ± 0.049
1.856GlnSer: 1.856 ± 0.041
2.371GlnThr: 2.371 ± 0.045
2.72GlnVal: 2.72 ± 0.059
0.419GlnTrp: 0.419 ± 0.02
0.556GlnTyr: 0.556 ± 0.022
0.0GlnXaa: 0.0 ± 0.0
Arg
8.41ArgAla: 8.41 ± 0.107
0.485ArgCys: 0.485 ± 0.021
5.049ArgAsp: 5.049 ± 0.065
3.048ArgGlu: 3.048 ± 0.067
2.634ArgPhe: 2.634 ± 0.056
4.69ArgGly: 4.69 ± 0.069
1.491ArgHis: 1.491 ± 0.043
3.935ArgIle: 3.935 ± 0.064
2.016ArgLys: 2.016 ± 0.042
7.002ArgLeu: 7.002 ± 0.095
2.04ArgMet: 2.04 ± 0.046
1.778ArgAsn: 1.778 ± 0.04
3.222ArgPro: 3.222 ± 0.062
2.456ArgGln: 2.456 ± 0.056
4.805ArgArg: 4.805 ± 0.085
3.198ArgSer: 3.198 ± 0.063
3.328ArgThr: 3.328 ± 0.057
4.862ArgVal: 4.862 ± 0.071
0.951ArgTrp: 0.951 ± 0.034
1.477ArgTyr: 1.477 ± 0.042
0.0ArgXaa: 0.0 ± 0.0
Ser
5.574SerAla: 5.574 ± 0.078
0.422SerCys: 0.422 ± 0.022
3.656SerAsp: 3.656 ± 0.058
2.457SerGlu: 2.457 ± 0.049
2.042SerPhe: 2.042 ± 0.043
5.012SerGly: 5.012 ± 0.078
1.023SerHis: 1.023 ± 0.037
2.349SerIle: 2.349 ± 0.05
1.327SerLys: 1.327 ± 0.04
4.658SerLeu: 4.658 ± 0.071
1.249SerMet: 1.249 ± 0.033
1.392SerAsn: 1.392 ± 0.043
2.358SerPro: 2.358 ± 0.042
1.713SerGln: 1.713 ± 0.041
3.085SerArg: 3.085 ± 0.057
2.262SerSer: 2.262 ± 0.048
2.451SerThr: 2.451 ± 0.051
3.747SerVal: 3.747 ± 0.063
0.616SerTrp: 0.616 ± 0.029
1.302SerTyr: 1.302 ± 0.034
0.0SerXaa: 0.0 ± 0.0
Thr
7.341ThrAla: 7.341 ± 0.094
0.536ThrCys: 0.536 ± 0.026
4.011ThrAsp: 4.011 ± 0.073
2.822ThrGlu: 2.822 ± 0.05
2.27ThrPhe: 2.27 ± 0.051
5.9ThrGly: 5.9 ± 0.095
1.178ThrHis: 1.178 ± 0.038
3.106ThrIle: 3.106 ± 0.058
1.402ThrLys: 1.402 ± 0.042
6.644ThrLeu: 6.644 ± 0.095
1.425ThrMet: 1.425 ± 0.038
1.415ThrAsn: 1.415 ± 0.046
3.659ThrPro: 3.659 ± 0.065
2.007ThrGln: 2.007 ± 0.047
3.754ThrArg: 3.754 ± 0.064
2.718ThrSer: 2.718 ± 0.047
3.504ThrThr: 3.504 ± 0.069
4.759ThrVal: 4.759 ± 0.082
0.676ThrTrp: 0.676 ± 0.025
1.495ThrTyr: 1.495 ± 0.041
0.0ThrXaa: 0.0 ± 0.0
Val
9.312ValAla: 9.312 ± 0.105
0.65ValCys: 0.65 ± 0.028
4.531ValAsp: 4.531 ± 0.068
3.741ValGlu: 3.741 ± 0.069
2.843ValPhe: 2.843 ± 0.053
5.569ValGly: 5.569 ± 0.076
1.263ValHis: 1.263 ± 0.035
4.319ValIle: 4.319 ± 0.064
1.997ValLys: 1.997 ± 0.05
7.266ValLeu: 7.266 ± 0.094
2.211ValMet: 2.211 ± 0.047
2.044ValAsn: 2.044 ± 0.046
3.72ValPro: 3.72 ± 0.062
2.439ValGln: 2.439 ± 0.054
4.381ValArg: 4.381 ± 0.059
4.23ValSer: 4.23 ± 0.072
5.508ValThr: 5.508 ± 0.089
5.695ValVal: 5.695 ± 0.089
1.014ValTrp: 1.014 ± 0.03
1.478ValTyr: 1.478 ± 0.042
0.0ValXaa: 0.0 ± 0.0
Trp
1.318TrpAla: 1.318 ± 0.037
0.146TrpCys: 0.146 ± 0.013
0.927TrpAsp: 0.927 ± 0.032
0.524TrpGlu: 0.524 ± 0.023
0.566TrpPhe: 0.566 ± 0.023
0.996TrpGly: 0.996 ± 0.032
0.289TrpHis: 0.289 ± 0.019
0.707TrpIle: 0.707 ± 0.03
0.389TrpLys: 0.389 ± 0.02
1.536TrpLeu: 1.536 ± 0.041
0.447TrpMet: 0.447 ± 0.021
0.407TrpAsn: 0.407 ± 0.021
0.709TrpPro: 0.709 ± 0.027
0.623TrpGln: 0.623 ± 0.024
1.039TrpArg: 1.039 ± 0.031
0.768TrpSer: 0.768 ± 0.027
0.871TrpThr: 0.871 ± 0.028
0.915TrpVal: 0.915 ± 0.027
0.235TrpTrp: 0.235 ± 0.016
0.248TrpTyr: 0.248 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.727TyrAla: 2.727 ± 0.053
0.205TyrCys: 0.205 ± 0.014
1.743TyrAsp: 1.743 ± 0.041
1.052TyrGlu: 1.052 ± 0.036
0.902TyrPhe: 0.902 ± 0.03
2.063TyrGly: 2.063 ± 0.047
0.487TyrHis: 0.487 ± 0.02
0.948TyrIle: 0.948 ± 0.028
0.532TyrLys: 0.532 ± 0.025
2.151TyrLeu: 2.151 ± 0.044
0.457TyrMet: 0.457 ± 0.022
0.59TyrAsn: 0.59 ± 0.023
1.066TyrPro: 1.066 ± 0.034
0.744TyrGln: 0.744 ± 0.029
1.58TyrArg: 1.58 ± 0.04
1.006TyrSer: 1.006 ± 0.03
1.143TyrThr: 1.143 ± 0.032
1.536TyrVal: 1.536 ± 0.037
0.339TyrTrp: 0.339 ± 0.018
0.551TyrTyr: 0.551 ± 0.025
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3527 proteins (1058224 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski