Amino acid dipepetide frequency for Sitophilus oryzae (Rice weevil) (Curculio oryzae)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.384AlaAla: 4.384 ± 0.046
1.031AlaCys: 1.031 ± 0.02
2.768AlaAsp: 2.768 ± 0.021
3.803AlaGlu: 3.803 ± 0.029
2.004AlaPhe: 2.004 ± 0.016
3.053AlaGly: 3.053 ± 0.026
1.21AlaHis: 1.21 ± 0.011
3.13AlaIle: 3.13 ± 0.022
3.731AlaLys: 3.731 ± 0.038
5.193AlaLeu: 5.193 ± 0.034
1.181AlaMet: 1.181 ± 0.013
2.529AlaAsn: 2.529 ± 0.019
2.868AlaPro: 2.868 ± 0.03
2.388AlaGln: 2.388 ± 0.02
2.583AlaArg: 2.583 ± 0.016
4.469AlaSer: 4.469 ± 0.028
3.24AlaThr: 3.24 ± 0.028
3.632AlaVal: 3.632 ± 0.024
0.494AlaTrp: 0.494 ± 0.007
1.567AlaTyr: 1.567 ± 0.013
0.001AlaXaa: 0.001 ± 0.0
Cys
0.981CysAla: 0.981 ± 0.015
0.471CysCys: 0.471 ± 0.009
1.229CysAsp: 1.229 ± 0.016
1.131CysGlu: 1.131 ± 0.015
0.73CysPhe: 0.73 ± 0.009
1.213CysGly: 1.213 ± 0.027
0.487CysHis: 0.487 ± 0.007
1.119CysIle: 1.119 ± 0.021
1.278CysLys: 1.278 ± 0.02
1.75CysLeu: 1.75 ± 0.021
0.358CysMet: 0.358 ± 0.006
1.045CysAsn: 1.045 ± 0.015
1.042CysPro: 1.042 ± 0.024
0.781CysGln: 0.781 ± 0.014
0.94CysArg: 0.94 ± 0.02
1.685CysSer: 1.685 ± 0.029
1.025CysThr: 1.025 ± 0.017
1.119CysVal: 1.119 ± 0.02
0.192CysTrp: 0.192 ± 0.004
0.594CysTyr: 0.594 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
2.677AspAla: 2.677 ± 0.026
1.0AspCys: 1.0 ± 0.017
3.686AspAsp: 3.686 ± 0.024
4.339AspGlu: 4.339 ± 0.035
2.332AspPhe: 2.332 ± 0.015
2.916AspGly: 2.916 ± 0.02
1.181AspHis: 1.181 ± 0.011
4.0AspIle: 4.0 ± 0.024
3.784AspLys: 3.784 ± 0.031
5.128AspLeu: 5.128 ± 0.024
1.203AspMet: 1.203 ± 0.013
3.021AspAsn: 3.021 ± 0.022
2.381AspPro: 2.381 ± 0.026
1.916AspGln: 1.916 ± 0.015
2.485AspArg: 2.485 ± 0.02
4.569AspSer: 4.569 ± 0.025
2.92AspThr: 2.92 ± 0.02
3.434AspVal: 3.434 ± 0.021
0.572AspTrp: 0.572 ± 0.009
1.891AspTyr: 1.891 ± 0.015
0.001AspXaa: 0.001 ± 0.0
Glu
3.884GluAla: 3.884 ± 0.029
1.194GluCys: 1.194 ± 0.028
4.442GluAsp: 4.442 ± 0.031
7.121GluGlu: 7.121 ± 0.136
2.247GluPhe: 2.247 ± 0.016
3.042GluGly: 3.042 ± 0.041
1.543GluHis: 1.543 ± 0.019
4.554GluIle: 4.554 ± 0.042
6.213GluLys: 6.213 ± 0.066
6.151GluLeu: 6.151 ± 0.046
1.482GluMet: 1.482 ± 0.014
4.655GluAsn: 4.655 ± 0.034
2.837GluPro: 2.837 ± 0.038
3.015GluGln: 3.015 ± 0.035
3.64GluArg: 3.64 ± 0.03
4.918GluSer: 4.918 ± 0.037
3.904GluThr: 3.904 ± 0.054
3.994GluVal: 3.994 ± 0.035
0.587GluTrp: 0.587 ± 0.008
2.067GluTyr: 2.067 ± 0.018
0.001GluXaa: 0.001 ± 0.0
Phe
1.982PheAla: 1.982 ± 0.016
0.807PheCys: 0.807 ± 0.009
2.144PheAsp: 2.144 ± 0.015
2.333PheGlu: 2.333 ± 0.016
1.613PhePhe: 1.613 ± 0.017
2.25PheGly: 2.25 ± 0.021
0.879PheHis: 0.879 ± 0.011
2.253PheIle: 2.253 ± 0.02
2.544PheLys: 2.544 ± 0.02
3.625PheLeu: 3.625 ± 0.029
0.801PheMet: 0.801 ± 0.009
2.039PheAsn: 2.039 ± 0.018
1.609PhePro: 1.609 ± 0.014
1.503PheGln: 1.503 ± 0.013
1.808PheArg: 1.808 ± 0.016
3.085PheSer: 3.085 ± 0.022
2.103PheThr: 2.103 ± 0.018
2.36PheVal: 2.36 ± 0.018
0.442PheTrp: 0.442 ± 0.007
1.396PheTyr: 1.396 ± 0.015
0.001PheXaa: 0.001 ± 0.0
Gly
2.972GlyAla: 2.972 ± 0.026
0.915GlyCys: 0.915 ± 0.012
2.848GlyAsp: 2.848 ± 0.022
3.074GlyGlu: 3.074 ± 0.027
2.165GlyPhe: 2.165 ± 0.018
3.932GlyGly: 3.932 ± 0.054
1.356GlyHis: 1.356 ± 0.015
3.06GlyIle: 3.06 ± 0.021
3.527GlyLys: 3.527 ± 0.033
4.243GlyLeu: 4.243 ± 0.03
1.048GlyMet: 1.048 ± 0.012
2.639GlyAsn: 2.639 ± 0.019
2.472GlyPro: 2.472 ± 0.036
2.229GlyGln: 2.229 ± 0.036
2.591GlyArg: 2.591 ± 0.02
4.635GlySer: 4.635 ± 0.053
2.989GlyThr: 2.989 ± 0.022
3.034GlyVal: 3.034 ± 0.022
0.591GlyTrp: 0.591 ± 0.008
1.969GlyTyr: 1.969 ± 0.021
0.002GlyXaa: 0.002 ± 0.0
His
1.084HisAla: 1.084 ± 0.011
0.605HisCys: 0.605 ± 0.011
1.061HisAsp: 1.061 ± 0.025
1.32HisGlu: 1.32 ± 0.01
1.019HisPhe: 1.019 ± 0.011
1.193HisGly: 1.193 ± 0.011
0.91HisHis: 0.91 ± 0.016
1.56HisIle: 1.56 ± 0.013
1.547HisLys: 1.547 ± 0.013
2.45HisLeu: 2.45 ± 0.026
0.587HisMet: 0.587 ± 0.01
1.176HisAsn: 1.176 ± 0.011
1.285HisPro: 1.285 ± 0.015
1.115HisGln: 1.115 ± 0.012
1.219HisArg: 1.219 ± 0.012
1.938HisSer: 1.938 ± 0.019
1.299HisThr: 1.299 ± 0.014
1.333HisVal: 1.333 ± 0.013
0.249HisTrp: 0.249 ± 0.004
0.835HisTyr: 0.835 ± 0.008
0.001HisXaa: 0.001 ± 0.0
Ile
3.18IleAla: 3.18 ± 0.019
1.298IleCys: 1.298 ± 0.017
3.39IleAsp: 3.39 ± 0.021
4.196IleGlu: 4.196 ± 0.037
2.493IlePhe: 2.493 ± 0.024
2.809IleGly: 2.809 ± 0.018
1.442IleHis: 1.442 ± 0.013
3.701IleIle: 3.701 ± 0.024
4.505IleLys: 4.505 ± 0.039
5.503IleLeu: 5.503 ± 0.038
1.172IleMet: 1.172 ± 0.011
3.303IleAsn: 3.303 ± 0.026
3.078IlePro: 3.078 ± 0.023
2.576IleGln: 2.576 ± 0.02
2.778IleArg: 2.778 ± 0.018
4.923IleSer: 4.923 ± 0.026
3.443IleThr: 3.443 ± 0.03
3.506IleVal: 3.506 ± 0.025
0.562IleTrp: 0.562 ± 0.008
1.945IleTyr: 1.945 ± 0.014
0.001IleXaa: 0.001 ± 0.0
Lys
3.696LysAla: 3.696 ± 0.03
1.463LysCys: 1.463 ± 0.02
4.03LysAsp: 4.03 ± 0.04
5.837LysGlu: 5.837 ± 0.067
2.398LysPhe: 2.398 ± 0.02
3.129LysGly: 3.129 ± 0.029
1.721LysHis: 1.721 ± 0.014
4.732LysIle: 4.732 ± 0.033
6.768LysLys: 6.768 ± 0.106
6.427LysLeu: 6.427 ± 0.036
1.533LysMet: 1.533 ± 0.011
4.252LysAsn: 4.252 ± 0.03
3.614LysPro: 3.614 ± 0.065
3.096LysGln: 3.096 ± 0.023
3.954LysArg: 3.954 ± 0.027
5.52LysSer: 5.52 ± 0.05
4.138LysThr: 4.138 ± 0.027
4.042LysVal: 4.042 ± 0.03
0.72LysTrp: 0.72 ± 0.01
2.573LysTyr: 2.573 ± 0.015
0.002LysXaa: 0.002 ± 0.0
Leu
5.245LeuAla: 5.245 ± 0.032
1.642LeuCys: 1.642 ± 0.018
4.92LeuAsp: 4.92 ± 0.03
6.695LeuGlu: 6.695 ± 0.049
3.233LeuPhe: 3.233 ± 0.025
4.318LeuGly: 4.318 ± 0.025
2.144LeuHis: 2.144 ± 0.02
4.662LeuIle: 4.662 ± 0.028
7.077LeuLys: 7.077 ± 0.038
8.115LeuLeu: 8.115 ± 0.057
1.759LeuMet: 1.759 ± 0.017
4.973LeuAsn: 4.973 ± 0.032
4.546LeuPro: 4.546 ± 0.032
4.371LeuGln: 4.371 ± 0.028
4.598LeuArg: 4.598 ± 0.031
6.971LeuSer: 6.971 ± 0.04
5.043LeuThr: 5.043 ± 0.026
4.835LeuVal: 4.835 ± 0.029
0.854LeuTrp: 0.854 ± 0.01
2.715LeuTyr: 2.715 ± 0.02
0.003LeuXaa: 0.003 ± 0.0
Met
1.366MetAla: 1.366 ± 0.01
0.404MetCys: 0.404 ± 0.007
1.255MetAsp: 1.255 ± 0.01
1.57MetGlu: 1.57 ± 0.012
0.854MetPhe: 0.854 ± 0.01
1.151MetGly: 1.151 ± 0.014
0.462MetHis: 0.462 ± 0.007
0.986MetIle: 0.986 ± 0.01
1.427MetLys: 1.427 ± 0.012
1.746MetLeu: 1.746 ± 0.016
0.507MetMet: 0.507 ± 0.008
0.993MetAsn: 0.993 ± 0.011
0.956MetPro: 0.956 ± 0.011
0.861MetGln: 0.861 ± 0.009
0.954MetArg: 0.954 ± 0.01
1.729MetSer: 1.729 ± 0.015
1.087MetThr: 1.087 ± 0.011
1.195MetVal: 1.195 ± 0.011
0.214MetTrp: 0.214 ± 0.004
0.674MetTyr: 0.674 ± 0.008
0.0MetXaa: 0.0 ± 0.0
Asn
2.737AsnAla: 2.737 ± 0.021
1.039AsnCys: 1.039 ± 0.015
2.864AsnAsp: 2.864 ± 0.02
3.681AsnGlu: 3.681 ± 0.025
2.2AsnPhe: 2.2 ± 0.019
2.958AsnGly: 2.958 ± 0.024
1.287AsnHis: 1.287 ± 0.015
4.08AsnIle: 4.08 ± 0.029
4.022AsnLys: 4.022 ± 0.029
5.086AsnLeu: 5.086 ± 0.031
1.202AsnMet: 1.202 ± 0.01
3.642AsnAsn: 3.642 ± 0.031
2.421AsnPro: 2.421 ± 0.026
2.339AsnGln: 2.339 ± 0.029
2.426AsnArg: 2.426 ± 0.019
4.657AsnSer: 4.657 ± 0.031
3.015AsnThr: 3.015 ± 0.018
3.354AsnVal: 3.354 ± 0.018
0.5AsnTrp: 0.5 ± 0.006
1.865AsnTyr: 1.865 ± 0.016
0.001AsnXaa: 0.001 ± 0.0
Pro
2.84ProAla: 2.84 ± 0.024
0.796ProCys: 0.796 ± 0.032
2.628ProAsp: 2.628 ± 0.018
3.805ProGlu: 3.805 ± 0.075
1.663ProPhe: 1.663 ± 0.014
2.952ProGly: 2.952 ± 0.052
1.158ProHis: 1.158 ± 0.012
2.699ProIle: 2.699 ± 0.02
3.546ProLys: 3.546 ± 0.043
4.105ProLeu: 4.105 ± 0.025
0.886ProMet: 0.886 ± 0.011
2.557ProAsn: 2.557 ± 0.022
4.629ProPro: 4.629 ± 0.051
2.271ProGln: 2.271 ± 0.023
2.276ProArg: 2.276 ± 0.018
4.597ProSer: 4.597 ± 0.04
3.127ProThr: 3.127 ± 0.025
3.314ProVal: 3.314 ± 0.026
0.424ProTrp: 0.424 ± 0.007
1.576ProTyr: 1.576 ± 0.016
0.001ProXaa: 0.001 ± 0.0
Gln
2.481GlnAla: 2.481 ± 0.019
0.814GlnCys: 0.814 ± 0.018
2.05GlnAsp: 2.05 ± 0.014
3.241GlnGlu: 3.241 ± 0.036
1.449GlnPhe: 1.449 ± 0.013
1.973GlnGly: 1.973 ± 0.02
1.134GlnHis: 1.134 ± 0.013
2.505GlnIle: 2.505 ± 0.017
3.125GlnLys: 3.125 ± 0.021
3.776GlnLeu: 3.776 ± 0.029
0.941GlnMet: 0.941 ± 0.011
2.713GlnAsn: 2.713 ± 0.025
2.298GlnPro: 2.298 ± 0.037
3.085GlnGln: 3.085 ± 0.054
2.243GlnArg: 2.243 ± 0.017
3.149GlnSer: 3.149 ± 0.026
2.461GlnThr: 2.461 ± 0.02
2.463GlnVal: 2.463 ± 0.019
0.436GlnTrp: 0.436 ± 0.006
1.451GlnTyr: 1.451 ± 0.018
0.001GlnXaa: 0.001 ± 0.0
Arg
2.598ArgAla: 2.598 ± 0.018
0.954ArgCys: 0.954 ± 0.016
2.583ArgAsp: 2.583 ± 0.019
3.226ArgGlu: 3.226 ± 0.026
1.811ArgPhe: 1.811 ± 0.015
2.437ArgGly: 2.437 ± 0.023
1.406ArgHis: 1.406 ± 0.015
2.797ArgIle: 2.797 ± 0.014
4.106ArgLys: 4.106 ± 0.024
4.21ArgLeu: 4.21 ± 0.028
0.99ArgMet: 0.99 ± 0.01
2.845ArgAsn: 2.845 ± 0.019
2.416ArgPro: 2.416 ± 0.025
2.242ArgGln: 2.242 ± 0.018
3.488ArgArg: 3.488 ± 0.029
3.847ArgSer: 3.847 ± 0.035
2.611ArgThr: 2.611 ± 0.016
2.552ArgVal: 2.552 ± 0.019
0.511ArgTrp: 0.511 ± 0.007
1.586ArgTyr: 1.586 ± 0.012
0.001ArgXaa: 0.001 ± 0.0
Ser
4.306SerAla: 4.306 ± 0.033
1.434SerCys: 1.434 ± 0.025
4.829SerAsp: 4.829 ± 0.034
5.411SerGlu: 5.411 ± 0.044
2.931SerPhe: 2.931 ± 0.02
4.695SerGly: 4.695 ± 0.049
1.782SerHis: 1.782 ± 0.016
4.372SerIle: 4.372 ± 0.025
5.557SerLys: 5.557 ± 0.048
6.95SerLeu: 6.95 ± 0.04
1.577SerMet: 1.577 ± 0.014
4.517SerAsn: 4.517 ± 0.03
4.679SerPro: 4.679 ± 0.057
3.409SerGln: 3.409 ± 0.027
3.962SerArg: 3.962 ± 0.03
9.014SerSer: 9.014 ± 0.084
5.349SerThr: 5.349 ± 0.043
4.848SerVal: 4.848 ± 0.026
0.756SerTrp: 0.756 ± 0.01
2.39SerTyr: 2.39 ± 0.021
0.001SerXaa: 0.001 ± 0.0
Thr
3.265ThrAla: 3.265 ± 0.024
1.117ThrCys: 1.117 ± 0.02
3.187ThrAsp: 3.187 ± 0.021
4.08ThrGlu: 4.08 ± 0.049
2.168ThrPhe: 2.168 ± 0.015
3.115ThrGly: 3.115 ± 0.023
1.219ThrHis: 1.219 ± 0.011
3.397ThrIle: 3.397 ± 0.027
3.813ThrLys: 3.813 ± 0.027
4.998ThrLeu: 4.998 ± 0.024
1.053ThrMet: 1.053 ± 0.011
3.04ThrAsn: 3.04 ± 0.019
3.533ThrPro: 3.533 ± 0.027
2.185ThrGln: 2.185 ± 0.019
2.385ThrArg: 2.385 ± 0.014
5.172ThrSer: 5.172 ± 0.046
4.258ThrThr: 4.258 ± 0.058
3.848ThrVal: 3.848 ± 0.054
0.55ThrTrp: 0.55 ± 0.008
1.746ThrTyr: 1.746 ± 0.014
0.001ThrXaa: 0.001 ± 0.0
Val
3.544ValAla: 3.544 ± 0.024
1.24ValCys: 1.24 ± 0.015
3.226ValAsp: 3.226 ± 0.02
4.095ValGlu: 4.095 ± 0.044
2.323ValPhe: 2.323 ± 0.019
2.823ValGly: 2.823 ± 0.021
1.374ValHis: 1.374 ± 0.013
3.624ValIle: 3.624 ± 0.035
4.137ValLys: 4.137 ± 0.034
5.348ValLeu: 5.348 ± 0.031
1.169ValMet: 1.169 ± 0.01
3.04ValAsn: 3.04 ± 0.019
3.309ValPro: 3.309 ± 0.031
2.553ValGln: 2.553 ± 0.021
2.652ValArg: 2.652 ± 0.017
4.609ValSer: 4.609 ± 0.022
3.709ValThr: 3.709 ± 0.045
3.919ValVal: 3.919 ± 0.033
0.596ValTrp: 0.596 ± 0.008
1.856ValTyr: 1.856 ± 0.015
0.001ValXaa: 0.001 ± 0.0
Trp
0.483TrpAla: 0.483 ± 0.007
0.188TrpCys: 0.188 ± 0.004
0.568TrpAsp: 0.568 ± 0.008
0.567TrpGlu: 0.567 ± 0.008
0.445TrpPhe: 0.445 ± 0.007
0.479TrpGly: 0.479 ± 0.007
0.227TrpHis: 0.227 ± 0.005
0.607TrpIle: 0.607 ± 0.009
0.729TrpLys: 0.729 ± 0.01
0.974TrpLeu: 0.974 ± 0.011
0.252TrpMet: 0.252 ± 0.005
0.575TrpAsn: 0.575 ± 0.008
0.397TrpPro: 0.397 ± 0.007
0.404TrpGln: 0.404 ± 0.006
0.554TrpArg: 0.554 ± 0.007
0.746TrpSer: 0.746 ± 0.01
0.575TrpThr: 0.575 ± 0.008
0.499TrpVal: 0.499 ± 0.008
0.166TrpTrp: 0.166 ± 0.004
0.356TrpTyr: 0.356 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.556TyrAla: 1.556 ± 0.014
0.757TyrCys: 0.757 ± 0.009
1.725TyrAsp: 1.725 ± 0.016
1.964TyrGlu: 1.964 ± 0.017
1.5TyrPhe: 1.5 ± 0.014
1.851TyrGly: 1.851 ± 0.019
0.865TyrHis: 0.865 ± 0.009
1.966TyrIle: 1.966 ± 0.014
2.206TyrLys: 2.206 ± 0.016
2.939TyrLeu: 2.939 ± 0.02
0.69TyrMet: 0.69 ± 0.009
1.82TyrAsn: 1.82 ± 0.016
1.476TyrPro: 1.476 ± 0.016
1.448TyrGln: 1.448 ± 0.017
1.68TyrArg: 1.68 ± 0.015
2.534TyrSer: 2.534 ± 0.02
1.833TyrThr: 1.833 ± 0.016
1.865TyrVal: 1.865 ± 0.014
0.376TyrTrp: 0.376 ± 0.007
1.292TyrTyr: 1.292 ± 0.014
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.002XaaPhe: 0.002 ± 0.0
0.002XaaGly: 0.002 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.003XaaLys: 0.003 ± 0.0
0.002XaaLeu: 0.002 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.002XaaAsn: 0.002 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.002XaaSer: 0.002 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 20351 proteins (12485201 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski