Amino acid dipepetide frequency for Bathycoccus sp. RCC1105 virus BpV1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.826AlaAla: 3.826 ± 0.682
0.72AlaCys: 0.72 ± 0.132
2.641AlaAsp: 2.641 ± 0.257
2.785AlaGlu: 2.785 ± 0.442
2.097AlaPhe: 2.097 ± 0.18
3.73AlaGly: 3.73 ± 0.61
0.944AlaHis: 0.944 ± 0.127
3.137AlaIle: 3.137 ± 0.203
4.098AlaLys: 4.098 ± 0.677
3.842AlaLeu: 3.842 ± 0.236
1.489AlaMet: 1.489 ± 0.166
3.41AlaAsn: 3.41 ± 0.51
2.641AlaPro: 2.641 ± 0.355
1.761AlaGln: 1.761 ± 0.166
2.081AlaArg: 2.081 ± 0.218
4.786AlaSer: 4.786 ± 0.566
3.634AlaThr: 3.634 ± 0.343
2.737AlaVal: 2.737 ± 0.251
0.576AlaTrp: 0.576 ± 0.099
2.337AlaTyr: 2.337 ± 0.346
0.0AlaXaa: 0.0 ± 0.0
Cys
0.736CysAla: 0.736 ± 0.099
0.192CysCys: 0.192 ± 0.066
0.688CysAsp: 0.688 ± 0.131
0.896CysGlu: 0.896 ± 0.168
0.688CysPhe: 0.688 ± 0.116
0.896CysGly: 0.896 ± 0.141
0.272CysHis: 0.272 ± 0.061
0.832CysIle: 0.832 ± 0.115
1.201CysLys: 1.201 ± 0.208
0.912CysLeu: 0.912 ± 0.139
0.448CysMet: 0.448 ± 0.099
0.832CysAsn: 0.832 ± 0.146
0.64CysPro: 0.64 ± 0.155
0.352CysGln: 0.352 ± 0.069
0.576CysArg: 0.576 ± 0.097
0.912CysSer: 0.912 ± 0.174
0.928CysThr: 0.928 ± 0.178
0.848CysVal: 0.848 ± 0.163
0.128CysTrp: 0.128 ± 0.037
0.544CysTyr: 0.544 ± 0.104
0.0CysXaa: 0.0 ± 0.0
Asp
3.137AspAla: 3.137 ± 0.285
0.656AspCys: 0.656 ± 0.115
3.57AspAsp: 3.57 ± 0.328
4.05AspGlu: 4.05 ± 0.33
2.705AspPhe: 2.705 ± 0.218
4.53AspGly: 4.53 ± 0.588
0.64AspHis: 0.64 ± 0.101
4.754AspIle: 4.754 ± 0.277
3.602AspLys: 3.602 ± 0.283
4.226AspLeu: 4.226 ± 0.351
1.633AspMet: 1.633 ± 0.206
3.313AspAsn: 3.313 ± 0.308
2.289AspPro: 2.289 ± 0.195
1.265AspGln: 1.265 ± 0.137
2.001AspArg: 2.001 ± 0.215
3.362AspSer: 3.362 ± 0.295
4.578AspThr: 4.578 ± 0.309
3.57AspVal: 3.57 ± 0.221
0.64AspTrp: 0.64 ± 0.137
2.769AspTyr: 2.769 ± 0.249
0.0AspXaa: 0.0 ± 0.0
Glu
3.105GluAla: 3.105 ± 0.574
1.024GluCys: 1.024 ± 0.151
3.49GluAsp: 3.49 ± 0.335
4.962GluGlu: 4.962 ± 0.734
2.545GluPhe: 2.545 ± 0.279
3.041GluGly: 3.041 ± 0.228
1.056GluHis: 1.056 ± 0.173
4.002GluIle: 4.002 ± 0.363
5.234GluLys: 5.234 ± 0.624
4.978GluLeu: 4.978 ± 0.476
1.873GluMet: 1.873 ± 0.212
3.858GluAsn: 3.858 ± 0.382
2.353GluPro: 2.353 ± 0.278
1.937GluGln: 1.937 ± 0.298
2.321GluArg: 2.321 ± 0.244
3.73GluSer: 3.73 ± 0.256
4.642GluThr: 4.642 ± 0.363
3.217GluVal: 3.217 ± 0.23
0.896GluTrp: 0.896 ± 0.127
2.849GluTyr: 2.849 ± 0.271
0.0GluXaa: 0.0 ± 0.0
Phe
2.033PheAla: 2.033 ± 0.225
0.736PheCys: 0.736 ± 0.111
3.105PheAsp: 3.105 ± 0.236
2.449PheGlu: 2.449 ± 0.225
1.937PhePhe: 1.937 ± 0.229
2.913PheGly: 2.913 ± 0.26
0.896PheHis: 0.896 ± 0.142
2.945PheIle: 2.945 ± 0.316
3.121PheLys: 3.121 ± 0.29
2.753PheLeu: 2.753 ± 0.247
1.313PheMet: 1.313 ± 0.188
2.433PheAsn: 2.433 ± 0.169
1.153PhePro: 1.153 ± 0.123
0.768PheGln: 0.768 ± 0.112
1.841PheArg: 1.841 ± 0.213
3.201PheSer: 3.201 ± 0.255
2.881PheThr: 2.881 ± 0.267
2.641PheVal: 2.641 ± 0.227
0.4PheTrp: 0.4 ± 0.072
1.873PheTyr: 1.873 ± 0.204
0.0PheXaa: 0.0 ± 0.0
Gly
4.082GlyAla: 4.082 ± 0.75
0.864GlyCys: 0.864 ± 0.115
4.29GlyAsp: 4.29 ± 0.441
3.378GlyGlu: 3.378 ± 0.204
2.545GlyPhe: 2.545 ± 0.239
7.779GlyGly: 7.779 ± 1.443
1.329GlyHis: 1.329 ± 0.149
4.402GlyIle: 4.402 ± 0.317
4.834GlyLys: 4.834 ± 0.306
4.306GlyLeu: 4.306 ± 0.31
1.729GlyMet: 1.729 ± 0.198
4.706GlyAsn: 4.706 ± 0.633
1.745GlyPro: 1.745 ± 0.183
1.921GlyGln: 1.921 ± 0.231
2.305GlyArg: 2.305 ± 0.254
6.755GlySer: 6.755 ± 1.142
6.707GlyThr: 6.707 ± 1.096
3.602GlyVal: 3.602 ± 0.275
0.688GlyTrp: 0.688 ± 0.124
3.137GlyTyr: 3.137 ± 0.37
0.0GlyXaa: 0.0 ± 0.0
His
1.281HisAla: 1.281 ± 0.148
0.288HisCys: 0.288 ± 0.086
0.832HisAsp: 0.832 ± 0.119
1.409HisGlu: 1.409 ± 0.201
0.64HisPhe: 0.64 ± 0.112
1.345HisGly: 1.345 ± 0.179
0.496HisHis: 0.496 ± 0.08
1.104HisIle: 1.104 ± 0.168
1.441HisLys: 1.441 ± 0.183
1.457HisLeu: 1.457 ± 0.163
0.464HisMet: 0.464 ± 0.09
1.137HisAsn: 1.137 ± 0.123
0.816HisPro: 0.816 ± 0.114
0.496HisGln: 0.496 ± 0.094
0.592HisArg: 0.592 ± 0.098
1.072HisSer: 1.072 ± 0.126
1.297HisThr: 1.297 ± 0.18
1.441HisVal: 1.441 ± 0.195
0.256HisTrp: 0.256 ± 0.057
0.944HisTyr: 0.944 ± 0.122
0.0HisXaa: 0.0 ± 0.0
Ile
3.474IleAla: 3.474 ± 0.225
0.816IleCys: 0.816 ± 0.157
4.866IleAsp: 4.866 ± 0.377
4.77IleGlu: 4.77 ± 0.416
2.625IlePhe: 2.625 ± 0.232
4.594IleGly: 4.594 ± 0.478
1.761IleHis: 1.761 ± 0.179
4.738IleIle: 4.738 ± 0.339
5.106IleLys: 5.106 ± 0.424
4.93IleLeu: 4.93 ± 0.397
1.489IleMet: 1.489 ± 0.2
4.066IleAsn: 4.066 ± 0.324
2.865IlePro: 2.865 ± 0.316
2.257IleGln: 2.257 ± 0.177
2.657IleArg: 2.657 ± 0.261
5.01IleSer: 5.01 ± 0.244
4.194IleThr: 4.194 ± 0.316
3.297IleVal: 3.297 ± 0.249
0.48IleTrp: 0.48 ± 0.082
3.073IleTyr: 3.073 ± 0.257
0.0IleXaa: 0.0 ± 0.0
Lys
3.73LysAla: 3.73 ± 0.631
1.297LysCys: 1.297 ± 0.213
3.73LysAsp: 3.73 ± 0.351
4.85LysGlu: 4.85 ± 0.895
3.105LysPhe: 3.105 ± 0.306
3.49LysGly: 3.49 ± 0.294
1.473LysHis: 1.473 ± 0.184
5.587LysIle: 5.587 ± 0.405
7.651LysLys: 7.651 ± 1.087
6.051LysLeu: 6.051 ± 0.343
2.369LysMet: 2.369 ± 0.249
4.898LysAsn: 4.898 ± 0.507
3.137LysPro: 3.137 ± 0.382
2.929LysGln: 2.929 ± 0.303
3.762LysArg: 3.762 ± 0.458
4.21LysSer: 4.21 ± 0.332
5.394LysThr: 5.394 ± 0.324
4.194LysVal: 4.194 ± 0.362
1.024LysTrp: 1.024 ± 0.132
3.826LysTyr: 3.826 ± 0.198
0.0LysXaa: 0.0 ± 0.0
Leu
4.194LeuAla: 4.194 ± 0.461
1.024LeuCys: 1.024 ± 0.143
4.402LeuAsp: 4.402 ± 0.276
4.61LeuGlu: 4.61 ± 0.494
2.673LeuPhe: 2.673 ± 0.303
4.066LeuGly: 4.066 ± 0.256
1.601LeuHis: 1.601 ± 0.149
4.322LeuIle: 4.322 ± 0.339
6.163LeuLys: 6.163 ± 0.559
5.474LeuLeu: 5.474 ± 0.52
1.745LeuMet: 1.745 ± 0.195
4.13LeuAsn: 4.13 ± 0.343
2.913LeuPro: 2.913 ± 0.256
2.449LeuGln: 2.449 ± 0.241
3.57LeuArg: 3.57 ± 0.331
5.234LeuSer: 5.234 ± 0.392
5.154LeuThr: 5.154 ± 0.383
4.61LeuVal: 4.61 ± 0.292
0.704LeuTrp: 0.704 ± 0.101
3.345LeuTyr: 3.345 ± 0.25
0.0LeuXaa: 0.0 ± 0.0
Met
1.137MetAla: 1.137 ± 0.16
0.416MetCys: 0.416 ± 0.082
1.665MetAsp: 1.665 ± 0.204
1.569MetGlu: 1.569 ± 0.268
1.249MetPhe: 1.249 ± 0.176
1.633MetGly: 1.633 ± 0.182
0.48MetHis: 0.48 ± 0.101
1.553MetIle: 1.553 ± 0.191
2.305MetLys: 2.305 ± 0.3
1.809MetLeu: 1.809 ± 0.207
0.72MetMet: 0.72 ± 0.129
2.033MetAsn: 2.033 ± 0.235
0.736MetPro: 0.736 ± 0.119
0.704MetGln: 0.704 ± 0.112
1.056MetArg: 1.056 ± 0.179
2.113MetSer: 2.113 ± 0.185
1.473MetThr: 1.473 ± 0.186
1.505MetVal: 1.505 ± 0.15
0.272MetTrp: 0.272 ± 0.074
1.585MetTyr: 1.585 ± 0.163
0.0MetXaa: 0.0 ± 0.0
Asn
3.201AsnAla: 3.201 ± 0.294
0.608AsnCys: 0.608 ± 0.124
3.201AsnAsp: 3.201 ± 0.316
3.458AsnGlu: 3.458 ± 0.267
2.913AsnPhe: 2.913 ± 0.239
4.85AsnGly: 4.85 ± 0.641
1.072AsnHis: 1.072 ± 0.133
4.658AsnIle: 4.658 ± 0.333
4.562AsnLys: 4.562 ± 0.341
4.53AsnLeu: 4.53 ± 0.405
1.905AsnMet: 1.905 ± 0.22
3.858AsnAsn: 3.858 ± 0.494
2.193AsnPro: 2.193 ± 0.184
1.937AsnGln: 1.937 ± 0.214
2.481AsnArg: 2.481 ± 0.647
4.45AsnSer: 4.45 ± 0.405
4.53AsnThr: 4.53 ± 0.521
4.962AsnVal: 4.962 ± 0.324
0.72AsnTrp: 0.72 ± 0.108
2.497AsnTyr: 2.497 ± 0.228
0.0AsnXaa: 0.0 ± 0.0
Pro
2.065ProAla: 2.065 ± 0.315
0.48ProCys: 0.48 ± 0.108
2.241ProAsp: 2.241 ± 0.234
2.881ProGlu: 2.881 ± 0.314
1.681ProPhe: 1.681 ± 0.221
2.689ProGly: 2.689 ± 0.258
0.512ProHis: 0.512 ± 0.104
2.401ProIle: 2.401 ± 0.245
2.865ProLys: 2.865 ± 0.371
2.385ProLeu: 2.385 ± 0.261
1.056ProMet: 1.056 ± 0.159
2.097ProAsn: 2.097 ± 0.185
3.698ProPro: 3.698 ± 1.27
1.169ProGln: 1.169 ± 0.144
1.585ProArg: 1.585 ± 0.193
3.458ProSer: 3.458 ± 0.343
3.089ProThr: 3.089 ± 0.239
2.481ProVal: 2.481 ± 0.23
0.192ProTrp: 0.192 ± 0.048
1.361ProTyr: 1.361 ± 0.178
0.0ProXaa: 0.0 ± 0.0
Gln
1.553GlnAla: 1.553 ± 0.182
0.384GlnCys: 0.384 ± 0.074
1.649GlnAsp: 1.649 ± 0.16
1.841GlnGlu: 1.841 ± 0.225
1.249GlnPhe: 1.249 ± 0.129
1.729GlnGly: 1.729 ± 0.21
0.384GlnHis: 0.384 ± 0.081
1.985GlnIle: 1.985 ± 0.213
2.625GlnLys: 2.625 ± 0.359
2.161GlnLeu: 2.161 ± 0.19
0.88GlnMet: 0.88 ± 0.144
1.793GlnAsn: 1.793 ± 0.236
1.345GlnPro: 1.345 ± 0.171
1.153GlnGln: 1.153 ± 0.143
1.04GlnArg: 1.04 ± 0.126
2.113GlnSer: 2.113 ± 0.196
2.241GlnThr: 2.241 ± 0.186
1.745GlnVal: 1.745 ± 0.164
0.32GlnTrp: 0.32 ± 0.071
1.505GlnTyr: 1.505 ± 0.177
0.0GlnXaa: 0.0 ± 0.0
Arg
1.889ArgAla: 1.889 ± 0.216
0.592ArgCys: 0.592 ± 0.093
2.001ArgAsp: 2.001 ± 0.202
3.009ArgGlu: 3.009 ± 0.353
1.505ArgPhe: 1.505 ± 0.143
2.225ArgGly: 2.225 ± 0.193
0.816ArgHis: 0.816 ± 0.147
3.217ArgIle: 3.217 ± 0.317
3.618ArgLys: 3.618 ± 0.503
3.201ArgLeu: 3.201 ± 0.455
0.992ArgMet: 0.992 ± 0.172
2.433ArgAsn: 2.433 ± 0.254
1.649ArgPro: 1.649 ± 0.223
1.04ArgGln: 1.04 ± 0.168
1.745ArgArg: 1.745 ± 0.266
2.289ArgSer: 2.289 ± 0.178
2.081ArgThr: 2.081 ± 0.202
2.689ArgVal: 2.689 ± 0.222
0.496ArgTrp: 0.496 ± 0.093
1.569ArgTyr: 1.569 ± 0.195
0.0ArgXaa: 0.0 ± 0.0
Ser
4.098SerAla: 4.098 ± 0.391
0.768SerCys: 0.768 ± 0.096
4.706SerAsp: 4.706 ± 0.513
4.002SerGlu: 4.002 ± 0.299
2.817SerPhe: 2.817 ± 0.22
8.1SerGly: 8.1 ± 1.175
1.201SerHis: 1.201 ± 0.146
5.09SerIle: 5.09 ± 0.247
4.722SerLys: 4.722 ± 0.316
4.882SerLeu: 4.882 ± 0.314
1.729SerMet: 1.729 ± 0.176
5.426SerAsn: 5.426 ± 0.707
2.353SerPro: 2.353 ± 0.248
2.017SerGln: 2.017 ± 0.14
2.641SerArg: 2.641 ± 0.22
7.155SerSer: 7.155 ± 1.032
6.227SerThr: 6.227 ± 0.783
4.626SerVal: 4.626 ± 0.278
1.088SerTrp: 1.088 ± 0.252
3.073SerTyr: 3.073 ± 0.325
0.0SerXaa: 0.0 ± 0.0
Thr
3.586ThrAla: 3.586 ± 0.42
0.928ThrCys: 0.928 ± 0.159
4.034ThrAsp: 4.034 ± 0.332
3.378ThrGlu: 3.378 ± 0.229
3.329ThrPhe: 3.329 ± 0.268
6.051ThrGly: 6.051 ± 0.669
1.553ThrHis: 1.553 ± 0.163
5.218ThrIle: 5.218 ± 0.453
4.53ThrLys: 4.53 ± 0.368
5.923ThrLeu: 5.923 ± 0.346
1.393ThrMet: 1.393 ± 0.169
4.53ThrAsn: 4.53 ± 0.494
3.297ThrPro: 3.297 ± 0.221
2.129ThrGln: 2.129 ± 0.166
2.817ThrArg: 2.817 ± 0.246
6.467ThrSer: 6.467 ± 0.867
6.387ThrThr: 6.387 ± 0.961
5.41ThrVal: 5.41 ± 0.562
1.024ThrTrp: 1.024 ± 0.194
3.426ThrTyr: 3.426 ± 0.494
0.0ThrXaa: 0.0 ± 0.0
Val
3.362ValAla: 3.362 ± 0.268
1.088ValCys: 1.088 ± 0.162
3.057ValAsp: 3.057 ± 0.223
3.185ValGlu: 3.185 ± 0.321
2.593ValPhe: 2.593 ± 0.246
3.538ValGly: 3.538 ± 0.456
1.153ValHis: 1.153 ± 0.129
3.458ValIle: 3.458 ± 0.213
4.242ValLys: 4.242 ± 0.316
4.786ValLeu: 4.786 ± 0.318
1.377ValMet: 1.377 ± 0.182
3.874ValAsn: 3.874 ± 0.361
2.577ValPro: 2.577 ± 0.242
1.985ValGln: 1.985 ± 0.2
2.065ValArg: 2.065 ± 0.215
5.971ValSer: 5.971 ± 0.466
4.914ValThr: 4.914 ± 0.396
3.57ValVal: 3.57 ± 0.225
0.784ValTrp: 0.784 ± 0.114
3.265ValTyr: 3.265 ± 0.36
0.0ValXaa: 0.0 ± 0.0
Trp
0.416TrpAla: 0.416 ± 0.061
0.176TrpCys: 0.176 ± 0.066
0.624TrpAsp: 0.624 ± 0.089
0.448TrpGlu: 0.448 ± 0.075
0.624TrpPhe: 0.624 ± 0.093
1.056TrpGly: 1.056 ± 0.163
0.176TrpHis: 0.176 ± 0.057
0.592TrpIle: 0.592 ± 0.104
0.976TrpLys: 0.976 ± 0.124
0.768TrpLeu: 0.768 ± 0.119
0.176TrpMet: 0.176 ± 0.049
0.768TrpAsn: 0.768 ± 0.12
0.336TrpPro: 0.336 ± 0.081
0.32TrpGln: 0.32 ± 0.069
0.304TrpArg: 0.304 ± 0.07
0.976TrpSer: 0.976 ± 0.171
1.088TrpThr: 1.088 ± 0.242
0.72TrpVal: 0.72 ± 0.144
0.096TrpTrp: 0.096 ± 0.035
0.464TrpTyr: 0.464 ± 0.104
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.369TyrAla: 2.369 ± 0.248
0.464TyrCys: 0.464 ± 0.093
2.497TyrAsp: 2.497 ± 0.217
3.185TyrGlu: 3.185 ± 0.276
1.873TyrPhe: 1.873 ± 0.157
2.849TyrGly: 2.849 ± 0.256
0.992TyrHis: 0.992 ± 0.143
3.073TyrIle: 3.073 ± 0.227
3.826TyrLys: 3.826 ± 0.283
3.009TyrLeu: 3.009 ± 0.216
1.217TyrMet: 1.217 ± 0.126
2.993TyrAsn: 2.993 ± 0.231
1.649TyrPro: 1.649 ± 0.171
1.104TyrGln: 1.104 ± 0.117
1.617TyrArg: 1.617 ± 0.158
3.458TyrSer: 3.458 ± 0.32
3.97TyrThr: 3.97 ± 0.442
2.977TyrVal: 2.977 ± 0.29
0.32TyrTrp: 0.32 ± 0.065
2.065TyrTyr: 2.065 ± 0.279
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 203 proteins (62473 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski