Amino acid dipeptide frequency for Aplysia californica nido-like virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides should be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
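As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping residue pairs and reports them in per mille. It is not the database's own code: the function name `dipeptide_per_mille`, the use of one-letter codes (the table uses three-letter codes), the mapping of every non-standard letter to `X`, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is treated as Xaa ("X").
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs are counted within each protein, never across protein boundaries
        # (an assumption of this sketch).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(alphabet, repeat=2)}

# Uniform expectation for the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400

freqs = dipeptide_per_mille(["MAVLLA", "AVB"])  # toy sequences, not real data
```

The returned dictionary always has 441 keys, so dipeptides that never occur appear with a frequency of 0.0, matching the layout of the table.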

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.57 ± 0.405
AlaCys: 2.412 ± 0.19
AlaAsp: 4.249 ± 0.316
AlaGlu: 4.192 ± 0.571
AlaPhe: 2.871 ± 0.026
AlaGly: 3.905 ± 0.508
AlaHis: 1.321 ± 0.081
AlaIle: 2.125 ± 0.158
AlaLys: 3.847 ± 0.376
AlaLeu: 4.881 ± 0.126
AlaMet: 1.723 ± 0.261
AlaAsn: 2.871 ± 0.448
AlaPro: 3.675 ± 0.215
AlaGln: 2.067 ± 0.468
AlaArg: 2.986 ± 0.386
AlaSer: 5.857 ± 0.212
AlaThr: 4.479 ± 0.397
AlaVal: 5.341 ± 0.764
AlaTrp: 0.574 ± 0.1
AlaTyr: 3.216 ± 0.071
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.952 ± 0.288
CysCys: 1.091 ± 0.15
CysAsp: 1.493 ± 0.212
CysGlu: 1.665 ± 0.158
CysPhe: 1.55 ± 0.368
CysGly: 2.01 ± 0.13
CysHis: 0.402 ± 0.048
CysIle: 1.206 ± 0.089
CysLys: 1.206 ± 0.01
CysLeu: 2.24 ± 0.193
CysMet: 0.517 ± 0.061
CysAsn: 1.034 ± 0.061
CysPro: 0.804 ± 0.336
CysGln: 0.632 ± 0.125
CysArg: 1.149 ± 0.044
CysSer: 1.838 ± 0.665
CysThr: 1.034 ± 0.145
CysVal: 3.101 ± 0.258
CysTrp: 0.057 ± 0.07
CysTyr: 0.976 ± 0.248
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.962 ± 0.508
AspCys: 1.665 ± 0.119
AspAsp: 3.273 ± 0.306
AspGlu: 3.388 ± 0.261
AspPhe: 3.675 ± 0.585
AspGly: 3.56 ± 0.093
AspHis: 1.206 ± 0.174
AspIle: 3.044 ± 0.526
AspLys: 3.158 ± 0.122
AspLeu: 5.168 ± 0.288
AspMet: 1.263 ± 0.348
AspAsn: 1.55 ± 0.171
AspPro: 1.78 ± 0.381
AspGln: 1.665 ± 0.071
AspArg: 2.01 ± 0.22
AspSer: 4.02 ± 0.774
AspThr: 2.527 ± 0.091
AspVal: 7.465 ± 0.387
AspTrp: 0.689 ± 0.088
AspTyr: 1.321 ± 0.142
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.675 ± 0.36
GluCys: 1.091 ± 0.12
GluAsp: 4.249 ± 0.245
GluGlu: 4.881 ± 0.736
GluPhe: 2.182 ± 0.365
GluGly: 5.053 ± 0.484
GluHis: 1.723 ± 0.134
GluIle: 2.182 ± 0.189
GluLys: 3.905 ± 0.31
GluLeu: 4.537 ± 0.633
GluMet: 0.747 ± 0.186
GluAsn: 2.527 ± 0.223
GluPro: 1.665 ± 0.228
GluGln: 1.321 ± 0.38
GluArg: 3.331 ± 0.394
GluSer: 4.537 ± 0.318
GluThr: 2.929 ± 0.143
GluVal: 4.996 ± 0.163
GluTrp: 0.402 ± 0.06
GluTyr: 2.412 ± 0.243
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.503 ± 0.046
PheCys: 1.78 ± 0.082
PheAsp: 3.733 ± 0.525
PheGlu: 2.642 ± 0.308
PhePhe: 5.053 ± 0.454
PheGly: 4.077 ± 0.252
PheHis: 0.919 ± 0.178
PheIle: 2.642 ± 0.311
PheLys: 2.412 ± 0.126
PheLeu: 6.489 ± 0.259
PheMet: 1.493 ± 0.126
PheAsn: 1.952 ± 0.281
PhePro: 2.814 ± 0.047
PheGln: 0.976 ± 0.172
PheArg: 2.24 ± 0.301
PheSer: 5.513 ± 0.367
PheThr: 3.503 ± 0.272
PheVal: 6.604 ± 0.622
PheTrp: 0.517 ± 0.143
PheTyr: 2.297 ± 0.313
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.618 ± 0.399
GlyCys: 2.125 ± 0.526
GlyAsp: 3.56 ± 0.312
GlyGlu: 6.202 ± 0.749
GlyPhe: 3.618 ± 0.407
GlyGly: 4.192 ± 0.556
GlyHis: 2.067 ± 0.119
GlyIle: 2.929 ± 0.186
GlyLys: 3.503 ± 0.455
GlyLeu: 6.948 ± 0.562
GlyMet: 0.632 ± 0.338
GlyAsn: 3.158 ± 0.357
GlyPro: 2.01 ± 0.202
GlyGln: 1.665 ± 0.203
GlyArg: 3.503 ± 0.149
GlySer: 4.939 ± 0.615
GlyThr: 2.699 ± 0.112
GlyVal: 7.35 ± 0.145
GlyTrp: 0.689 ± 0.11
GlyTyr: 2.067 ± 0.123
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.378 ± 0.133
HisCys: 0.402 ± 0.044
HisAsp: 1.149 ± 0.098
HisGlu: 1.091 ± 0.129
HisPhe: 2.067 ± 0.171
HisGly: 1.895 ± 0.195
HisHis: 0.459 ± 0.08
HisIle: 0.976 ± 0.059
HisLys: 0.747 ± 0.053
HisLeu: 1.149 ± 0.044
HisMet: 0.23 ± 0.082
HisAsn: 0.689 ± 0.113
HisPro: 0.804 ± 0.12
HisGln: 0.517 ± 0.116
HisArg: 1.493 ± 0.254
HisSer: 1.378 ± 0.041
HisThr: 1.091 ± 0.335
HisVal: 2.929 ± 0.573
HisTrp: 0.115 ± 0.034
HisTyr: 1.436 ± 0.253
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.814 ± 0.421
IleCys: 1.263 ± 0.112
IleAsp: 1.78 ± 0.253
IleGlu: 1.149 ± 0.099
IlePhe: 2.986 ± 0.653
IleGly: 2.986 ± 0.07
IleHis: 1.665 ± 0.388
IleIle: 2.297 ± 0.279
IleLys: 2.125 ± 0.243
IleLeu: 3.675 ± 0.522
IleMet: 0.747 ± 0.078
IleAsn: 1.952 ± 0.195
IlePro: 2.354 ± 0.147
IleGln: 1.665 ± 0.31
IleArg: 2.756 ± 0.534
IleSer: 3.158 ± 0.162
IleThr: 2.527 ± 0.077
IleVal: 3.675 ± 0.536
IleTrp: 0.689 ± 0.189
IleTyr: 0.861 ± 0.02
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.158 ± 0.159
LysCys: 1.608 ± 0.327
LysAsp: 2.354 ± 0.307
LysGlu: 2.469 ± 0.102
LysPhe: 2.01 ± 0.136
LysGly: 2.814 ± 0.185
LysHis: 0.976 ± 0.197
LysIle: 2.814 ± 0.031
LysLys: 3.216 ± 0.375
LysLeu: 4.135 ± 0.467
LysMet: 0.861 ± 0.236
LysAsn: 2.297 ± 0.054
LysPro: 2.354 ± 0.548
LysGln: 2.067 ± 0.119
LysArg: 2.642 ± 0.2
LysSer: 3.847 ± 0.126
LysThr: 1.838 ± 0.353
LysVal: 5.398 ± 0.056
LysTrp: 0.459 ± 0.08
LysTyr: 1.55 ± 0.157
LysXaa: 0.057 ± 0.036
Leu
LeuAla: 6.03 ± 0.732
LeuCys: 2.182 ± 0.101
LeuAsp: 4.996 ± 0.521
LeuGlu: 2.986 ± 0.484
LeuPhe: 8.097 ± 0.946
LeuGly: 6.661 ± 0.165
LeuHis: 2.182 ± 0.176
LeuIle: 3.56 ± 0.153
LeuLys: 3.388 ± 0.294
LeuLeu: 8.958 ± 0.602
LeuMet: 1.321 ± 0.172
LeuAsn: 5.111 ± 0.732
LeuPro: 3.618 ± 0.076
LeuGln: 2.182 ± 0.085
LeuArg: 4.996 ± 0.13
LeuSer: 8.729 ± 0.17
LeuThr: 5.111 ± 0.315
LeuVal: 7.408 ± 0.372
LeuTrp: 0.689 ± 0.189
LeuTyr: 2.929 ± 0.255
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.378 ± 0.178
MetCys: 0.747 ± 0.24
MetAsp: 0.747 ± 0.051
MetGlu: 0.976 ± 0.091
MetPhe: 1.436 ± 0.309
MetGly: 1.091 ± 0.129
MetHis: 0.689 ± 0.084
MetIle: 0.747 ± 0.145
MetLys: 0.689 ± 0.189
MetLeu: 1.608 ± 0.228
MetMet: 0.459 ± 0.056
MetAsn: 0.345 ± 0.103
MetPro: 0.574 ± 0.099
MetGln: 0.689 ± 0.165
MetArg: 0.861 ± 0.152
MetSer: 1.665 ± 0.415
MetThr: 0.976 ± 0.099
MetVal: 0.919 ± 0.347
MetTrp: 0.057 ± 0.07
MetTyr: 0.632 ± 0.338
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.986 ± 0.144
AsnCys: 1.149 ± 0.044
AsnAsp: 1.952 ± 0.198
AsnGlu: 2.814 ± 0.433
AsnPhe: 2.297 ± 0.188
AsnGly: 2.412 ± 0.164
AsnHis: 0.747 ± 0.142
AsnIle: 1.665 ± 0.154
AsnLys: 1.55 ± 0.184
AsnLeu: 4.881 ± 0.356
AsnMet: 0.919 ± 0.089
AsnAsn: 0.747 ± 0.222
AsnPro: 1.378 ± 0.102
AsnGln: 0.976 ± 0.059
AsnArg: 1.608 ± 0.17
AsnSer: 2.986 ± 0.41
AsnThr: 3.273 ± 0.165
AsnVal: 4.02 ± 0.083
AsnTrp: 0.459 ± 0.129
AsnTyr: 1.665 ± 0.119
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.01 ± 0.151
ProCys: 0.402 ± 0.149
ProAsp: 1.723 ± 0.353
ProGlu: 2.01 ± 0.387
ProPhe: 2.412 ± 0.372
ProGly: 2.814 ± 0.339
ProHis: 0.976 ± 0.19
ProIle: 1.55 ± 0.288
ProLys: 2.527 ± 0.211
ProLeu: 4.709 ± 0.318
ProMet: 1.149 ± 0.203
ProAsn: 2.182 ± 0.169
ProPro: 2.125 ± 0.446
ProGln: 0.976 ± 0.329
ProArg: 2.756 ± 0.268
ProSer: 3.618 ± 0.597
ProThr: 2.986 ± 0.376
ProVal: 3.847 ± 0.172
ProTrp: 0.574 ± 0.099
ProTyr: 0.689 ± 0.23
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.412 ± 0.243
GlnCys: 0.574 ± 0.1
GlnAsp: 0.976 ± 0.246
GlnGlu: 1.321 ± 0.027
GlnPhe: 1.608 ± 0.193
GlnGly: 1.436 ± 0.162
GlnHis: 0.747 ± 0.233
GlnIle: 1.263 ± 0.169
GlnLys: 1.608 ± 0.104
GlnLeu: 2.067 ± 0.074
GlnMet: 0.287 ± 0.05
GlnAsn: 1.436 ± 0.035
GlnPro: 1.493 ± 0.273
GlnGln: 1.263 ± 0.108
GlnArg: 1.149 ± 0.288
GlnSer: 1.263 ± 0.358
GlnThr: 1.378 ± 0.051
GlnVal: 2.929 ± 0.071
GlnTrp: 0.172 ± 0.126
GlnTyr: 1.263 ± 0.08
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.503 ± 0.232
ArgCys: 1.206 ± 0.152
ArgAsp: 2.756 ± 0.395
ArgGlu: 3.503 ± 0.281
ArgPhe: 2.929 ± 0.219
ArgGly: 3.847 ± 0.048
ArgHis: 1.149 ± 0.231
ArgIle: 2.01 ± 0.231
ArgLys: 2.584 ± 0.031
ArgLeu: 3.101 ± 0.356
ArgMet: 0.804 ± 0.241
ArgAsn: 1.723 ± 0.258
ArgPro: 2.699 ± 0.401
ArgGln: 1.206 ± 0.181
ArgArg: 2.642 ± 0.335
ArgSer: 2.469 ± 0.363
ArgThr: 3.56 ± 0.218
ArgVal: 5.283 ± 0.585
ArgTrp: 0.402 ± 0.048
ArgTyr: 1.952 ± 0.583
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.972 ± 0.354
SerCys: 1.723 ± 0.482
SerAsp: 4.939 ± 0.132
SerGlu: 4.249 ± 0.34
SerPhe: 4.479 ± 0.219
SerGly: 4.766 ± 0.738
SerHis: 1.149 ± 0.06
SerIle: 4.537 ± 0.133
SerLys: 2.986 ± 0.316
SerLeu: 7.752 ± 0.57
SerMet: 1.895 ± 0.256
SerAsn: 3.044 ± 0.358
SerPro: 3.044 ± 1.297
SerGln: 1.723 ± 0.221
SerArg: 3.503 ± 0.603
SerSer: 8.843 ± 0.477
SerThr: 4.192 ± 0.983
SerVal: 9.533 ± 0.059
SerTrp: 0.23 ± 0.068
SerTyr: 2.699 ± 0.23
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.02 ± 0.124
ThrCys: 1.493 ± 0.278
ThrAsp: 3.446 ± 0.072
ThrGlu: 2.584 ± 0.315
ThrPhe: 3.044 ± 0.103
ThrGly: 4.135 ± 0.497
ThrHis: 1.493 ± 0.05
ThrIle: 2.24 ± 0.153
ThrLys: 2.412 ± 0.541
ThrLeu: 5.915 ± 0.34
ThrMet: 1.091 ± 0.055
ThrAsn: 1.378 ± 0.229
ThrPro: 3.273 ± 0.21
ThrGln: 1.206 ± 0.076
ThrArg: 1.895 ± 0.186
ThrSer: 4.537 ± 0.377
ThrThr: 4.479 ± 0.822
ThrVal: 5.857 ± 0.141
ThrTrp: 0.459 ± 0.042
ThrTyr: 1.723 ± 0.297
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.661 ± 0.22
ValCys: 2.469 ± 0.083
ValAsp: 5.8 ± 0.369
ValGlu: 6.834 ± 0.838
ValPhe: 6.202 ± 0.211
ValGly: 7.408 ± 0.18
ValHis: 1.091 ± 0.218
ValIle: 3.446 ± 0.011
ValLys: 5.226 ± 0.527
ValLeu: 8.327 ± 0.261
ValMet: 0.804 ± 0.12
ValAsn: 4.192 ± 0.583
ValPro: 4.422 ± 0.577
ValGln: 2.469 ± 0.273
ValArg: 5.628 ± 0.228
ValSer: 9.131 ± 0.65
ValThr: 5.168 ± 0.307
ValVal: 11.485 ± 0.639
ValTrp: 0.632 ± 0.024
ValTyr: 3.56 ± 0.155
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.517 ± 0.061
TrpCys: 0.0 ± 0.0
TrpAsp: 0.689 ± 0.072
TrpGlu: 0.861 ± 0.113
TrpPhe: 0.345 ± 0.01
TrpGly: 0.459 ± 0.062
TrpHis: 0.23 ± 0.068
TrpIle: 0.459 ± 0.042
TrpLys: 0.517 ± 0.143
TrpLeu: 1.608 ± 0.278
TrpMet: 0.057 ± 0.036
TrpAsn: 0.172 ± 0.073
TrpPro: 0.115 ± 0.062
TrpGln: 0.345 ± 0.01
TrpArg: 0.517 ± 0.079
TrpSer: 0.287 ± 0.115
TrpThr: 0.345 ± 0.01
TrpVal: 0.115 ± 0.072
TrpTrp: 0.057 ± 0.07
TrpTyr: 0.402 ± 0.048
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.044 ± 0.212
TyrCys: 0.574 ± 0.058
TyrAsp: 2.527 ± 0.077
TyrGlu: 2.527 ± 0.268
TyrPhe: 2.297 ± 0.393
TyrGly: 1.952 ± 0.096
TyrHis: 0.804 ± 0.122
TyrIle: 1.493 ± 0.24
TyrLys: 1.321 ± 0.027
TyrLeu: 2.986 ± 0.298
TyrMet: 0.23 ± 0.028
TyrAsn: 2.125 ± 0.504
TyrPro: 1.034 ± 0.118
TyrGln: 1.034 ± 0.171
TyrArg: 1.608 ± 0.17
TyrSer: 2.642 ± 0.386
TyrThr: 2.584 ± 0.07
TyrVal: 2.699 ± 0.713
TyrTrp: 0.172 ± 0.041
TyrTyr: 0.861 ± 0.02
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.057 ± 0.036
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
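
Because the red and blue markings do not survive in plain text, the comparison can be redone from the listed values. The sketch below parses table lines of the form shown above and labels each value against the uniform expectation of 2.5‰. The names `parse_cells` and `classify` are hypothetical, and using exactly 2.5‰ as the threshold is an assumption based on the text; the page's precise colouring rule is not stated here.

```python
import re

# Matches e.g. "AlaCys: 2.412 ± 0.19", with or without a leading duplicated value.
CELL = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)")

def parse_cells(lines):
    """Return {(first, second): (value, error)} in per mille from table lines."""
    table = {}
    for line in lines:
        m = CELL.search(line)
        if m:  # row labels and the header line do not match and are skipped
            first, second, value, error = m.groups()
            table[(first, second)] = (float(value), float(error))
    return table

def classify(value, expected=2.5):
    """Label a per-mille value relative to the assumed uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"

sample = ["Ala", "AlaAla: 5.57 ± 0.405", "2.412AlaCys: 2.412 ± 0.19"]
cells = parse_cells(sample)
labels = {pair: classify(value) for pair, (value, _) in cells.items()}
```

A stricter comparison would also take the ± error into account, for example by labelling a dipeptide only when the expectation lies outside value ± error.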
Statistics based on 3 proteins (17415 amino acids)
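
To relate the per-mille values to raw counts, the following sketch converts a table value into an approximate number of occurrences. It assumes that dipeptides are counted only within proteins, so that 3 proteins with 17415 residues give 17415 − 3 dipeptide positions; this counting rule is an assumption, not something the page states, and `approx_count` is a hypothetical helper.

```python
N_RESIDUES = 17415  # from the statistics line
N_PROTEINS = 3      # from the statistics line

# Assumption: one dipeptide per adjacent residue pair, none spanning two proteins.
n_dipeptides = N_RESIDUES - N_PROTEINS

def approx_count(per_mille):
    """Approximate number of occurrences behind a per-mille table value."""
    return round(per_mille / 1000.0 * n_dipeptides)

rare = approx_count(0.057)      # smallest non-zero value in the table
common = approx_count(11.485)   # ValVal, the largest value in the table
```

Under that assumption, the smallest non-zero entries correspond to roughly a single occurrence, which is worth keeping in mind when reading their error estimates.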

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
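
A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates serves as the error. This is only an illustration under stated assumptions; the function names, the fixed random seed, and the use of the population standard deviation as the error measure are choices made for the example, not details confirmed by the source.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, spread) of a dipeptide's per-mille frequency over resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.pstdev(estimates)

# Toy one-letter sequences, not real data.
mean, err = bootstrap_error(["AVAV", "AVLL", "LLAV"], "AV")
```

With only three proteins, as here, each resample draws from very few distinct units, so the estimated errors should be read as rough indications.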

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski