Amino acid dipepetide frequency for Microbacterium phage Cinna

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.744AlaAla: 17.744 ± 1.423
1.014AlaCys: 1.014 ± 0.264
8.168AlaAsp: 8.168 ± 0.707
10.59AlaGlu: 10.59 ± 0.995
2.873AlaPhe: 2.873 ± 0.412
8.788AlaGly: 8.788 ± 0.779
2.253AlaHis: 2.253 ± 0.364
5.014AlaIle: 5.014 ± 0.582
3.155AlaLys: 3.155 ± 0.5
12.956AlaLeu: 12.956 ± 0.998
3.098AlaMet: 3.098 ± 0.423
3.436AlaAsn: 3.436 ± 0.458
6.647AlaPro: 6.647 ± 0.614
4.225AlaGln: 4.225 ± 0.482
9.689AlaArg: 9.689 ± 1.114
6.872AlaSer: 6.872 ± 0.757
9.182AlaThr: 9.182 ± 0.737
7.886AlaVal: 7.886 ± 0.745
2.31AlaTrp: 2.31 ± 0.312
2.591AlaTyr: 2.591 ± 0.398
0.0AlaXaa: 0.0 ± 0.0
Cys
0.901CysAla: 0.901 ± 0.195
0.113CysCys: 0.113 ± 0.109
0.789CysAsp: 0.789 ± 0.214
0.563CysGlu: 0.563 ± 0.152
0.056CysPhe: 0.056 ± 0.045
1.69CysGly: 1.69 ± 0.398
0.394CysHis: 0.394 ± 0.124
0.338CysIle: 0.338 ± 0.134
0.0CysLys: 0.0 ± 0.0
0.169CysLeu: 0.169 ± 0.088
0.0CysMet: 0.0 ± 0.0
0.169CysAsn: 0.169 ± 0.104
1.183CysPro: 1.183 ± 0.294
0.113CysGln: 0.113 ± 0.088
0.789CysArg: 0.789 ± 0.276
0.225CysSer: 0.225 ± 0.128
0.394CysThr: 0.394 ± 0.157
0.507CysVal: 0.507 ± 0.168
0.113CysTrp: 0.113 ± 0.077
0.169CysTyr: 0.169 ± 0.12
0.0CysXaa: 0.0 ± 0.0
Asp
9.238AspAla: 9.238 ± 0.746
0.845AspCys: 0.845 ± 0.245
5.746AspAsp: 5.746 ± 0.709
4.169AspGlu: 4.169 ± 0.622
1.69AspPhe: 1.69 ± 0.213
6.703AspGly: 6.703 ± 0.769
1.183AspHis: 1.183 ± 0.257
2.535AspIle: 2.535 ± 0.4
1.07AspLys: 1.07 ± 0.268
4.788AspLeu: 4.788 ± 0.561
1.07AspMet: 1.07 ± 0.229
1.07AspAsn: 1.07 ± 0.27
4.788AspPro: 4.788 ± 0.551
1.296AspGln: 1.296 ± 0.23
4.112AspArg: 4.112 ± 0.521
3.098AspSer: 3.098 ± 0.435
3.831AspThr: 3.831 ± 0.462
4.112AspVal: 4.112 ± 0.456
1.127AspTrp: 1.127 ± 0.22
1.352AspTyr: 1.352 ± 0.244
0.0AspXaa: 0.0 ± 0.0
Glu
9.52GluAla: 9.52 ± 0.735
0.62GluCys: 0.62 ± 0.207
3.493GluAsp: 3.493 ± 0.632
5.464GluGlu: 5.464 ± 0.572
1.127GluPhe: 1.127 ± 0.235
4.676GluGly: 4.676 ± 0.528
2.76GluHis: 2.76 ± 0.405
3.267GluIle: 3.267 ± 0.479
1.972GluLys: 1.972 ± 0.339
2.817GluLeu: 2.817 ± 0.382
2.028GluMet: 2.028 ± 0.371
1.634GluAsn: 1.634 ± 0.299
5.858GluPro: 5.858 ± 0.809
2.253GluGln: 2.253 ± 0.454
7.098GluArg: 7.098 ± 0.782
2.253GluSer: 2.253 ± 0.409
3.267GluThr: 3.267 ± 0.373
5.295GluVal: 5.295 ± 0.615
1.69GluTrp: 1.69 ± 0.326
2.028GluTyr: 2.028 ± 0.36
0.0GluXaa: 0.0 ± 0.0
Phe
2.535PheAla: 2.535 ± 0.325
0.338PheCys: 0.338 ± 0.15
1.859PheAsp: 1.859 ± 0.366
1.69PheGlu: 1.69 ± 0.308
0.451PhePhe: 0.451 ± 0.178
1.915PheGly: 1.915 ± 0.34
0.507PheHis: 0.507 ± 0.191
0.732PheIle: 0.732 ± 0.19
0.394PheLys: 0.394 ± 0.121
1.408PheLeu: 1.408 ± 0.257
0.394PheMet: 0.394 ± 0.135
0.451PheAsn: 0.451 ± 0.164
0.958PhePro: 0.958 ± 0.21
0.676PheGln: 0.676 ± 0.207
1.577PheArg: 1.577 ± 0.313
1.239PheSer: 1.239 ± 0.313
2.141PheThr: 2.141 ± 0.386
1.352PheVal: 1.352 ± 0.372
0.338PheTrp: 0.338 ± 0.141
0.451PheTyr: 0.451 ± 0.15
0.0PheXaa: 0.0 ± 0.0
Gly
8.957GlyAla: 8.957 ± 0.649
0.732GlyCys: 0.732 ± 0.217
6.14GlyAsp: 6.14 ± 0.791
6.027GlyGlu: 6.027 ± 0.545
2.141GlyPhe: 2.141 ± 0.388
8.393GlyGly: 8.393 ± 0.749
1.746GlyHis: 1.746 ± 0.394
3.662GlyIle: 3.662 ± 0.543
2.648GlyLys: 2.648 ± 0.368
6.534GlyLeu: 6.534 ± 0.987
2.704GlyMet: 2.704 ± 0.401
2.648GlyAsn: 2.648 ± 0.426
3.718GlyPro: 3.718 ± 0.509
2.479GlyGln: 2.479 ± 0.319
5.689GlyArg: 5.689 ± 0.559
4.056GlySer: 4.056 ± 0.606
7.605GlyThr: 7.605 ± 0.733
6.309GlyVal: 6.309 ± 0.612
2.591GlyTrp: 2.591 ± 0.429
2.817GlyTyr: 2.817 ± 0.445
0.0GlyXaa: 0.0 ± 0.0
His
2.422HisAla: 2.422 ± 0.416
0.225HisCys: 0.225 ± 0.14
1.746HisAsp: 1.746 ± 0.328
1.577HisGlu: 1.577 ± 0.322
0.169HisPhe: 0.169 ± 0.091
2.479HisGly: 2.479 ± 0.378
0.789HisHis: 0.789 ± 0.276
0.958HisIle: 0.958 ± 0.237
0.507HisLys: 0.507 ± 0.154
2.141HisLeu: 2.141 ± 0.423
0.62HisMet: 0.62 ± 0.206
0.563HisAsn: 0.563 ± 0.19
1.296HisPro: 1.296 ± 0.326
0.282HisGln: 0.282 ± 0.14
1.859HisArg: 1.859 ± 0.345
0.62HisSer: 0.62 ± 0.161
1.07HisThr: 1.07 ± 0.284
1.296HisVal: 1.296 ± 0.243
0.169HisTrp: 0.169 ± 0.094
0.507HisTyr: 0.507 ± 0.155
0.0HisXaa: 0.0 ± 0.0
Ile
5.746IleAla: 5.746 ± 0.659
0.113IleCys: 0.113 ± 0.075
3.493IleAsp: 3.493 ± 0.505
4.112IleGlu: 4.112 ± 0.542
0.789IlePhe: 0.789 ± 0.182
4.0IleGly: 4.0 ± 0.556
1.127IleHis: 1.127 ± 0.255
2.084IleIle: 2.084 ± 0.453
0.901IleLys: 0.901 ± 0.207
2.422IleLeu: 2.422 ± 0.401
0.901IleMet: 0.901 ± 0.194
0.789IleAsn: 0.789 ± 0.194
3.267IlePro: 3.267 ± 0.514
1.577IleGln: 1.577 ± 0.299
3.436IleArg: 3.436 ± 0.423
2.31IleSer: 2.31 ± 0.439
4.394IleThr: 4.394 ± 0.467
3.436IleVal: 3.436 ± 0.4
0.563IleTrp: 0.563 ± 0.201
0.676IleTyr: 0.676 ± 0.244
0.0IleXaa: 0.0 ± 0.0
Lys
4.619LysAla: 4.619 ± 0.537
0.113LysCys: 0.113 ± 0.075
0.901LysAsp: 0.901 ± 0.227
0.451LysGlu: 0.451 ± 0.136
0.563LysPhe: 0.563 ± 0.19
2.479LysGly: 2.479 ± 0.386
0.62LysHis: 0.62 ± 0.229
0.958LysIle: 0.958 ± 0.241
0.282LysLys: 0.282 ± 0.141
0.507LysLeu: 0.507 ± 0.286
0.732LysMet: 0.732 ± 0.191
0.676LysAsn: 0.676 ± 0.204
2.028LysPro: 2.028 ± 0.331
0.563LysGln: 0.563 ± 0.176
2.929LysArg: 2.929 ± 0.474
0.958LysSer: 0.958 ± 0.255
1.296LysThr: 1.296 ± 0.259
2.422LysVal: 2.422 ± 0.44
0.507LysTrp: 0.507 ± 0.159
0.451LysTyr: 0.451 ± 0.154
0.0LysXaa: 0.0 ± 0.0
Leu
9.238LeuAla: 9.238 ± 0.862
0.676LeuCys: 0.676 ± 0.235
4.788LeuAsp: 4.788 ± 0.403
3.605LeuGlu: 3.605 ± 0.51
1.577LeuPhe: 1.577 ± 0.421
6.591LeuGly: 6.591 ± 0.822
1.521LeuHis: 1.521 ± 0.386
4.394LeuIle: 4.394 ± 0.565
0.62LeuLys: 0.62 ± 0.189
5.521LeuLeu: 5.521 ± 0.736
1.127LeuMet: 1.127 ± 0.236
1.972LeuAsn: 1.972 ± 0.284
4.112LeuPro: 4.112 ± 0.584
2.197LeuGln: 2.197 ± 0.433
5.689LeuArg: 5.689 ± 0.601
4.957LeuSer: 4.957 ± 0.53
6.309LeuThr: 6.309 ± 0.525
5.183LeuVal: 5.183 ± 0.715
1.014LeuTrp: 1.014 ± 0.266
1.634LeuTyr: 1.634 ± 0.31
0.0LeuXaa: 0.0 ± 0.0
Met
3.042MetAla: 3.042 ± 0.384
0.113MetCys: 0.113 ± 0.088
1.014MetAsp: 1.014 ± 0.224
0.901MetGlu: 0.901 ± 0.256
0.282MetPhe: 0.282 ± 0.14
1.07MetGly: 1.07 ± 0.245
0.451MetHis: 0.451 ± 0.147
1.296MetIle: 1.296 ± 0.302
0.507MetLys: 0.507 ± 0.193
1.352MetLeu: 1.352 ± 0.269
0.451MetMet: 0.451 ± 0.154
0.901MetAsn: 0.901 ± 0.175
2.197MetPro: 2.197 ± 0.376
0.676MetGln: 0.676 ± 0.181
1.577MetArg: 1.577 ± 0.34
1.915MetSer: 1.915 ± 0.275
3.042MetThr: 3.042 ± 0.403
1.803MetVal: 1.803 ± 0.377
0.507MetTrp: 0.507 ± 0.154
0.507MetTyr: 0.507 ± 0.173
0.0MetXaa: 0.0 ± 0.0
Asn
2.929AsnAla: 2.929 ± 0.38
0.169AsnCys: 0.169 ± 0.105
1.07AsnAsp: 1.07 ± 0.243
1.465AsnGlu: 1.465 ± 0.302
0.563AsnPhe: 0.563 ± 0.187
3.098AsnGly: 3.098 ± 0.542
0.563AsnHis: 0.563 ± 0.213
0.732AsnIle: 0.732 ± 0.168
0.451AsnLys: 0.451 ± 0.187
1.521AsnLeu: 1.521 ± 0.298
0.338AsnMet: 0.338 ± 0.128
1.07AsnAsn: 1.07 ± 0.223
2.704AsnPro: 2.704 ± 0.42
0.451AsnGln: 0.451 ± 0.142
2.253AsnArg: 2.253 ± 0.37
1.408AsnSer: 1.408 ± 0.311
1.69AsnThr: 1.69 ± 0.308
2.253AsnVal: 2.253 ± 0.301
0.451AsnTrp: 0.451 ± 0.199
0.394AsnTyr: 0.394 ± 0.133
0.0AsnXaa: 0.0 ± 0.0
Pro
7.154ProAla: 7.154 ± 0.651
0.394ProCys: 0.394 ± 0.163
4.563ProAsp: 4.563 ± 0.566
6.816ProGlu: 6.816 ± 0.963
1.183ProPhe: 1.183 ± 0.286
4.507ProGly: 4.507 ± 0.541
1.127ProHis: 1.127 ± 0.252
2.535ProIle: 2.535 ± 0.417
2.028ProLys: 2.028 ± 0.336
4.112ProLeu: 4.112 ± 0.511
1.465ProMet: 1.465 ± 0.291
1.915ProAsn: 1.915 ± 0.342
3.774ProPro: 3.774 ± 0.533
1.352ProGln: 1.352 ± 0.609
3.436ProArg: 3.436 ± 0.443
3.774ProSer: 3.774 ± 0.553
5.239ProThr: 5.239 ± 0.433
4.281ProVal: 4.281 ± 0.476
1.183ProTrp: 1.183 ± 0.252
1.915ProTyr: 1.915 ± 0.337
0.0ProXaa: 0.0 ± 0.0
Gln
5.239GlnAla: 5.239 ± 0.561
0.451GlnCys: 0.451 ± 0.164
1.127GlnAsp: 1.127 ± 0.239
1.577GlnGlu: 1.577 ± 0.285
0.563GlnPhe: 0.563 ± 0.161
2.31GlnGly: 2.31 ± 0.404
0.62GlnHis: 0.62 ± 0.157
1.803GlnIle: 1.803 ± 0.256
0.338GlnLys: 0.338 ± 0.133
1.014GlnLeu: 1.014 ± 0.484
0.732GlnMet: 0.732 ± 0.195
0.789GlnAsn: 0.789 ± 0.212
1.296GlnPro: 1.296 ± 0.331
0.789GlnGln: 0.789 ± 0.199
2.141GlnArg: 2.141 ± 0.327
1.408GlnSer: 1.408 ± 0.245
2.197GlnThr: 2.197 ± 0.357
2.366GlnVal: 2.366 ± 0.29
1.07GlnTrp: 1.07 ± 0.268
0.789GlnTyr: 0.789 ± 0.204
0.0GlnXaa: 0.0 ± 0.0
Arg
9.745ArgAla: 9.745 ± 0.911
0.563ArgCys: 0.563 ± 0.193
4.845ArgAsp: 4.845 ± 0.588
5.239ArgGlu: 5.239 ± 0.797
1.915ArgPhe: 1.915 ± 0.315
6.027ArgGly: 6.027 ± 0.634
1.746ArgHis: 1.746 ± 0.351
2.76ArgIle: 2.76 ± 0.357
2.873ArgLys: 2.873 ± 0.441
6.985ArgLeu: 6.985 ± 0.659
2.253ArgMet: 2.253 ± 0.427
1.803ArgAsn: 1.803 ± 0.266
3.098ArgPro: 3.098 ± 0.507
2.929ArgGln: 2.929 ± 0.509
7.774ArgArg: 7.774 ± 0.833
3.267ArgSer: 3.267 ± 0.479
4.112ArgThr: 4.112 ± 0.48
4.957ArgVal: 4.957 ± 0.577
1.465ArgTrp: 1.465 ± 0.326
2.028ArgTyr: 2.028 ± 0.34
0.0ArgXaa: 0.0 ± 0.0
Ser
6.027SerAla: 6.027 ± 0.7
0.394SerCys: 0.394 ± 0.148
2.76SerAsp: 2.76 ± 0.528
2.76SerGlu: 2.76 ± 0.395
1.465SerPhe: 1.465 ± 0.29
5.858SerGly: 5.858 ± 0.92
0.958SerHis: 0.958 ± 0.219
2.591SerIle: 2.591 ± 0.405
1.465SerLys: 1.465 ± 0.345
4.507SerLeu: 4.507 ± 0.547
1.127SerMet: 1.127 ± 0.223
1.521SerAsn: 1.521 ± 0.296
3.098SerPro: 3.098 ± 0.359
0.845SerGln: 0.845 ± 0.193
3.436SerArg: 3.436 ± 0.476
1.803SerSer: 1.803 ± 0.384
4.0SerThr: 4.0 ± 0.565
3.718SerVal: 3.718 ± 0.426
0.676SerTrp: 0.676 ± 0.178
0.901SerTyr: 0.901 ± 0.262
0.0SerXaa: 0.0 ± 0.0
Thr
9.914ThrAla: 9.914 ± 0.998
0.62ThrCys: 0.62 ± 0.174
4.845ThrAsp: 4.845 ± 0.539
4.281ThrGlu: 4.281 ± 0.607
1.521ThrPhe: 1.521 ± 0.334
8.506ThrGly: 8.506 ± 0.874
1.014ThrHis: 1.014 ± 0.242
4.732ThrIle: 4.732 ± 0.585
1.803ThrLys: 1.803 ± 0.336
5.352ThrLeu: 5.352 ± 0.699
1.577ThrMet: 1.577 ± 0.245
1.465ThrAsn: 1.465 ± 0.355
6.478ThrPro: 6.478 ± 0.61
1.859ThrGln: 1.859 ± 0.414
4.507ThrArg: 4.507 ± 0.47
2.986ThrSer: 2.986 ± 0.396
5.802ThrThr: 5.802 ± 0.583
6.422ThrVal: 6.422 ± 0.757
1.183ThrTrp: 1.183 ± 0.248
1.521ThrTyr: 1.521 ± 0.265
0.0ThrXaa: 0.0 ± 0.0
Val
8.281ValAla: 8.281 ± 0.605
0.62ValCys: 0.62 ± 0.194
3.605ValAsp: 3.605 ± 0.416
4.338ValGlu: 4.338 ± 0.449
1.521ValPhe: 1.521 ± 0.327
5.295ValGly: 5.295 ± 0.477
1.183ValHis: 1.183 ± 0.249
3.774ValIle: 3.774 ± 0.518
2.141ValLys: 2.141 ± 0.304
5.464ValLeu: 5.464 ± 0.506
1.634ValMet: 1.634 ± 0.311
1.859ValAsn: 1.859 ± 0.308
4.225ValPro: 4.225 ± 0.526
2.253ValGln: 2.253 ± 0.366
4.901ValArg: 4.901 ± 0.508
4.169ValSer: 4.169 ± 0.571
7.548ValThr: 7.548 ± 0.625
4.169ValVal: 4.169 ± 0.552
1.521ValTrp: 1.521 ± 0.274
1.915ValTyr: 1.915 ± 0.32
0.0ValXaa: 0.0 ± 0.0
Trp
1.915TrpAla: 1.915 ± 0.315
0.338TrpCys: 0.338 ± 0.146
1.07TrpAsp: 1.07 ± 0.257
1.69TrpGlu: 1.69 ± 0.283
0.507TrpPhe: 0.507 ± 0.15
1.239TrpGly: 1.239 ± 0.28
0.451TrpHis: 0.451 ± 0.152
0.732TrpIle: 0.732 ± 0.178
0.507TrpLys: 0.507 ± 0.168
1.465TrpLeu: 1.465 ± 0.252
0.507TrpMet: 0.507 ± 0.163
0.507TrpAsn: 0.507 ± 0.197
0.901TrpPro: 0.901 ± 0.233
1.183TrpGln: 1.183 ± 0.236
1.69TrpArg: 1.69 ± 0.341
1.352TrpSer: 1.352 ± 0.285
1.972TrpThr: 1.972 ± 0.304
0.901TrpVal: 0.901 ± 0.222
0.563TrpTrp: 0.563 ± 0.208
0.056TrpTyr: 0.056 ± 0.048
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.324TyrAla: 3.324 ± 0.559
0.225TyrCys: 0.225 ± 0.102
1.803TyrAsp: 1.803 ± 0.319
1.859TyrGlu: 1.859 ± 0.317
0.451TyrPhe: 0.451 ± 0.211
1.972TyrGly: 1.972 ± 0.292
0.282TyrHis: 0.282 ± 0.118
1.239TyrIle: 1.239 ± 0.254
0.394TyrLys: 0.394 ± 0.147
1.69TyrLeu: 1.69 ± 0.358
0.789TyrMet: 0.789 ± 0.211
0.338TyrAsn: 0.338 ± 0.116
1.127TyrPro: 1.127 ± 0.226
0.62TyrGln: 0.62 ± 0.177
1.803TyrArg: 1.803 ± 0.335
1.239TyrSer: 1.239 ± 0.282
1.296TyrThr: 1.296 ± 0.231
1.746TyrVal: 1.746 ± 0.25
0.507TyrTrp: 0.507 ± 0.167
0.789TyrTyr: 0.789 ± 0.187
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (17753 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski