Amino acid dipepetide frequency for Simian virus 41 (SV41)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.106AlaAla: 9.106 ± 2.502
0.809AlaCys: 0.809 ± 0.485
4.856AlaAsp: 4.856 ± 0.37
5.261AlaGlu: 5.261 ± 1.747
1.821AlaPhe: 1.821 ± 0.413
4.452AlaGly: 4.452 ± 1.539
1.214AlaHis: 1.214 ± 0.747
4.654AlaIle: 4.654 ± 0.868
3.44AlaLys: 3.44 ± 0.852
8.094AlaLeu: 8.094 ± 1.333
1.619AlaMet: 1.619 ± 0.931
5.059AlaAsn: 5.059 ± 2.007
2.631AlaPro: 2.631 ± 0.91
3.642AlaGln: 3.642 ± 1.331
3.44AlaArg: 3.44 ± 1.177
6.07AlaSer: 6.07 ± 0.451
5.261AlaThr: 5.261 ± 1.625
4.654AlaVal: 4.654 ± 1.252
0.809AlaTrp: 0.809 ± 0.415
2.631AlaTyr: 2.631 ± 0.35
0.0AlaXaa: 0.0 ± 0.0
Cys
1.214CysAla: 1.214 ± 0.444
0.202CysCys: 0.202 ± 0.125
0.202CysAsp: 0.202 ± 0.125
0.607CysGlu: 0.607 ± 0.424
1.214CysPhe: 1.214 ± 0.548
0.809CysGly: 0.809 ± 0.636
0.202CysHis: 0.202 ± 0.256
1.012CysIle: 1.012 ± 0.434
1.416CysLys: 1.416 ± 0.61
1.416CysLeu: 1.416 ± 0.357
1.012CysMet: 1.012 ± 0.254
0.809CysAsn: 0.809 ± 0.472
1.012CysPro: 1.012 ± 0.589
0.405CysGln: 0.405 ± 0.249
1.012CysArg: 1.012 ± 0.664
2.226CysSer: 2.226 ± 0.793
1.416CysThr: 1.416 ± 0.27
0.809CysVal: 0.809 ± 0.43
0.0CysTrp: 0.0 ± 0.0
0.809CysTyr: 0.809 ± 0.458
0.0CysXaa: 0.0 ± 0.0
Asp
2.226AspAla: 2.226 ± 0.426
0.607AspCys: 0.607 ± 0.271
3.238AspAsp: 3.238 ± 1.496
2.023AspGlu: 2.023 ± 0.628
2.226AspPhe: 2.226 ± 0.744
3.44AspGly: 3.44 ± 0.593
1.416AspHis: 1.416 ± 0.511
2.631AspIle: 2.631 ± 0.596
1.012AspLys: 1.012 ± 0.325
6.677AspLeu: 6.677 ± 0.966
0.405AspMet: 0.405 ± 0.344
2.833AspAsn: 2.833 ± 1.177
4.654AspPro: 4.654 ± 0.741
2.833AspGln: 2.833 ± 0.647
1.416AspArg: 1.416 ± 0.635
3.035AspSer: 3.035 ± 0.9
3.44AspThr: 3.44 ± 0.701
2.631AspVal: 2.631 ± 0.34
0.202AspTrp: 0.202 ± 0.125
3.238AspTyr: 3.238 ± 1.357
0.0AspXaa: 0.0 ± 0.0
Glu
2.631GluAla: 2.631 ± 1.393
1.214GluCys: 1.214 ± 0.571
2.833GluAsp: 2.833 ± 0.943
3.845GluGlu: 3.845 ± 0.677
0.607GluPhe: 0.607 ± 0.289
3.44GluGly: 3.44 ± 0.939
0.607GluHis: 0.607 ± 0.286
4.047GluIle: 4.047 ± 0.9
2.428GluLys: 2.428 ± 0.501
6.475GluLeu: 6.475 ± 0.972
0.607GluMet: 0.607 ± 0.239
1.619GluAsn: 1.619 ± 0.583
1.012GluPro: 1.012 ± 0.932
2.631GluGln: 2.631 ± 0.604
1.821GluArg: 1.821 ± 0.419
3.44GluSer: 3.44 ± 0.462
2.631GluThr: 2.631 ± 0.63
2.631GluVal: 2.631 ± 0.471
0.809GluTrp: 0.809 ± 0.636
1.416GluTyr: 1.416 ± 0.357
0.0GluXaa: 0.0 ± 0.0
Phe
1.821PheAla: 1.821 ± 0.654
0.809PheCys: 0.809 ± 0.307
3.238PheAsp: 3.238 ± 1.324
1.416PheGlu: 1.416 ± 0.477
2.428PhePhe: 2.428 ± 0.476
0.809PheGly: 0.809 ± 0.45
0.405PheHis: 0.405 ± 0.215
2.631PheIle: 2.631 ± 0.503
1.821PheLys: 1.821 ± 0.518
2.226PheLeu: 2.226 ± 0.708
0.405PheMet: 0.405 ± 0.204
1.416PheAsn: 1.416 ± 0.326
1.619PhePro: 1.619 ± 0.901
1.416PheGln: 1.416 ± 0.559
1.416PheArg: 1.416 ± 0.6
4.249PheSer: 4.249 ± 0.749
1.821PheThr: 1.821 ± 0.731
1.821PheVal: 1.821 ± 0.671
0.202PheTrp: 0.202 ± 0.125
1.416PheTyr: 1.416 ± 0.451
0.0PheXaa: 0.0 ± 0.0
Gly
5.261GlyAla: 5.261 ± 1.504
1.619GlyCys: 1.619 ± 0.731
4.047GlyAsp: 4.047 ± 1.19
2.631GlyGlu: 2.631 ± 0.747
2.428GlyPhe: 2.428 ± 1.028
4.047GlyGly: 4.047 ± 1.83
0.809GlyHis: 0.809 ± 0.485
4.249GlyIle: 4.249 ± 0.307
2.833GlyLys: 2.833 ± 1.12
4.856GlyLeu: 4.856 ± 0.533
1.214GlyMet: 1.214 ± 1.108
2.833GlyAsn: 2.833 ± 0.927
1.821GlyPro: 1.821 ± 0.998
2.226GlyGln: 2.226 ± 0.5
4.452GlyArg: 4.452 ± 1.425
5.463GlySer: 5.463 ± 1.923
2.833GlyThr: 2.833 ± 1.547
4.249GlyVal: 4.249 ± 0.843
0.0GlyTrp: 0.0 ± 0.0
1.012GlyTyr: 1.012 ± 0.48
0.0GlyXaa: 0.0 ± 0.0
His
1.619HisAla: 1.619 ± 0.57
0.0HisCys: 0.0 ± 0.0
0.809HisAsp: 0.809 ± 0.498
0.607HisGlu: 0.607 ± 0.286
0.607HisPhe: 0.607 ± 0.25
0.202HisGly: 0.202 ± 0.125
1.214HisHis: 1.214 ± 0.56
1.012HisIle: 1.012 ± 0.33
1.012HisLys: 1.012 ± 0.511
3.035HisLeu: 3.035 ± 1.217
0.809HisMet: 0.809 ± 0.616
1.214HisAsn: 1.214 ± 0.36
1.619HisPro: 1.619 ± 0.641
0.405HisGln: 0.405 ± 0.249
0.809HisArg: 0.809 ± 0.35
1.214HisSer: 1.214 ± 0.299
0.809HisThr: 0.809 ± 0.328
1.214HisVal: 1.214 ± 0.288
0.405HisTrp: 0.405 ± 0.195
1.012HisTyr: 1.012 ± 0.463
0.0HisXaa: 0.0 ± 0.0
Ile
6.677IleAla: 6.677 ± 1.215
0.202IleCys: 0.202 ± 0.125
3.845IleAsp: 3.845 ± 1.056
4.249IleGlu: 4.249 ± 1.148
1.214IlePhe: 1.214 ± 0.445
3.238IleGly: 3.238 ± 0.756
2.226IleHis: 2.226 ± 0.852
6.475IleIle: 6.475 ± 0.953
4.047IleLys: 4.047 ± 1.36
5.666IleLeu: 5.666 ± 0.842
1.416IleMet: 1.416 ± 0.358
2.226IleAsn: 2.226 ± 0.944
3.845IlePro: 3.845 ± 0.78
4.452IleGln: 4.452 ± 1.688
2.833IleArg: 2.833 ± 0.739
5.463IleSer: 5.463 ± 1.607
6.273IleThr: 6.273 ± 1.345
4.047IleVal: 4.047 ± 0.731
0.607IleTrp: 0.607 ± 0.374
1.214IleTyr: 1.214 ± 0.405
0.0IleXaa: 0.0 ± 0.0
Lys
4.856LysAla: 4.856 ± 1.531
1.012LysCys: 1.012 ± 0.474
2.428LysAsp: 2.428 ± 0.817
2.226LysGlu: 2.226 ± 0.958
1.214LysPhe: 1.214 ± 0.484
3.035LysGly: 3.035 ± 0.995
0.607LysHis: 0.607 ± 0.374
4.452LysIle: 4.452 ± 1.571
2.428LysLys: 2.428 ± 0.985
5.463LysLeu: 5.463 ± 1.028
0.607LysMet: 0.607 ± 0.288
2.428LysAsn: 2.428 ± 0.985
2.023LysPro: 2.023 ± 0.867
1.619LysGln: 1.619 ± 0.658
3.642LysArg: 3.642 ± 0.703
2.023LysSer: 2.023 ± 0.516
3.642LysThr: 3.642 ± 0.928
2.226LysVal: 2.226 ± 0.58
0.405LysTrp: 0.405 ± 0.249
2.428LysTyr: 2.428 ± 1.292
0.0LysXaa: 0.0 ± 0.0
Leu
10.522LeuAla: 10.522 ± 1.632
2.428LeuCys: 2.428 ± 0.769
4.249LeuAsp: 4.249 ± 0.838
4.452LeuGlu: 4.452 ± 1.557
3.238LeuPhe: 3.238 ± 0.992
5.261LeuGly: 5.261 ± 0.479
1.821LeuHis: 1.821 ± 0.731
6.475LeuIle: 6.475 ± 0.806
5.868LeuLys: 5.868 ± 1.149
10.522LeuLeu: 10.522 ± 1.095
3.642LeuMet: 3.642 ± 0.885
5.261LeuAsn: 5.261 ± 0.882
5.666LeuPro: 5.666 ± 0.846
2.833LeuGln: 2.833 ± 0.826
4.249LeuArg: 4.249 ± 0.778
9.51LeuSer: 9.51 ± 2.448
11.938LeuThr: 11.938 ± 2.063
5.463LeuVal: 5.463 ± 1.718
1.619LeuTrp: 1.619 ± 0.612
3.035LeuTyr: 3.035 ± 1.561
0.0LeuXaa: 0.0 ± 0.0
Met
2.226MetAla: 2.226 ± 0.884
0.202MetCys: 0.202 ± 0.125
0.607MetAsp: 0.607 ± 0.554
1.214MetGlu: 1.214 ± 0.502
0.202MetPhe: 0.202 ± 0.251
1.619MetGly: 1.619 ± 1.24
0.0MetHis: 0.0 ± 0.0
2.023MetIle: 2.023 ± 0.487
1.012MetLys: 1.012 ± 0.431
1.619MetLeu: 1.619 ± 0.599
1.012MetMet: 1.012 ± 0.552
1.214MetAsn: 1.214 ± 0.405
0.607MetPro: 0.607 ± 0.288
1.416MetGln: 1.416 ± 0.464
1.619MetArg: 1.619 ± 0.406
2.631MetSer: 2.631 ± 1.074
1.821MetThr: 1.821 ± 1.16
1.214MetVal: 1.214 ± 0.56
0.202MetTrp: 0.202 ± 0.125
1.416MetTyr: 1.416 ± 0.473
0.0MetXaa: 0.0 ± 0.0
Asn
3.642AsnAla: 3.642 ± 0.703
1.214AsnCys: 1.214 ± 0.73
1.821AsnAsp: 1.821 ± 0.689
0.809AsnGlu: 0.809 ± 0.51
1.821AsnPhe: 1.821 ± 0.362
3.845AsnGly: 3.845 ± 0.744
1.214AsnHis: 1.214 ± 0.612
3.238AsnIle: 3.238 ± 1.223
1.821AsnLys: 1.821 ± 0.492
4.856AsnLeu: 4.856 ± 1.189
0.405AsnMet: 0.405 ± 0.417
1.821AsnAsn: 1.821 ± 0.566
5.059AsnPro: 5.059 ± 0.886
2.833AsnGln: 2.833 ± 0.852
2.833AsnArg: 2.833 ± 0.402
2.833AsnSer: 2.833 ± 0.921
3.238AsnThr: 3.238 ± 1.034
2.226AsnVal: 2.226 ± 0.65
1.416AsnTrp: 1.416 ± 0.56
1.821AsnTyr: 1.821 ± 0.32
0.0AsnXaa: 0.0 ± 0.0
Pro
4.047ProAla: 4.047 ± 1.658
0.202ProCys: 0.202 ± 0.224
2.833ProAsp: 2.833 ± 0.71
1.821ProGlu: 1.821 ± 0.355
2.631ProPhe: 2.631 ± 0.713
4.249ProGly: 4.249 ± 0.74
1.012ProHis: 1.012 ± 0.447
3.642ProIle: 3.642 ± 0.921
2.226ProLys: 2.226 ± 0.585
6.88ProLeu: 6.88 ± 0.227
1.012ProMet: 1.012 ± 0.256
3.035ProAsn: 3.035 ± 0.97
3.642ProPro: 3.642 ± 0.783
2.023ProGln: 2.023 ± 0.411
1.214ProArg: 1.214 ± 0.26
3.845ProSer: 3.845 ± 0.909
3.845ProThr: 3.845 ± 1.14
3.035ProVal: 3.035 ± 1.529
0.0ProTrp: 0.0 ± 0.0
2.226ProTyr: 2.226 ± 1.005
0.0ProXaa: 0.0 ± 0.0
Gln
3.845GlnAla: 3.845 ± 0.952
0.809GlnCys: 0.809 ± 0.413
2.226GlnAsp: 2.226 ± 0.895
2.428GlnGlu: 2.428 ± 0.772
0.809GlnPhe: 0.809 ± 0.39
3.845GlnGly: 3.845 ± 1.653
1.416GlnHis: 1.416 ± 0.677
3.44GlnIle: 3.44 ± 0.814
1.214GlnLys: 1.214 ± 0.308
4.856GlnLeu: 4.856 ± 0.931
1.821GlnMet: 1.821 ± 0.593
1.821GlnAsn: 1.821 ± 0.775
1.214GlnPro: 1.214 ± 0.738
2.428GlnGln: 2.428 ± 1.18
1.619GlnArg: 1.619 ± 0.418
4.452GlnSer: 4.452 ± 0.894
0.809GlnThr: 0.809 ± 0.371
2.226GlnVal: 2.226 ± 0.899
0.607GlnTrp: 0.607 ± 0.239
1.416GlnTyr: 1.416 ± 0.593
0.0GlnXaa: 0.0 ± 0.0
Arg
1.619ArgAla: 1.619 ± 0.713
0.607ArgCys: 0.607 ± 0.349
1.619ArgAsp: 1.619 ± 0.641
1.619ArgGlu: 1.619 ± 0.55
1.619ArgPhe: 1.619 ± 0.961
2.023ArgGly: 2.023 ± 0.676
1.619ArgHis: 1.619 ± 0.571
3.44ArgIle: 3.44 ± 0.809
4.047ArgLys: 4.047 ± 0.641
5.059ArgLeu: 5.059 ± 1.101
1.619ArgMet: 1.619 ± 0.493
1.012ArgAsn: 1.012 ± 0.474
2.226ArgPro: 2.226 ± 1.168
1.012ArgGln: 1.012 ± 0.512
2.428ArgArg: 2.428 ± 0.471
4.856ArgSer: 4.856 ± 0.873
3.035ArgThr: 3.035 ± 0.916
3.035ArgVal: 3.035 ± 0.914
0.607ArgTrp: 0.607 ± 0.28
0.809ArgTyr: 0.809 ± 0.371
0.0ArgXaa: 0.0 ± 0.0
Ser
4.856SerAla: 4.856 ± 1.367
1.416SerCys: 1.416 ± 0.325
2.631SerAsp: 2.631 ± 0.59
3.845SerGlu: 3.845 ± 1.361
2.631SerPhe: 2.631 ± 0.53
5.463SerGly: 5.463 ± 1.614
1.416SerHis: 1.416 ± 0.298
5.059SerIle: 5.059 ± 1.071
4.452SerLys: 4.452 ± 0.913
10.32SerLeu: 10.32 ± 1.59
1.619SerMet: 1.619 ± 0.246
4.452SerAsn: 4.452 ± 1.015
3.845SerPro: 3.845 ± 0.898
3.642SerGln: 3.642 ± 0.884
3.44SerArg: 3.44 ± 0.585
9.51SerSer: 9.51 ± 1.225
3.035SerThr: 3.035 ± 0.507
4.856SerVal: 4.856 ± 0.901
1.214SerTrp: 1.214 ± 0.548
3.642SerTyr: 3.642 ± 0.788
0.0SerXaa: 0.0 ± 0.0
Thr
6.677ThrAla: 6.677 ± 2.044
1.416ThrCys: 1.416 ± 0.986
3.845ThrAsp: 3.845 ± 1.246
2.631ThrGlu: 2.631 ± 0.283
2.631ThrPhe: 2.631 ± 0.526
3.238ThrGly: 3.238 ± 1.243
1.012ThrHis: 1.012 ± 0.418
5.868ThrIle: 5.868 ± 0.522
3.035ThrLys: 3.035 ± 0.613
7.082ThrLeu: 7.082 ± 1.426
2.023ThrMet: 2.023 ± 0.64
3.035ThrAsn: 3.035 ± 0.431
4.654ThrPro: 4.654 ± 1.246
2.226ThrGln: 2.226 ± 0.538
3.44ThrArg: 3.44 ± 0.655
4.047ThrSer: 4.047 ± 0.866
4.249ThrThr: 4.249 ± 0.873
3.642ThrVal: 3.642 ± 1.15
0.607ThrTrp: 0.607 ± 0.288
2.428ThrTyr: 2.428 ± 1.168
0.0ThrXaa: 0.0 ± 0.0
Val
3.238ValAla: 3.238 ± 0.762
1.416ValCys: 1.416 ± 0.496
3.238ValAsp: 3.238 ± 0.915
2.833ValGlu: 2.833 ± 1.404
2.428ValPhe: 2.428 ± 0.398
3.642ValGly: 3.642 ± 0.561
1.214ValHis: 1.214 ± 0.458
4.249ValIle: 4.249 ± 0.897
2.428ValLys: 2.428 ± 1.196
6.273ValLeu: 6.273 ± 0.55
1.214ValMet: 1.214 ± 0.645
2.833ValAsn: 2.833 ± 1.112
4.452ValPro: 4.452 ± 0.504
1.416ValGln: 1.416 ± 0.374
1.619ValArg: 1.619 ± 0.411
2.833ValSer: 2.833 ± 1.062
4.047ValThr: 4.047 ± 0.698
3.845ValVal: 3.845 ± 1.12
0.607ValTrp: 0.607 ± 0.284
2.833ValTyr: 2.833 ± 0.829
0.0ValXaa: 0.0 ± 0.0
Trp
1.012TrpAla: 1.012 ± 0.418
0.405TrpCys: 0.405 ± 0.429
0.607TrpAsp: 0.607 ± 0.374
0.0TrpGlu: 0.0 ± 0.0
0.405TrpPhe: 0.405 ± 0.195
0.607TrpGly: 0.607 ± 0.288
0.0TrpHis: 0.0 ± 0.0
0.809TrpIle: 0.809 ± 0.255
0.607TrpLys: 0.607 ± 0.374
0.809TrpLeu: 0.809 ± 0.498
0.202TrpMet: 0.202 ± 0.125
1.012TrpAsn: 1.012 ± 0.274
0.809TrpPro: 0.809 ± 0.284
0.607TrpGln: 0.607 ± 0.374
0.405TrpArg: 0.405 ± 0.249
1.012TrpSer: 1.012 ± 0.43
0.405TrpThr: 0.405 ± 0.195
0.607TrpVal: 0.607 ± 0.367
0.202TrpTrp: 0.202 ± 0.224
0.202TrpTyr: 0.202 ± 0.125
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.428TyrAla: 2.428 ± 0.991
1.214TyrCys: 1.214 ± 0.415
1.012TyrAsp: 1.012 ± 0.512
2.226TyrGlu: 2.226 ± 0.846
1.214TyrPhe: 1.214 ± 0.421
1.619TyrGly: 1.619 ± 0.362
0.202TyrHis: 0.202 ± 0.224
1.012TyrIle: 1.012 ± 0.48
1.619TyrLys: 1.619 ± 0.498
5.868TyrLeu: 5.868 ± 1.469
1.012TyrMet: 1.012 ± 0.256
2.833TyrAsn: 2.833 ± 0.619
1.012TyrPro: 1.012 ± 0.327
3.238TyrGln: 3.238 ± 0.901
0.202TyrArg: 0.202 ± 0.125
2.631TyrSer: 2.631 ± 0.622
3.238TyrThr: 3.238 ± 0.757
2.226TyrVal: 2.226 ± 0.598
0.202TyrTrp: 0.202 ± 0.125
2.023TyrTyr: 2.023 ± 0.708
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4943 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski