Amino acid dipeptide frequency for Inoviridae sp. ctBZ32

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
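As an illustration of the calculation described above, here is a minimal Python sketch that counts dipeptides in a set of protein sequences and reports them in per mille. The function name `dipeptide_permille`, the one-letter input alphabet, and the toy sequences are assumptions made for this example; the database's own implementation may differ, for instance in how pairs at protein boundaries are handled.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); anything else is folded into "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted within each protein only, so no pair
    spans the end of one protein and the start of the next.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400

toy = ["MAAG", "AGV"]  # made-up sequences, not taken from this proteome
freqs = dipeptide_permille(toy)
print(len(freqs), freqs["AG"], UNIFORM_PERMILLE)
```

The returned dictionary always has 441 entries, so dipeptides that never occur appear with a frequency of 0.0, as they do in the table.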

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
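The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names are hypothetical, and the rule used here (strictly above or below 2.5‰) is an assumption about how the over- and underrepresented marking could be decided, since the page does not state the exact threshold.

```python
UNIFORM_PERMILLE = 2.5  # 1/400 expressed in per mille


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Three values taken from the table on this page.
examples = {"ThrThr": 17.711, "AlaAla": 7.493, "CysCys": 0.0}
for name, permille in examples.items():
    print(name, permille_to_fraction(permille), classify(permille))
```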

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.493 ± 4.239
AlaCys: 4.087 ± 2.159
AlaAsp: 4.768 ± 1.079
AlaGlu: 3.406 ± 1.138
AlaPhe: 5.45 ± 1.402
AlaGly: 6.812 ± 2.258
AlaHis: 1.362 ± 0.594
AlaIle: 7.493 ± 1.975
AlaLys: 2.725 ± 0.814
AlaLeu: 10.218 ± 4.48
AlaMet: 5.45 ± 2.027
AlaAsn: 2.725 ± 1.613
AlaPro: 6.131 ± 2.973
AlaGln: 4.087 ± 1.307
AlaArg: 8.174 ± 2.482
AlaSer: 6.131 ± 1.948
AlaThr: 8.856 ± 3.82
AlaVal: 10.899 ± 2.111
AlaTrp: 2.725 ± 1.188
AlaTyr: 2.044 ± 1.19
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.725 ± 1.187
CysCys: 0.0 ± 0.0
CysAsp: 1.362 ± 0.913
CysGlu: 1.362 ± 0.913
CysPhe: 1.362 ± 0.913
CysGly: 0.681 ± 0.457
CysHis: 0.0 ± 0.0
CysIle: 1.362 ± 0.913
CysLys: 0.0 ± 0.0
CysLeu: 0.681 ± 0.457
CysMet: 0.681 ± 0.457
CysAsn: 0.0 ± 0.0
CysPro: 3.406 ± 1.685
CysGln: 1.362 ± 0.99
CysArg: 4.768 ± 1.237
CysSer: 0.0 ± 0.0
CysThr: 2.725 ± 1.826
CysVal: 1.362 ± 1.185
CysTrp: 0.0 ± 0.0
CysTyr: 0.681 ± 0.565
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.493 ± 3.271
AspCys: 0.0 ± 0.0
AspAsp: 1.362 ± 0.913
AspGlu: 1.362 ± 0.594
AspPhe: 0.681 ± 0.65
AspGly: 7.493 ± 3.207
AspHis: 0.0 ± 0.0
AspIle: 1.362 ± 1.961
AspLys: 1.362 ± 1.3
AspLeu: 2.725 ± 1.377
AspMet: 1.362 ± 1.129
AspAsn: 0.681 ± 0.457
AspPro: 3.406 ± 1.451
AspGln: 0.681 ± 0.976
AspArg: 2.725 ± 1.103
AspSer: 2.044 ± 0.961
AspThr: 0.681 ± 0.457
AspVal: 4.768 ± 2.759
AspTrp: 0.681 ± 0.565
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.725 ± 0.94
GluCys: 0.0 ± 0.0
GluAsp: 4.087 ± 2.131
GluGlu: 0.0 ± 0.0
GluPhe: 0.0 ± 0.0
GluGly: 6.131 ± 2.534
GluHis: 0.681 ± 0.565
GluIle: 1.362 ± 1.314
GluLys: 1.362 ± 1.129
GluLeu: 2.725 ± 1.067
GluMet: 0.681 ± 0.981
GluAsn: 0.0 ± 0.0
GluPro: 4.087 ± 2.159
GluGln: 2.044 ± 1.089
GluArg: 2.044 ± 1.37
GluSer: 2.725 ± 1.188
GluThr: 2.044 ± 0.967
GluVal: 3.406 ± 1.138
GluTrp: 2.044 ± 1.251
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 6.812 ± 1.577
PheCys: 0.0 ± 0.0
PheAsp: 3.406 ± 1.088
PheGlu: 0.681 ± 0.565
PhePhe: 0.681 ± 0.565
PheGly: 1.362 ± 0.594
PheHis: 0.0 ± 0.0
PheIle: 1.362 ± 0.99
PheLys: 2.725 ± 2.199
PheLeu: 2.044 ± 0.893
PheMet: 0.681 ± 0.457
PheAsn: 0.0 ± 0.0
PhePro: 1.362 ± 0.721
PheGln: 0.681 ± 0.457
PheArg: 3.406 ± 1.361
PheSer: 0.681 ± 0.457
PheThr: 2.044 ± 1.144
PheVal: 3.406 ± 1.754
PheTrp: 0.681 ± 0.457
PheTyr: 0.681 ± 0.565
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.174 ± 1.867
GlyCys: 0.681 ± 0.457
GlyAsp: 4.087 ± 2.118
GlyGlu: 4.087 ± 1.979
GlyPhe: 7.493 ± 1.382
GlyGly: 12.943 ± 3.904
GlyHis: 0.681 ± 0.565
GlyIle: 4.768 ± 2.105
GlyLys: 1.362 ± 0.594
GlyLeu: 9.537 ± 4.176
GlyMet: 0.681 ± 1.166
GlyAsn: 2.725 ± 1.523
GlyPro: 4.768 ± 1.538
GlyGln: 4.087 ± 1.666
GlyArg: 3.406 ± 2.754
GlySer: 8.856 ± 2.029
GlyThr: 8.174 ± 3.491
GlyVal: 10.899 ± 3.581
GlyTrp: 2.725 ± 1.047
GlyTyr: 2.044 ± 0.893
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 1.362 ± 1.426
HisGlu: 1.362 ± 1.129
HisPhe: 0.0 ± 0.0
HisGly: 2.044 ± 1.251
HisHis: 0.0 ± 0.0
HisIle: 0.681 ± 0.457
HisLys: 0.681 ± 0.565
HisLeu: 1.362 ± 0.594
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.681 ± 0.565
HisGln: 0.681 ± 0.565
HisArg: 2.044 ± 1.222
HisSer: 0.0 ± 0.0
HisThr: 2.044 ± 1.694
HisVal: 4.087 ± 2.24
HisTrp: 0.0 ± 0.0
HisTyr: 0.681 ± 0.65
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.087 ± 0.955
IleCys: 0.0 ± 0.0
IleAsp: 4.768 ± 2.595
IleGlu: 3.406 ± 1.888
IlePhe: 0.0 ± 0.0
IleGly: 4.768 ± 2.979
IleHis: 0.681 ± 0.65
IleIle: 1.362 ± 0.913
IleLys: 1.362 ± 1.059
IleLeu: 3.406 ± 1.724
IleMet: 0.0 ± 0.0
IleAsn: 0.0 ± 0.0
IlePro: 4.087 ± 1.398
IleGln: 0.681 ± 0.457
IleArg: 2.044 ± 1.19
IleSer: 0.681 ± 0.457
IleThr: 2.725 ± 2.088
IleVal: 2.725 ± 1.178
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.406 ± 1.823
LysCys: 0.681 ± 0.457
LysAsp: 0.681 ± 0.565
LysGlu: 1.362 ± 1.129
LysPhe: 0.0 ± 0.0
LysGly: 4.087 ± 1.915
LysHis: 0.681 ± 1.113
LysIle: 1.362 ± 1.129
LysLys: 1.362 ± 0.99
LysLeu: 2.725 ± 1.253
LysMet: 0.0 ± 0.0
LysAsn: 0.681 ± 0.457
LysPro: 0.681 ± 0.981
LysGln: 0.0 ± 0.0
LysArg: 2.044 ± 1.024
LysSer: 2.044 ± 1.251
LysThr: 1.362 ± 1.055
LysVal: 1.362 ± 1.614
LysTrp: 0.681 ± 1.113
LysTyr: 1.362 ± 0.633
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.493 ± 4.778
LeuCys: 0.681 ± 0.457
LeuAsp: 1.362 ± 1.952
LeuGlu: 2.725 ± 1.253
LeuPhe: 0.0 ± 0.0
LeuGly: 5.45 ± 2.44
LeuHis: 0.681 ± 0.565
LeuIle: 2.044 ± 1.379
LeuLys: 0.681 ± 0.976
LeuLeu: 7.493 ± 5.42
LeuMet: 0.681 ± 0.65
LeuAsn: 2.725 ± 1.826
LeuPro: 7.493 ± 3.012
LeuGln: 4.768 ± 1.821
LeuArg: 8.174 ± 2.933
LeuSer: 6.131 ± 1.529
LeuThr: 8.174 ± 3.003
LeuVal: 7.493 ± 2.45
LeuTrp: 1.362 ± 1.3
LeuTyr: 0.681 ± 0.565
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.044 ± 1.248
MetCys: 1.362 ± 0.594
MetAsp: 0.0 ± 0.0
MetGlu: 0.681 ± 0.65
MetPhe: 0.681 ± 1.113
MetGly: 0.681 ± 0.457
MetHis: 0.0 ± 0.0
MetIle: 0.681 ± 0.565
MetLys: 0.681 ± 0.976
MetLeu: 2.044 ± 1.882
MetMet: 0.681 ± 1.069
MetAsn: 0.681 ± 0.457
MetPro: 4.087 ± 1.401
MetGln: 0.681 ± 0.981
MetArg: 1.362 ± 1.336
MetSer: 0.681 ± 0.565
MetThr: 0.681 ± 0.565
MetVal: 1.362 ± 0.721
MetTrp: 0.0 ± 0.0
MetTyr: 0.681 ± 0.65
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.725 ± 1.462
AsnCys: 1.362 ± 0.913
AsnAsp: 0.0 ± 0.0
AsnGlu: 0.0 ± 0.0
AsnPhe: 0.0 ± 0.0
AsnGly: 3.406 ± 1.72
AsnHis: 0.0 ± 0.0
AsnIle: 0.681 ± 0.457
AsnLys: 0.681 ± 0.457
AsnLeu: 0.681 ± 0.457
AsnMet: 0.0 ± 0.0
AsnAsn: 0.0 ± 0.0
AsnPro: 2.044 ± 1.37
AsnGln: 1.362 ± 1.15
AsnArg: 2.725 ± 1.191
AsnSer: 2.044 ± 1.233
AsnThr: 0.681 ± 0.457
AsnVal: 1.362 ± 0.594
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 12.943 ± 4.032
ProCys: 1.362 ± 0.913
ProAsp: 3.406 ± 1.266
ProGlu: 4.768 ± 2.178
ProPhe: 4.087 ± 1.956
ProGly: 8.856 ± 2.154
ProHis: 2.044 ± 1.396
ProIle: 1.362 ± 1.108
ProLys: 2.725 ± 1.621
ProLeu: 2.725 ± 1.067
ProMet: 0.681 ± 0.565
ProAsn: 2.725 ± 1.293
ProPro: 5.45 ± 2.39
ProGln: 2.725 ± 1.583
ProArg: 4.087 ± 1.149
ProSer: 3.406 ± 1.064
ProThr: 2.725 ± 1.293
ProVal: 8.856 ± 1.872
ProTrp: 0.681 ± 0.565
ProTyr: 0.681 ± 0.565
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.131 ± 1.934
GlnCys: 2.725 ± 1.826
GlnAsp: 0.0 ± 0.0
GlnGlu: 0.681 ± 0.565
GlnPhe: 2.044 ± 1.933
GlnGly: 4.087 ± 1.94
GlnHis: 2.044 ± 0.569
GlnIle: 0.681 ± 0.981
GlnLys: 0.681 ± 0.981
GlnLeu: 2.725 ± 2.868
GlnMet: 0.681 ± 0.565
GlnAsn: 0.681 ± 0.457
GlnPro: 4.768 ± 2.015
GlnGln: 4.087 ± 1.566
GlnArg: 2.725 ± 1.067
GlnSer: 0.681 ± 0.457
GlnThr: 0.681 ± 0.457
GlnVal: 2.725 ± 1.642
GlnTrp: 1.362 ± 0.721
GlnTyr: 1.362 ± 0.913
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.131 ± 3.835
ArgCys: 3.406 ± 1.63
ArgAsp: 2.044 ± 0.893
ArgGlu: 3.406 ± 2.4
ArgPhe: 2.725 ± 1.267
ArgGly: 6.812 ± 3.082
ArgHis: 2.044 ± 1.694
ArgIle: 2.725 ± 1.999
ArgLys: 1.362 ± 0.721
ArgLeu: 5.45 ± 2.445
ArgMet: 1.362 ± 1.112
ArgAsn: 1.362 ± 0.594
ArgPro: 4.087 ± 2.24
ArgGln: 2.725 ± 2.349
ArgArg: 7.493 ± 2.925
ArgSer: 2.725 ± 1.191
ArgThr: 8.174 ± 2.939
ArgVal: 6.131 ± 2.217
ArgTrp: 0.681 ± 0.65
ArgTyr: 0.681 ± 0.65
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.493 ± 2.662
SerCys: 0.681 ± 0.457
SerAsp: 0.681 ± 0.457
SerGlu: 2.044 ± 0.893
SerPhe: 2.044 ± 1.37
SerGly: 8.174 ± 2.459
SerHis: 1.362 ± 0.721
SerIle: 2.044 ± 0.893
SerLys: 2.725 ± 0.814
SerLeu: 4.087 ± 3.26
SerMet: 1.362 ± 1.062
SerAsn: 0.681 ± 0.565
SerPro: 2.044 ± 1.73
SerGln: 2.725 ± 1.253
SerArg: 5.45 ± 1.937
SerSer: 4.768 ± 1.656
SerThr: 4.087 ± 1.496
SerVal: 3.406 ± 1.088
SerTrp: 0.0 ± 0.0
SerTyr: 1.362 ± 0.99
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.768 ± 1.937
ThrCys: 3.406 ± 1.961
ThrAsp: 1.362 ± 0.913
ThrGlu: 3.406 ± 1.266
ThrPhe: 0.681 ± 0.976
ThrGly: 8.856 ± 2.053
ThrHis: 2.044 ± 1.694
ThrIle: 1.362 ± 1.175
ThrLys: 2.044 ± 1.383
ThrLeu: 8.856 ± 2.655
ThrMet: 1.362 ± 0.961
ThrAsn: 2.044 ± 1.37
ThrPro: 6.812 ± 2.351
ThrGln: 3.406 ± 1.63
ThrArg: 2.725 ± 1.459
ThrSer: 5.45 ± 1.682
ThrThr: 17.711 ± 11.248
ThrVal: 4.087 ± 2.462
ThrTrp: 0.681 ± 0.457
ThrTyr: 0.681 ± 0.65
ThrXaa: 0.0 ± 0.0
Val
ValAla: 14.986 ± 3.358
ValCys: 3.406 ± 1.881
ValAsp: 5.45 ± 3.675
ValGlu: 2.725 ± 1.576
ValPhe: 4.087 ± 2.259
ValGly: 8.174 ± 1.299
ValHis: 2.044 ± 1.251
ValIle: 3.406 ± 2.025
ValLys: 1.362 ± 0.99
ValLeu: 4.768 ± 1.826
ValMet: 1.362 ± 1.952
ValAsn: 2.044 ± 1.295
ValPro: 7.493 ± 2.404
ValGln: 3.406 ± 1.431
ValArg: 4.768 ± 2.263
ValSer: 6.812 ± 2.451
ValThr: 4.087 ± 1.597
ValVal: 8.856 ± 2.869
ValTrp: 2.725 ± 1.585
ValTyr: 0.681 ± 0.981
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.044 ± 1.73
TrpCys: 0.681 ± 0.65
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.681 ± 0.457
TrpPhe: 0.681 ± 0.565
TrpGly: 1.362 ± 0.721
TrpHis: 0.681 ± 0.565
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.681 ± 0.65
TrpMet: 1.362 ± 0.721
TrpAsn: 0.0 ± 0.0
TrpPro: 2.044 ± 1.2
TrpGln: 0.681 ± 0.565
TrpArg: 0.681 ± 0.457
TrpSer: 0.681 ± 0.457
TrpThr: 1.362 ± 0.594
TrpVal: 3.406 ± 1.361
TrpTrp: 1.362 ± 1.3
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.044 ± 0.893
TyrCys: 0.0 ± 0.0
TyrAsp: 0.681 ± 0.981
TyrGlu: 0.0 ± 0.0
TyrPhe: 0.0 ± 0.0
TyrGly: 0.0 ± 0.0
TyrHis: 0.681 ± 0.65
TyrIle: 0.681 ± 0.457
TyrLys: 0.681 ± 0.565
TyrLeu: 1.362 ± 1.058
TyrMet: 0.0 ± 0.0
TyrAsn: 0.0 ± 0.0
TyrPro: 1.362 ± 0.721
TyrGln: 0.681 ± 0.457
TyrArg: 0.681 ± 0.65
TyrSer: 0.681 ± 0.565
TyrThr: 2.725 ± 1.293
TyrVal: 2.044 ± 0.953
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1469 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
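One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption about the procedure, not the database's documented implementation; the function names, the use of the population standard deviation, and the fixed seed are all choices made for this example.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    total = sum(max(len(s) - 1, 0) for s in proteins)
    hits = sum(
        1 for s in proteins for i in range(len(s) - 1) if s[i:i + 2] == pair
    )
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over resampled protein sets.

    Each round draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


toy = ["MAAG", "AGV", "GGAG"]  # made-up sequences, not from this proteome
print(pair_permille(toy, "AG"), bootstrap_error(toy, "AG"))
```

Under this scheme a dipeptide that never occurs has a frequency of 0 in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.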

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski