swestrup | Collaborative Semantic Analysis.

I just had a random thought that I figured I should write down before I forget it. When

_sps_ was here on Saturday, I mentioned to him a paper I had seen on the difficulty of storing and retrieving scientific papers that are relevant to a field of research. It has gotten so bad in the field of mathematics that it is now often easier to spend a year re-solving a tricky mathematical problem than it is to find an existing paper with the solution. There is a (woefully underfunded) institute that tries to produce a controlled-vocabulary description of the semantic elements in new papers, and record them. They keep falling further and further behind.

Anyway,

_sps_ had some not-unreasonable ideas on how to encode useful indexes of these math papers so that relevant materials could be searched for. The big question is: how do you do the semantic analysis? For something like Math, you need a human, and one that understands the math as well. Plus, it would help if they just happened to know of all of the other bits of math that the paper overlapped, even if they are in other fields and use different nomenclature.

Anyway, it suddenly occurred to me that it might be possible (I'm not sure how) to design a mathematics-paper search-engine and browser which had the express purpose of eliciting from a mathematician information about the nature of the paper being studied, and how closely its contents matched that mathematicians current work. This would be done, not by asking questions, but by allowing the mathematician to categorize his searches by project, and to pay attention to how long he spent studying various sections of the paper. As well, if we provided various renaming and renomenclaturing systems, we might get further information by observing the transformations that were performed on the paper.

In the end, I would hope the gathered data from a large number of mathematicians could be used to build a fuzzy index of any given paper, and to let us build a map of which things seemed to be close to each other in a semantic space. I don't know, ultimately, how well such a system would work, but I think it would be worth giving it a try.

S	M	T	W	T	F	S
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30	31

Most Popular Tags

advocaat - 2 uses
azrhey - 4 uses
blitzweekend - 4 uses
booze - 2 uses
breakfast - 2 uses
cats - 2 uses
cgi - 2 uses
civ 3 - 4 uses
css - 2 uses
css2 - 2 uses
denzo - 2 uses
der mouse - 3 uses
domesticity - 2 uses
dragon-half - 3 uses
dreams - 2 uses
dsl - 2 uses
games - 4 uses
gaming - 3 uses
hardware - 2 uses
html - 3 uses
ice - 2 uses
injury - 4 uses
insomnia - 8 uses
javascript - 2 uses
knee - 3 uses
linux peeve - 4 uses
mathematica - 4 uses
mci - 3 uses
meme - 13 uses
migraine - 3 uses
names - 2 uses
party - 2 uses
perl - 6 uses
pictures - 3 uses
poetry - 2 uses
rant - 2 uses
resume - 2 uses
ribs - 4 uses
sick - 4 uses
sickness - 10 uses
sleep - 19 uses
sps - 2 uses
torcon - 12 uses
torcon 3 - 2 uses
vertigo - 2 uses
website - 5 uses
work - 5 uses
worldcon - 4 uses
xenobiology - 3 uses
xml::parser - 3 uses

Flat | Top-Level Comments Only

From:

sps.livejournal.com

I can't see this. Bibliographies are political things: academics are promoted based on how often they are cited, so when you are considering what to put in a bibliography you are thinking, is this person my friend? Do I owe them a favour? Do I want them to owe me? Can I leave out this person because I hate them so much, or will they then vote against me on the committee? And even then you can only include things you can remember, which for a mathematician (who is trained to work things out over again) isn't usually much!

Unifying Contradiction

Collaborative Semantic Analysis.

Collaborative Semantic Analysis.

Re: Interesting problem of taxonomy

Profile

January 2017

Most Popular Tags

Page Summary

Style Credit

Expand Cut Tags