search

You are currently browsing articles tagged search.

Dan Schultz doesn’t like Facebook or Twitter because they are too focussed on individual expression rather than the community.

That may be because he is using them wrong. I liked photography as a kid, but I didn’t know any photographers. Flickr happened to come into my life just after I bought my first digital camera. My participation in photography exploded. Not because I had a way to post my photos but because I had a way to find other local photographers for mutual encouragement. Even better was forming local groups to encourage people to meet. The value of Flickr is developing the community.

Worldwide Photowalk Panorama

Similarly, I got into Twitter because my community, peers at other universities running the same software as myself, were seeking help there. Any place with answers to the problems we face, which is where people with the answers are watching, is where we go. Twitter was the place to get the attention of the right people not a forum like phpBB. (There are already lots of email lists.) My other community, people using the software I run are also on Twitter. I’ve resolved issues for many clients by finding their public complaints and offering solutions. When my focus changed away from using Twitter for the community is when I stopped liking Twitter.

Personally, I have yet to find much sense of community in the phpBB, Google Wave, and Ning. So I find it strange these are the exemplars of community applications. They seem fractured so one finds dozens of groups to covering the same interest. Sometimes this is because some moderator upset a portion of the community with draconian behavior causing people to form an alternative community. Bad blood exists for a while. Other times people set up a new community unaware others exist.


Related posts

Rather than depend on end users to accurately report the browser used, I look for the user-agent in the web server logs. (Yes, I know it can be spoofed. Power users would be trying different things to resolve their own issues not coming to us.)

Followers of this blog may recall I changed the Weblogic config.xml to record user agents to the webserver.log.

One trick I use is the double quotes in awk to identify just the user agent. This information is then sorting by name to count (uniq -c) how many of each is present. Finally, I sort again by number with the largest at the top to see which are the most common.

grep <term> webserver.log | awk -F\” ‘{print $2}’ | sort | uniq -c | sort -n -r

This is what I will use looking for a specific user. If I am looking at a wider range, such as the user age for hits on a page, then I probably will use the head command to look at the top 20.

A “feature” of this is getting the build (Firefox 3.011) rather than just the version (Firefox 3). For getting the version, I tend to use something more like this to count the found version out of the log.

grep <term> webserver.log | awk -F\” ‘{print $2}’ | grep -c ‘<version>’

I have yet to see many CE/Vista URIs with the names of web browsers. So these are the most common versions one would likely find (what to grep – name – notes):

  1. MSIE # – Microsoft Internet Explorer – I’ve seen 5 through 8 in the last few months.
  2. Firefox # – Mozilla Firefox – I’ve seen 2 through 3.5. There is enough difference between 3 and 3.5 (also 2 and 2.5) I would count them separately.
  3. Safari – Apple/WebKit – In searching for this one, I would add to the search a ‘grep -v Chrome’ or to eliminate Google Chrome user agents.
  4. Chrome # – Google Chrome – Only versions 1 and 2.

Naturally there many, many others. It surprised me to see iPhone and Android on the list.


Related posts

Blackboard has a conference they call BbWorld. I noticed there are some odd tweets with the same #bbworld hashtag lately. These appear to be about a Blackberry conference to be held next month.

Collisions on names are common enough. For example, here are a couple names our clients use to brand their sites which other places also use.

My own project, GeorgiaVIEW is not immune. Some time ago I noticed the GeorgiaView Consortium (geological remote sensing) at the University of West Georgia.

I guess it is a good thing one Bbworld is in July and the other is in September.

For now I’ll just drop my RSS feed for the hashtag.


Related posts

Turnitin.com

I’m surprised I have not blogged here about the student lawsuit against Turnitin.com? An anti-plagiarism service, Turnitin has students or faculty members upload papers into the database. By comparing new papers to the database, it gives ratings as to whether it is likely a student plagiarized.

Now the search goes out for any student who has a paper that’s being held by TurnItIn that they did not upload themselves. Students Settle with TurnItIn

In theory I could be someone in this situation. Back in 2005, a coworker asked my mother if someone by my name was related to her. This coworker was taking some classes at the university I attended. Turnitin had threw up a cautionary flag on the Originality Report because it was somewhat similar to something with my name on it. The problem is this product came into use at the university after the time I was a student. So I never submitted anything to it. The department from which I got my degree kept a copy of my papers (many submitted by email) and used this product at the time.

Another possibility is this tidbit about the product: Over 11 Billion Web Pages Crawled & Archived. I was actively blogging before and at the time of the incident. Assuming it could identify my name out of all that content, this match could have come from my blogging.

When I contacted Turnitin about this back in 2005, they told me I would have to remove my paper. I re-explained that I didn’t submit the paper. So Turnitin explained that whoever did put the paper in the system would have to remove it. The guy acknowledged the difficulty of the situation in identifying who posted it.


Related posts

Walking home from the bus in high school, I saw police cars and officers in front of my house. Their presence made me extremely apprehensive. The only little assurance was my father talking to the officers. Someone broke into the house and stole some of our stuff.

We felt violated. Our own home was unsafe.

At the time however, the people with guns with a tendency to keep their hands near them were much more threatening than some anonymous teen who wanted some quick cash.

Police officers are the good guys.

Take this scenario:

  1. You’ve spent almost a full day on a plane or in airports flying from Shanghai to Boston so you are extremely jet-lagged.
  2. (SUGGESTED ADDITION) You picked up the flu while in China (remember Avian Bird Flu?).
  3. Your front door won’t open when you get home, so you end up gaining access to the house from the back door. Eventually with help you do get it opened.
  4. While calling someone to come fix the door, a police officer shows up to question you about being the owner of the house. (Let’s ignore that Harvard owns it. You just reside there.)

This is like Alexander and the Terrible, Horrible, No Good, Very Bad Day: “Nothing at all was right.” Except… This state of mind was interpreted by the police officer this way:

“From the time he opened the door it seemed that he was very upset, very put off that I was there in the first place,” Sergeant Crowley told the station, WEEI. “Not just what he said, but the tone in which he said it, just seemed very peculiar — even more so now that I know how educated he is.” NYT

This seems like the perfect opportunity to ask questions about Dr. Gates’ day to establish something of a rapport to ascertain why he might be so upset. It’s not so peculiar when the context is known. I bet if all this had been placed in context at the time, then this would not be front page news.


Related posts

Good Sign I missed the story about brothers convicted of harvesting emails the first time. Well, I noticed a followup.

Back around 2001, the CIO received complaints about performance for the web server. So, I went log trolling to see what the web server was doing. A single IP dominated the HTTP requests. This one IP passed various last names into the email directory. Some quick research revealed Apache could block requests from that IP. That calmed things down enough for me to identify the owner of the IP. The CIO then bullied the ISP to provide contact information for the company involved.

Previous little adventures like this landed me a permanent job, so I jumped at similar challenges.

Well, a few years later, it happened again. This time my boss had made me develop a script for the dissemination of the anti-virus software package to home users. Basically, it used email authentication for verification if someone could get the download link. So, I applied the same technique to the email directory. Well, this upset some people who legitimately needed email addresses. So the human workers would provide email addresses to people with a legitimate need.

I’m glad since I’ve left, VSU no longer looks up email addresses for people. (I thought some of the requests questionable.) Also, my little email authentication script was before LDAP was available to the university. I think the new solution much better.

One the more vocal complainers about my having stopped non-VSU access to the email directory was my current employer. We apparently list email addresses for employees freely. Which makes me wonder how much spam we get is due to the brothers described at the beginning of this story? Or other email harvesters? Just hitting the send button potentially exposes the email address.

No worries. I’m sure Glenn is protecting me. :)


Related posts

If you cannot find me, then you are not looking. If you search on Facebook for Ezra Freelove, then I am the only result at the moment. Maybe all you knew was Ezra and the city where I lived? Facebook search is not so great you could find me through my first name plus something else you knew about me (other than email or city). Probably this is for the best. We don’t want to make it too easy to stalk people, right?

Allowing users to make a username is a promotion. The blogosphere making a fuss over all this is a Chicken Little-esque. Sure Myspace, Twitter, and a number of other sites have addresses with usernames in them. No one is forcing people opposed to having one to make one. Only in the past month could one choose a username for one’s Google profile. Prior to that it was a hefty large number of numbers.

I think the reason some people prefer usernames comes down to elaborative encoding. To retain something in memory, we associate that something with existing items in memory. Short-term memory has only about 7 slots and digits are each a single item. Assuming a single incrementation per account created and over 200 million users, using a numbers means there ought to be 9 digits worth of numbers to memorize. Words occupy a single slot in short term memory, by far simplifying remembering. Which would you rather try to remember 46202460 or ezrasf?

An argument against usernames comes down to using the memory of the Facebook database or other computer memory. Computer memory is better than human memory for stuff like this.

All of these work and go to the same place:

  1. http://www.facebook.com/profile.php?id=46202460
  2. http://www.facebook.com/ezrasf
  3. http://www.ezrasf.com/fb

Pick your poison. Enjoy.


Related posts

From 2001 to 2006, Microsoft Outlook was the email client I used for work (and on my home computer to access work stuff). Back then, Exchange was not available, so a number of the features were more hacks than reality. However, it worked pretty well.

When I changed jobs, Netscape and Thunderbird were the pre-installed clients. I opted for Thunderbird. It worked pretty well for me. Calendaring was in MeetingMaker. Everything worked pretty well.

Recently work shifted to Exchange, so going back to Outlook made sense. Maybe because I have so much experience, the transition was not as bad as it might have been. Still… These are gotchas which have annoyed me lately:

  1. Editable subject usability: The emails from our client issue tracking system put the description where its hidden. I was really pissed that I could not edit the subject until I figured out unlike most software which changes the shading to show it is now editable, Outlook just lets me edit at any time. Also, editing the subject after it is used by something else like a task results in the change in the email but not the task. (The main reason I want to change them is so it appears correctly in the task list. ) Copying to a second email results in the same problem. Apparently I have to either create a new task and copy-n-paste the subject I want or forward the email to myself.
  2. Spacebar moves to next message instead of next new message: I really like the Thunderbird method of skipping to the next unread message when I hit the spacebar at the end of the current message. It even will find the next unread message in another folder. Outlook just advances to the next message.
  3. Boolean is more than OR: I had this fantastic Thunderbird filter which looked for user@ AND domain.tld. Outlook only honors OR. We have 15 admin nodes and databases which send up reports. Alerts and tickets come from a different source and unaffected by this.
  4. Search ignores special characters: I thought in the past I had sent email to abc-defghi@domain.tld. However, the message bounced, so I searched my email for part of the address “abc-defghi” as its not in the address book. I got results which match “abc” not “abc-defghi”. So it ignored the hyphen and everything after. FAIL!
  5. Send email as plain text or paste a plain text: Yes, I know lots of people have HTML capable clients. I hate Outlook puts my replies in a sickly blue font. When I copy and paste from the elsewhere in the message, it changes the font. So then I have to go and do formatting to have a presentable email. I just want to type and send. I don’t care about fonts, colors, etc. If I did, then I would create a web page. … (Added 2009-JUN-03)

That’s it for now.


Related posts

(This started out as a blog comment for Sania’s post Facebook Killed Your Blog. I’m posting it here first.)

We share blogs with the whole world. So our blogs get lost in the noise, bolstering the need for a whole industry optimizing getting found in search engines. Its a concerted effort just get noticed. That’s because blog readers have to seek out blogs to follow, subscribe to the feed, and follow. Finding the best blogs to read is sometimes difficult and more from word of mouth than anything search engines provide.

Blogs also tend to have a lot of information to digest. Social networks have just a line or two with maybe a link to more information. Blog readers typically are designed around the idea of collecting all the posts and letting the user pick which to read. Social networks typically are designed around the idea of just showing recent posts and letting the users choose how far back in time to read.

As technologies lower the costs to express ideas (aka get easier), blogs will get left behind as they have become upside down in value. The costs of writings, reading, subscribing, and commenting on blogs are more expensive compared to micro-blogging or status updates.

Why blog when hanging out on social networks are so much easier? Blogs can only survive as long as they have information worthy.

Why blog when readers are no longer reading? Posting blog entries on social networks does help keep traffic levels somewhat by getting exposure.

As bloggers providing valuable expression leave blogging, the value of blogs decrease. People will still blog. It just won’t be the popular thing to do.


Related posts

When I preface statements with something like “Used Google to look up <search term>”, it means:

The following is my interpretation of what you are asking. You should do your own research on the search term to better understand the issue. I’m telling you the search term I used to nudge you in what I think is the right direction.

Probably I spent < 10 minutes doing my research. So the quality isn’t of the highest standard. I’m not speaking with authority… Just trying to be helpful.

Update: I don’t generally mean it to be condescending… As in, you should have checked Google before asking anyone about it.


Related posts

« Older entries